ansaurus

Question

Can this be done faster (read file, substitute [sed], write new file)

Answer 1

+3 A:

This is slow because of the loop in bash. If you can get sed/awk/perl/etc to do the loop, it will be much faster. I can't see how you can do it in sed or awk though. It's probably pretty easy for perl, but I dont know enough perl to answer that for you.

At the very least, you should be able to save a little time by refactoring what you have to:

sed 's,[0-9A-Z]\{2\},\\\\x&,g' ${in_file} | while read line; do
 printf '%b\000\000' ${line} 
done >> ${out_file}

At least this way, you're running printf once per iteration and opening/closing ${out_file} once only.

camh 2010-09-12 10:53:12

+1 for pointing out that multiple redirections to the same file is slower than just one (read: having common sense).

amphetamachine 2010-09-12 11:01:03

You mean it should be **printf '%b${line}\000\000'** because **'\000\000'** becomes after **printf "%b" ${line}**

Robertico 2010-09-12 11:07:42

@Robertico: No, I meant it as I wrote it. '%b\000\000' is the format string, ${line} is the argument consumed by %b.

camh 2010-09-12 11:18:50

@camh: Thx, I'll give it a try.

Robertico 2010-09-12 11:24:46

Answer 2

+2 A:

Switch to a full programming language? Here's a Ruby one-liner:

ruby -ne 'print "#{$_.chomp.gsub(/[0-9A-F]{2}/) { |s| s.to_i(16).chr }}\x00\x00"'

llasram 2010-09-12 10:56:27

@llasram: Thx, but i love it here :-)

Robertico 2010-09-12 11:16:16

Answer 3

+4 A:

You need xxd command that comes with Vim.

export LANG=C
sed 's/$/0000/' ${in_file} | xxd -r -ps > ${out_file}

LatinSuD 2010-09-12 11:12:08

@LatinSuD: Thx, I'll give it a try.

Robertico 2010-09-12 11:26:00

+1: I had never considered that `xxd` could be used in reverse!

Johnsyweb 2010-09-12 11:26:57

@LatinSuD: Thx !!. You're the winner.

Robertico 2010-09-12 14:03:11

Answer 4

A:

if you have Python and assuming data is simple

$ cat file
99
AB

script:

o=open("outfile","w")
for line in open("file"):
    s=chr(int(line.rstrip(),16))+chr(000)+chr(000)
    o.write(s)
o.close()

ghostdog74 2010-09-12 11:28:39

Answer 5

+1 A:

And the winner is:

sed 's,[0-9A-Z]\{2\},\\\\x&,g' ${in_file} | while read line; do
    printf "%b" ${line} >> ${out_file}
    printf '\000\000' >> ${out_file}
done

real 44m27.021s
user 29m17.640s
sys 15m1.070s

sed 's,[0-9A-Z]\{2\},\\\\x&,g' ${in_file} | while read line; do
    printf '%b\000\000' ${line} 
done >> ${out_file}

real 18m50.288s
user 8m46.400s
sys 10m10.170s

export LANG=C
sed 's/$/0000/' ${in_file} | xxd -r -ps >> ${out_file}

real 0m31.528s
user 0m1.850s
sys 0m29.450s

Robertico 2010-09-12 13:31:04

ansaurus

tags:

views:

answers:

Can this be done faster (read file, substitute [sed], write new file)

And the winner is:

related questions