views: 168
answers: 7

I am looking into changing the file names of hundreds of files in a C/C++ project that I work on. The problem is that our software has tens of thousands of files that include (i.e. #include) these hundreds of files that will get changed. This looks like a maintenance nightmare. If I do this I will be stuck in UltraEdit for weeks, hand-rolling hundreds of regexes like so:

^\#include.*["<\\/]stupid_name.*$

with

#include <dir/new_name.h>

Such drudgery would be worse than peeling hundreds of potatoes with a spoon in a sunken submarine in the Antarctic. It would be far better, I think, to put the inputs and outputs into a table like so:

stupid_name.h <-> <dir/new_name.h>
stupid_nameb.h <-> <dir/new_nameb.h>
stupid_namec.h <-> <dir/new_namec.h>

and feed this into a regular expression engine / tool / app / etc...

My Ultimate Question: Is there a tool that will do that?

Bonus Question: Is it multi-threaded?

I looked at quite a few search-and-replace topics here on this website, and found lots of questions that asked a variant of the following:

standard question: Replace one term in N files.

as opposed to:

my question: Replace N terms in N files.

Thanks in advance for any replies.

+1  A: 

I think your idea of putting the old/new names into a single location is a good one. It would certainly reduce the difficulty of maintaining and verifying the changes. It seems like the obvious answer, but using any of the popular scripting languages such as Ruby, Python, or Perl would make this task fairly straightforward. The script could read in the file that holds the old/new replacement pairs, construct the appropriate regular expressions from it, and then process the files that need the replacements.

The script could be written as a multi-threaded utility, although there doesn't seem to be much benefit to that in this situation. If I understand the question, this is basically a one-time job, so high performance does not seem like the top priority.
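
As a rough illustration, such a script might look something like the Perl sketch below. The map file renames.txt, its one-pair-per-line format, and the rest of the details are assumptions of mine, not something given in the question:

#!/usr/bin/perl
# Hypothetical sketch: read old/new pairs from renames.txt (one
# "old_name new_name" pair per line) and rewrite matching #include
# lines in every file named on the command line.
use strict;
use warnings;

my %renames;
open my $tbl, '<', 'renames.txt' or die "renames.txt: $!";
while (<$tbl>) {
    my ($old, $new) = split;
    $renames{$old} = $new if defined $new;
}
close $tbl;
die "no replacement pairs read\n" unless %renames;

# One alternation that matches any old name, longest names first so a
# short name never matches where a longer one containing it should.
my $alt = join '|', map quotemeta, sort { length $b <=> length $a } keys %renames;

for my $file (@ARGV) {
    open my $in, '<', $file or die "$file: $!";
    my @lines = <$in>;
    close $in;

    my $changed = 0;
    for (@lines) {
        $changed++ if s/^(\s*#\s*include\s*)["<]($alt)[">]/$1<$renames{$2}>/;
    }
    next unless $changed;

    rename $file, "$file.bak" or die "$file: $!";   # keep a backup
    open my $out, '>', $file or die "$file: $!";
    print $out @lines;
    close $out;
}

The file list could come from find, e.g. find /path -name "*.c" -exec perl fix_includes.pl "{}" + (fix_includes.pl being whatever name the script is saved under), so nobody has to maintain the list by hand.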

Mark Wilkins
If I fail to find an available solution, I'll just write my own. I'm comfortable in any of those scripting languages you mentioned. Oh, I forgot to add that all those thousands of files are under version control too. :) And your point about multi-threading not being necessary is perfectly valid.
C Johnson
A: 

Will this (Wingrep) do the trick?

Jeremy
It would, but it only searches for one thing at a time, in multiple files. So that is the case of: replace one thing in N files. I need to replace multiple things in multiple files. UltraEdit (my desert-island programming tool) can already do that very well.
C Johnson
A: 

On *nix (or GNU win32), you can use GNU find and sed together, e.g.:

find /path -type f -name "*.c" -exec sed -i.bak 's/^#include.*["<\\/]stupid_name.*$/#include <dir\/new_name.h>/' "{}" +

Explanation:

The find command searches for regular files (-type f) starting from /path; -name "*.c" matches all .c files. For each file found, sed rewrites any matching #include line to the new string. -i.bak tells sed to save the original file as a .bak backup before editing in place. "{}" stands for the file names found, and + passes them to sed in batches.

ghostdog74
Can you elaborate on what GNU win32 is?
C Johnson
They are mostly *nix tools ported to Windows. See gnuwin32.sourceforge.net/packages.html
ghostdog74
Thanks for the link.
C Johnson
Could you add an explanation of all the little parts that make up your code sample? That would be great.
C Johnson
+1  A: 

I would use awk, a command line tool similar to sed.

mv file.x file.x.bak
awk '{
  gsub( "#include \"bad_one.h\"" , "#include \"good_one.h\"" );
  gsub( "#include \"bad_two.h\"" , "#include \"good_two.h\"" );
  print;   # write each (possibly modified) line back out
}' file.x.bak > file.x

Once you are at a terminal, use man awk to see more details.

drawnonward
Thanks: that code is certainly terse and to the point. It does leave a little bit of scripting to do around that snippet, though.
C Johnson
+1  A: 

Make a series of perl one-liners to edit the files in place, like so:

perl -i.bak -p -e 's/stupid_old_name/cool_new_name/' *.c

This has the added bonus of saving the originals of any changed files with a .bak extension.

If I didn't know perl that well, I'd make a bunch of these and put all the one-liners into a shell script. But then, I'm not trying to impress any of the unix graybeards out there.

This website explains in-place editing with perl very well: http://www.rice.edu/web/perl-edit.html

PS - Since I do know perl fairly well, I'd just put the was/is table in a "real" perl script and use it to open and process all the files, something like the sketch below.
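
A minimal sketch of that, assuming the was/is pairs are pasted straight into the script (the in-place editing is the same -i / $^I mechanism that page describes):

#!/usr/bin/perl
# Sketch only: the was/is table lives in the script itself, and perl
# edits the files given on the command line in place.
use strict;
use warnings;

my %was_is = (
    'stupid_name.h'  => 'dir/new_name.h',
    'stupid_nameb.h' => 'dir/new_nameb.h',
    'stupid_namec.h' => 'dir/new_namec.h',
);

my $alt = join '|', map quotemeta, keys %was_is;

$^I = '.bak';    # same as -i.bak: back up each file, then edit in place
while (<>) {     # same as -p: visit every line of every file in @ARGV
    s/^(\s*#\s*include\s*)["<]($alt)[">]/$1<$was_is{$2}>/;
    print;
}

Run against the sources (perl whatever.pl *.c *.h), it behaves like the one-liners above, .bak files and all.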

SDGator
Thanks. I agree, I would have to write a real perl script if I were to use this realistically.
C Johnson
+1  A: 
Beta
Good point about #2 above. I was so used to doing this manually that I neglected to think about that. When I did this previously (manually), I had to rename the files by hand in Perforce. That was very tedious. I'll have to script that up too.
C Johnson
A: 

PowerGREP can do that. It can search for multiple search strings (literal text or regular expressions) in any combination of files, and is multithreaded (starting with PowerGREP 4, the current version).

You can save your searches for later re-use, too.

Tim Pietzcker
Thanks, looks very interesting.
C Johnson
Ah, a deal breaker. From their website: "Most grep tools can only search for a single regular expression. With PowerGREP you can use up to five sets of search terms."
C Johnson
I have tried it and successfully entered well over 100 search terms, so perhaps that limitation is no longer there. Are the names and their replacements really so individually different that you need hundreds of distinct regexes? I find that hard to believe.
Tim Pietzcker
Thanks: That is good to know it will handle many searches.
C Johnson