ansaurus

Question

Answer 1

+2 A:

sed 's/,,,/replacement/' < old-file.csv > new-file.csv

optionally followed by mv new-file.csv old-file.csv

wnoise 2009-01-19 22:21:16

O.M.G! Kickin' it old skool! It makes me feel ooolllldddd. :-)

Peter Rowell 2009-01-19 22:26:07

doesn't remove the line... see David's for better use of sed

orip 2009-01-19 23:01:05

It asked for replacement, not removal when I answered.

wnoise 2009-01-20 00:57:43

@Peter, it's not old school it's classic@orip it's a fine use of sed

jskulski 2009-01-20 05:20:01

In wnoise's defense, it did say replace at first; see the edit history. And this sed usage is portable across platforms; notations using '-i' are specific to GNU sed (and hence valid for the question which is about files on Linux).

Jonathan Leffler 2009-01-21 07:30:32

Answer 2

+1 A:

Replace or remove, your post is not clear... For replacement see wnoise's answer. For removing, you could use

awk '$0 !~ /,,,/ {print}' <old-file.csv > new-file.csv

Keltia 2009-01-19 22:27:02

Answer 3

+5 A:

It depends on what you mean by replace. If you mean 'remove', then a trivial variant on @wnoise's solution is:

grep -v '^,,,$' old-file.csv > new-file.csv

Note that this deletes just those lines with exactly three commas. If you want to delete mal-formed lines with any number of commas (including zero) - and no other characters on the line, then:

grep -v '^,*$' ...

There are endless other variations on the regex that would deal with other scenarios. Dealing with full CSV data with commas inside quotes starts to need something other than a regex machine. It can be done, within broad limits, especially in more complex regex systems such as PCRE or Perl. But it requires more work.

Check out Mastering Regular Expressions.

Jonathan Leffler 2009-01-19 22:28:32

Answer 4

+1 A:

Do you want to replace them with something, or delete them entirely? Either way, it can be done with sed. To delete:

sed -i -e '/^,\+$/ D' yourfile1.csv yourfile2.csv ...

To replace: well, see wnoise's answer, or if you don't want to create new files with the output,

sed -i -e '/^,\+$/ s//replacement/' yourfile1.csv yourfile2.csv ...

or

sed -i -e '/^,\+$/ c\
replacement' yourfile1.csv yourfile2.csv ...

(that should be entered exactly as is, including the line break). Of course, you can also do this with awk or perl or, if you're only deleting lines, even grep:

egrep -v '^,+$' < oldfile.csv > newfile.csv

I tested these to make sure they work, but I'd advise you to do the same before using them (just in case). You can omit the -i option from sed, in which case it'll print out the results (rather than writing them back to the file), or omit the output redirection >newfile.csv from grep.

EDIT: It was pointed out in a comment that some features of these sed commands only work on GNU sed. As far as I can tell, these are the -i option (which can be replaced with shell redirection, sed ... <infile >outfile ) and the \+ modifier (which can be replaced with \{1,\} ).

David Zaslavsky 2009-01-19 22:32:20

nice - 'sed -i' rocks

orip 2009-01-19 23:01:40

Some of your 'sed' options are not portable (GNU sed specific). Not a major problem as long as you're aware of that.

Jonathan Leffler 2009-01-19 23:18:16

@Johnathan: true, I only ever use GNU sed and I tend to forget about its extensions unless I'm actually staring at the info page. Thanks.

David Zaslavsky 2009-01-20 00:57:22

@Dahvid :D The question did say "Linux file system" - your answer is valid given that constraint.

Jonathan Leffler 2009-01-20 01:06:07

Answer 5

+1 A:

What about trying to keep only lines which are matching the desired format instead of handling one exception ?

If the provided input is what you really want to match:

grep -E '[a-z],[a-z],[a-z],[a-z]' < oldfile.csv > newfile.csv

If the input is different, provide it, the regular expression should not be too hard to write.

MatthieuP 2009-01-19 23:27:22

Answer 6

+1 A:

Most simply:

$   grep -v ,,,, oldfile > newfile   
$   mv newfile oldfile

Brendan Dowling 2009-01-20 03:26:10

Only 3 commas in pattern to be removed. :D

Jonathan Leffler 2009-01-26 04:39:57

Answer 7

A:

yes, awk or grep are very good option if you are working in linux platform. However you can use perl regex for other platform. using join & split options.

2009-01-20 04:36:43

Why split and join? Yes, you can certainly use perl. But the basic loop would be using a regex to match or not match the lines to be printed - I don't see the join/split operation. Even a replacement instead of a delete probably wouldn't use join or split.

Jonathan Leffler 2009-01-21 07:33:25

ansaurus

tags:

views:

answers:

Replacing a line in a csv file?

related questions