ansaurus

Question

egrep regex not working on regex that works in other program.

Answer 1

+2 A:

ERE does not support \s. Use [[:space:]] for \s, or simply .

~~It seems the ^ points to the start of text that is not yet matched. I don't know why.~~ (This behavior is true on grep (GNU grep) 2.5.1 on Mac OS X only?)

The regex matches the header lines because all elements in the regex is optional. You need to change some of those * into +.

Since the file is in fixed-length format, it is far easier to use cut than constructing a regex.

cut -c 1-20 highly.txt

You could use grep -v to filter out the undesired results.

KennyTM 2010-10-23 17:24:05

Noted, problem still here. How do I force it to match the beginning of line only? It seems like egrep doesn't care that I added ^

mna 2010-10-23 17:34:08

Noted, but this doesn't get rid of the re-occurring headers

mna 2010-10-23 17:49:53

Answer 2

+1 A:

Try adding a -o option to grep to make it print only the part that matched the pattern instead of the line that has the pattern:

egrep -o -e  "^[[:space:]]*[0-9]*[[:space:]]*[0-9.e+]*" file
      ^^

Working link

Alternatively you can use sed as:

sed -r 's/^\s*([0-9]+)\s*([0-9.e+]+).*/\1 \2/' file

codaddict 2010-10-23 17:57:55

Thanks. Can you tell me what tool I could use to do something like "^[[:space:]]*([0-9]*)[[:space:]]*([0-9.e+]*)" -output "\1,\2" ? I'm new to the whole bash :S

mna 2010-10-23 18:06:44

That would be `sed`. I'll update the answer with it.

codaddict 2010-10-23 18:10:19

Answer 3

A:

if you have data that looks properly formatted, with delimiters that you can identify (eg in your case, tabs/spaces), there is no need to use regex. Use awk.

awk '!/--/&&$1!="no"{print $1,$2}' file

I believe this one liner is all you need since you said you want to get the first 2 columns and skip the headers. you can use cut too, but its not as flexible as awk.

ghostdog74 2010-10-23 22:13:17

how do I suppress the 'no-number' lines awk returns?

mna 2010-10-23 23:10:50

the one liner already does that. See that `$1!="no"` ?

ghostdog74 2010-10-24 02:05:54

$1!=" no" white spaces :)

mna 2010-10-25 20:26:16

ansaurus

tags:

views:

answers:

egrep regex not working on regex that works in other program.

related questions