ansaurus

Question

Answer 1

A:

do it another way

$ echo 'a001~!+rr001~!+1~!+TEST DATA 1' | awk -F"+" '{gsub(/~!$/,"",$2);print $2}'
rr001

or this

$ echo  'a001~!+rr001~!+1~!+TEST DATA 1' | awk -F"[~][!][+]" '{print $2}'
rr001

or

$ echo  'a001~!+rr001~!+1~!+TEST DATA 1' | awk -F'~!\\+' '{print $2}'
rr001

ghostdog74 2009-11-07 15:25:55

The first proposed solution is not a good idea - for multiple reasons. The second will work, but is overkill compared to fixing the regex to '`~!\+`'. Using '-F' is nicer than setting FS in a BEGIN block, but you should use single quotes around it rather than double quotes - especially when it contains a backslash.

Jonathan Leffler 2009-11-07 15:33:18

can you then show the code and results of using '~!\+' ?

ghostdog74 2009-11-07 15:45:39

"The first proposed solution is not a good idea" -- agreed indeed!!

Xolve 2009-11-07 16:51:47

Answer 2

+1 A:

Your problem is that your match criteria '~!+' is a regular expression.

From the documentation: "+ This symbol is similar to ‘*’, except that the preceding expression must be matched at least once. This means that ‘wh+y’ would match ‘why’ and ‘whhy’, but not ‘wy’, whereas ‘wh*y’ would match all three of these strings."

So essentially you are asking to match ~! or ~!!, etc. So you are not matching on the + at all. This is why you see the + in the output. You should be able to use '~!\\+' to get your expression to work

Chris Dail 2009-11-07 15:27:25

Answer 3

+1 A:

 $ echo -n 'a001~!+rr001~!+1~!+TEST DATA 1' | awk 'BEGIN {FS="~!\\+"} {print $2}' 
rr001

Double escaping also seems to do the job.

2009-11-07 15:33:37

ansaurus

tags:

views:

answers:

AWK: field separator contains a '+'

related questions