ansaurus

Question

search by column (not field) number with awk

Answer 1

A:

Why don't you use if/else?

Something like the algorithm below:

if $5 is not blank
{ 
if $6==temp print $0
}
else if $7==temp print $0

It would also be easier to understand if you provided some sample input!

Vijay Sarathi 2010-08-30 10:39:11

Sorry! A typical input line is HETATM 5307 S MOY A 602 14.660 14.666 109.556 1.00 26.41 S and from time to time the A (or whatever character is in this position) gets left out. The format is defined by column number, so searching by column would be less prone to error.

Chris 2010-08-30 10:47:48

Answer 2

A:

awk -F"[ ]" -v temp="${het}" '$6==temp' file

ghostdog74 2010-08-30 11:02:27

This is getting beyond the reach of my awk knowledge, and I don't think I'm seeing the full meaning of the "[ ]" field separator. Could you explain that one?

Chris 2010-08-30 11:18:39

Please see schot's answer :)

ghostdog74 2010-08-30 12:25:10

Answer 3

+1 A:

Please add the sample input to your question, not to a comment. It is still not clear what your input looks like. Given your 'normal' input line:

HETATM 5307 S MOY A 602 14.660 14.666 109.556 1.00 26.41 S

Which of the following two lines matches your input when 'field 5 is blank'?

HETATM 5307 S MOY  602 14.660 14.666 109.556 1.00 26.41 S  
HETATM 5307 S MOY   602 14.660 14.666 109.556 1.00 26.41 S

In the first case, ghostdog74's answer should work. The -F"[ ]" he uses is a clever way of splitting on single spaces only. -F" " does not work, because then awk uses its default whitespace splitting.
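
To see the difference between the two splitting modes, here is a small sketch using the sample line from the question and the first 'blank' variant above. The variable names are only for illustration.

```shell
# Sample line from the question, and the variant where the "A" is
# replaced by nothing (two consecutive spaces remain).
with_chain='HETATM 5307 S MOY A 602 14.660 14.666 109.556 1.00 26.41 S'
without_chain='HETATM 5307 S MOY  602 14.660 14.666 109.556 1.00 26.41 S'

# Default splitting collapses runs of blanks, so on the second line
# the number slides from $6 into $5.
default_missing=$(printf '%s\n' "$without_chain" | awk '{ print $5 "|" $6 }')

# With -F'[ ]' every single space is a separator, so the missing
# character leaves an empty $5 and the number stays in $6.
single_present=$(printf '%s\n' "$with_chain" | awk -F'[ ]' '{ print $5 "|" $6 }')
single_missing=$(printf '%s\n' "$without_chain" | awk -F'[ ]' '{ print $5 "|" $6 }')

printf '%s\n' "$default_missing" "$single_present" "$single_missing"
```

With the single-space separator, $6 holds 602 on both lines, which is why the '$6==temp' test keeps working when the character is missing.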

If your data is of the second format, I would use substr() to extract the correct field:

 awk -v temp="${het}" 'substr($0, 20, 3) == temp'
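
A minimal sketch of the substr() approach follows. The two sample lines and their column layout are made up for illustration (the spacing of the line quoted in the comment may have been collapsed), so the start column is looked up with index() on a known-good line instead of being assumed.

```shell
# Hypothetical fixed-width lines: the number sits in the same columns
# whether or not the one-character field before it is filled in.
line_with='HETATM 5307  S   MOY A 602'
line_without='HETATM 5307  S   MOY   602'

# Find the 1-based column where "602" starts on a line known to be complete.
start=$(printf '%s\n' "$line_with" | awk '{ print index($0, "602") }')

# Count the lines whose three characters at that column equal the target.
matches=$(printf '%s\n' "$line_with" "$line_without" |
  awk -v temp="602" -v start="$start" \
    'substr($0, start, 3) == temp { n++ } END { print n + 0 }')

printf 'start column: %s, matching lines: %s\n' "$start" "$matches"
```

One thing to watch: substr() returns a string, so a narrower number padded with a space (for example " 42") will not compare equal to "42" unless you force a numeric comparison, e.g. substr($0, start, 3) + 0 == temp + 0.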

Another option could be using gawk's fixed-width splitting, but it really depends on the exact format of your input.

schot 2010-08-30 12:09:57

Thanks everyone for your help. The gawk pointer is a big help; I'll give the FIELDWIDTHS man page entry a look and use that. Should be OK from here.

Chris 2010-08-30 14:54:48

Answer 4

A:

Based on schot's suggestion and your example data:

awk -v FIELDWIDTHS="6 1 4 2 1 3 3 1 1 1 3" '{print $11}'

The final "3" in FIELDWIDTHS represents the field that contains "602". I've omitted field widths for the rest of the line. Some of the field widths could be combined, but I didn't know which whitespace acts as a delimiter and which is part of a field's contents.
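
Here is a quick way to try the FIELDWIDTHS idea. The sample lines are hypothetical and laid out to fit the widths above; FIELDWIDTHS is a gawk extension, so the sketch checks for gawk first.

```shell
# Hypothetical lines matching the widths 6 1 4 2 1 3 3 1 1 1 3.
line_with='HETATM 5307  S   MOY A 602 14.660'
line_without='HETATM 5307  S   MOY   602 14.660'
widths='6 1 4 2 1 3 3 1 1 1 3'

if command -v gawk >/dev/null 2>&1; then
  # Field 9 is the one-character column that is sometimes blank.
  fw_chain=$(printf '%s\n' "$line_without" |
    gawk -v FIELDWIDTHS="$widths" '{ printf "[%s]\n", $9 }')
  # Field 11 holds the number on both lines, so both should match.
  fw_matches=$(printf '%s\n' "$line_with" "$line_without" |
    gawk -v FIELDWIDTHS="$widths" -v temp="602" \
      '$11 == temp { n++ } END { print n + 0 }')
  printf 'blank field: %s, matching lines: %s\n' "$fw_chain" "$fw_matches"
else
  echo 'gawk not found; FIELDWIDTHS needs gawk'
fi
```

Because the fields are cut by position, a blank column stays in its own field and does not shift the ones after it.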

Dennis Williamson 2010-08-30 14:40:59
