ansaurus

Question

awk search on multiple fields of a multi line record file

Answer 1

+2 A:

if you have set FS="\n", and RS="", then the first field $1 would be SMS-MO-FSM. Therefore your awk code is

awk 'BEGIN{FS="\n"; RS=""} $2~/country.*MO/ && $1~/SMS-MO-FSM/ ' file

ghostdog74 2010-08-09 09:35:21

Watch out with unwanted regex matches (like `country: SMO`). I would use string comparison whenever possible and anchor all regex.

schot 2010-08-09 09:45:59

adaptive 2010-08-09 10:03:48

when piping to sort, you need newlines.. i don't know how to answer your question since you don't provide enough information on your data. try setting OFS="\n" and see.

ghostdog74 2010-08-09 10:35:44

Answer 2

+2 A:

(I post this as a separate answer instead of a comment reply for better formatting)

Concerning your second remark about printing a record on a single line: When you don't modify your records OFS and ORS have no effect. Only when you change $0 or one of the fields awk will recompute NF and reconstruct $0 based on $1 OFS $2 OFS ... $NF ORS. You can force this reconstruction like this:

BEGIN {
    FS  = "\n"
    RS  = ""
    OFS = ";"     # Or another delimiter that does not appear in your data
    ORS = "\n"
}
$2 ~ /^[ \t]*country:[ \t]*MO[ \t]*$/ && $1 ~ /^[ \t]*SMS-MO-FSM[ \t]*$ {
    $1 = $1 ""    # This forces the reconstruction
    print
}

schot 2010-08-09 10:54:27

brilliant. thank you.

adaptive 2010-08-10 07:00:49

ansaurus

tags:

views:

answers:

awk search on multiple fields of a multi line record file

related questions