ansaurus

Question

AWK program to find the average rainfall of three states

Answer 1

+2 A:

Your regexp should be:

/ CA / {CA++; CA_SUM+= $5} # matches CA as a separate, space-delimited word
/ TX / {TX++; TX_SUM+= $5} # matches TX as a separate, space-delimited word
/ AX / {AX++; AX_SUM+= $5} # matches AX as a separate, space-delimited word

/^AX$/ matches only if AX is the only text on the line.

HTH!
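To see the difference between these patterns, here is a small sketch. The sample record is made up (tab-separated, state in column 2, rainfall in column 5); the real layout of rainfall.txt is an assumption, and `sample` is just a helper name for this illustration.

```shell
# Hypothetical tab-separated record: id, state, city, year, rainfall
sample() { printf 'S1\tCA\tFresno\t2009\t1.50\n'; }

# Anchored pattern: the whole line would have to be exactly "CA"
sample | awk '/^CA$/ { n++ } END { print "anchored:", n + 0 }'

# Space-delimited pattern: needs a literal space on both sides of CA,
# so it finds nothing when the columns are separated by tabs
sample | awk '/ CA / { n++ } END { print "spaces:", n + 0 }'

# Field comparison: looks at column 2 only, whatever surrounds it
sample | awk -F '\t' '$2 == "CA" { n++ } END { print "field:", n + 0 }'
```

Only the field comparison counts the record here. If the data uses spaces rather than tabs, / CA / would match as well.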

EDIT

/ CA / {CA++; CA_SUM+= $5} # matches CA as a separate, space-delimited word
/ TX / {TX++; TX_SUM+= $5} # matches TX as a separate, space-delimited word
/ AX / {AX++; AX_SUM+= $5} # matches AX as a separate, space-delimited word
END {
 # Guard each average so a state with no matching lines cannot divide by zero
 if(CA!=0){CA_avg = CA_SUM/CA;     printf("CA Rainfall: %5.2f\n",CA_avg);}
 if(TX!=0){TX_avg = TX_SUM/TX;     printf("TX Rainfall: %5.2f\n",TX_avg);}
 if(AX!=0){AX_avg = AX_SUM/AX;     printf("AX Rainfall: %5.2f\n",AX_avg);}
}

belisarius 2010-10-16 21:13:06

@belisarius - does not work - I still see no output.

Eternal Learner 2010-10-16 21:19:21

@Eternal try removing your FS from the command line

belisarius 2010-10-16 21:32:10

@belisarius: Gives me a division by zero error

Eternal Learner 2010-10-16 21:32:55

@eternal wait ... testing

belisarius 2010-10-16 21:33:53

@belisarius: I tried something like this and I got a division by zero error:

BEGIN { FS = "\t" } ;
/\\tCA\\t/ {CA++; cA_SUM+= $5} # ^CA$ - Regular Expression to match the word CA only
/\\tTX\\t/ {TX++; TX_SUM+= $5} # ^TX$ - Regular Expression to match the word TX only
/\\tAX\\t/ {AX++; AX_SUM+= $5} # ^AX$ - Regular Expression to match the word AX only
END {
 CA_avg = CA_SUM/CA;
 TX_avg = TX_SUM/TX;
 AX_avg = AX_SUM/AX;
 printf("CA Rainfall: %5.2f",CA_avg);
 printf("CA Rainfall: %5.2f",TX_avg);
 printf("CA Rainfall: %5.2f",AX_avg);
}

Eternal Learner 2010-10-16 21:42:15

@Eternal Working now .. don't post code in comments :)

belisarius 2010-10-16 21:48:01

@Eternal It's running at http://ideone.com/tcHg1

belisarius 2010-10-16 21:49:51

@belisarius: Hey, I changed it to something like the code below and it works:

BEGIN { FS = "\t" } ;
/ CA / {CA++; CA_SUM+= $5} # CA - Regular Expression to match the word CA only
/ TX / {TX++; TX_SUM+= $5} # TX - Regular Expression to match the word TX only
/ AK / {AK++; AK_SUM+= $5} # AK - Regular Expression to match the word AK only
END {
 CA_AVG = CA_SUM/CA;
 TX_AVG = TX_SUM/TX;
 AK_AVG = AK_SUM/AK;
 printf("CA Rainfall: %f",CA_AVG);
 printf("TX Rainfall: %f",TX_AVG);
 printf("AK Rainfall: %f",AK_AVG);
}

Thanks for your help.

Eternal Learner 2010-10-16 22:03:57

Answer 2

+3 A:

The pattern /^CA$/ means the characters "C" and "A" are the only characters on the line. You want:

$2 == "CA" {CA++; CA_SUM+= $5}
# etc.

However, this is DRYer:

{ count[$2]++; sum[$2] += $5 }
END {
    for (state in count) {
        printf("%s Rainfall: %5.2f\n", state, sum[state]/count[state])
    }
}
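As a quick check of the array version, here is a sketch that runs it on three made-up records. The column layout (state in $2, rainfall in $5, tab-separated) and the helper names `rain_sample` and `state_averages` are assumptions for illustration only.

```shell
# Hypothetical sample: id, state, city, year, rainfall (tab-separated)
rain_sample() {
    printf 'S1\tCA\tFresno\t2009\t1.50\n'
    printf 'S2\tCA\tEureka\t2009\t2.50\n'
    printf 'S3\tTX\tAustin\t2009\t3.00\n'
}

# One counter and one running sum per state, keyed by the state code in $2
state_averages() {
    awk -F '\t' '
        { count[$2]++; sum[$2] += $5 }
        END {
            for (state in count)
                printf("%s Rainfall: %5.2f\n", state, sum[state] / count[state])
        }'
}

# for (state in count) visits keys in an unspecified order, so sort the output
rain_sample | state_averages | sort
```

No state is hard-coded, so a new state in the data needs no change to the script.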

Also, this looks wrong: awk 'FS="\t"'-f awk1.awk rainfall.txt
try: awk -F '\t' -f awk1.awk rainfall.txt
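In the original command, the quoted 'FS="\t"' sits where awk expects the program text, and with no space before -f the shell joins the two into a single word, so awk1.awk is never loaded as a script. A small sketch of the two usual ways to set a tab separator, using a made-up record whose city name contains a space (the `row` helper and the column layout are assumptions):

```shell
# Hypothetical record; the space inside "Los Angeles" shifts the columns
# when awk splits on any whitespace
row() { printf 'S1\tCA\tLos Angeles\t2009\t1.50\n'; }

row | awk '{ print $5 }'                       # default splitting: $5 is the year
row | awk -F '\t' '{ print $5 }'               # tab set on the command line
row | awk 'BEGIN { FS = "\t" } { print $5 }'   # tab set inside the program
```

The last two commands are equivalent; pick one of them rather than combining both.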

Response to comments:

awk -F '\t' -v month=2 -v states="CA,AZ,TX" '
    BEGIN {
        month_col = month + 3  # assume January is month 1
        split(states, wanted_states, /,/)
    }
    { count[$2]++; sum[$2] += $month_col }
    END {
        # split() stores the state names as values, so loop over the indices
        for (i in wanted_states) {
            state = wanted_states[i]
            if (state in count)
                printf("%s Rainfall: %5.2f\n", state, sum[state]/count[state])
            else
                print state " Rainfall: no data"
        }
    }
' rainfall.txt

glenn jackman 2010-10-16 23:55:10

+1 for a more general solution and mentioning DRY in the context of rain.

schot 2010-10-18 06:42:47

+1 Much better than mine. I was thinking only of correcting the OP's errors, which always begets a shortsighted answer. You could improve it a bit more by accepting the month number as a parameter on the command line. Just my 2 cents.

belisarius 2010-10-20 22:00:00

You could change your DRY version to select particular states: `awk -v statelist="AK CA TX" 'match(statelist,$2){ count[$2]++; sum[$2] += $5 } ...`. Or use a shell variable instead of the literal: `states="AK CA TX"; awk -v statelist="$states" '...'` (quote the variable, since its value contains spaces).

Dennis Williamson 2010-10-26 02:35:53
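One caveat with match(statelist, $2): it treats $2 as a regular expression and accepts any substring, so a stray state value such as "A" would also be selected. A possible variant, sketched here with made-up data and hypothetical helper names (`rows`, `selected_averages`), splits the list into a lookup array and tests for exact membership:

```shell
# Hypothetical sample; the "A" row is there to show what an exact lookup rejects
rows() {
    printf 'S1\tCA\tFresno\t2009\t1.50\n'
    printf 'S2\tCA\tEureka\t2009\t2.50\n'
    printf 'S3\tA\tNowhere\t2009\t9.00\n'
    printf 'S4\tNY\tAlbany\t2009\t4.00\n'
}

selected_averages() {
    # $1: space-separated list of state codes to keep
    awk -F '\t' -v statelist="$1" '
        BEGIN {
            # Turn the list into keys of "wanted" for exact membership tests
            n = split(statelist, names, " ")
            for (i = 1; i <= n; i++) wanted[names[i]] = 1
        }
        $2 in wanted { count[$2]++; sum[$2] += $5 }
        END {
            for (state in count)
                printf("%s Rainfall: %5.2f\n", state, sum[state] / count[state])
        }'
}

rows | selected_averages "AK CA TX"
```

With this sample only the CA rows are averaged; the "A" and "NY" rows are skipped, and AK and TX produce no line because they have no data.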
