ansaurus

Question

Humanized dates with awk?

Answer 1

A:

Gawk has strftime(). You can also call the date command to format them (man). Linux Forums gives some examples.

mcandre 2009-07-09 14:58:27

Answer 2

+1 A:

if you are using gawk

awk 'BEGIN{
    s="03/05/2009"
    m=split(s,date,"/")
    t=date[3]" "date[2]" "date[1]" 0 0 0"
    print strftime("%b %d",mktime(t))
}'

the above is just an example, as you did not show your actual code and so cannot incorporate it into your code.

ghostdog74 2009-07-09 15:00:00

See my other comment on Dennis's solution, but strftime("%b %e",mktime(t)) is actually closer to what I wanted.

dtjohnso 2009-07-10 12:58:31

Answer 3

+1 A:

Why don't you prepend your awk-date to the original date? This yields a sortable key, but is human readable.

(Note: to sort right, you should make it yyyymmdd)

If needed, cut can remove the prepended column.

xtofl 2009-07-09 15:00:51

Answer 4

+2 A:

I get testy when I see someone using grep and awk (and sed, cut, ...) in a pipeline. Awk can fully handle the work of many utilities.

Here's a way to clean up your updated code to run in a single instance of awk (well, gawk), and using sort as a co-process:

gawk '
    BEGIN {
        IGNORECASE = 1
    }
    function mon2num(mon) {
        return(((index("JanFebMarAprMayJunJulAugSepOctNovDec", mon)-1)/3)+1)
    }
    / E[DS]T [[:digit:]][[:digit:]][[:digit:]][[:digit:]]/ {
        month=$2
        day=$3
        year=$6
        date=sprintf("%4d%02d%02d", year, mon2num(month), day)
        total[date]++
        human[date] = sprintf("%3s %2d, %4d", month, day, year)
    }
    END {
        sort_coprocess = "sort"
        for (date in total) {
            print date |& sort_coprocess
        }
        close(sort_coprocess, "to")
        print "Date\tCount"
        while ((sort_coprocess |& getline date) > 0) {
            print human[date] "\t" total[date]
        }
        close(sort_coprocess)
    }
' original.txt

glenn jackman 2009-07-09 19:42:06

Thanks! I wondered if there was a way to do this all in [g]awk, but I'm obviously not good enough with it. I like your year-indifferent match pattern too.

dtjohnso 2009-07-09 19:53:18

another way is to use gawk's own asort or asorti() routine

ghostdog74 2009-07-10 00:01:32

Answer 5

+2 A:

Use awk's sort and date's stdin to greatly simplify the script

Date will accept input from stdin so you can eliminate one pipe to awk and the temporary file. You can also eliminate a pipe to sort by using awk's array sort and as a result, eliminate another pipe to awk. Also, there's no need for a coprocess.

This script uses date for the monthname conversion which would presumably continue to work in other languages (ignoring the timezone and month/day order issues, though).

The end result looks like "grep|date|awk". I have broken it into separate lines for readability (it would be about half as big if the comments were eliminated):

grep -i "E[DS]T 2009" original.txt | 
date -f - +'%Y %m %d' | #reformat dates as YYYYMMDD for future sort
awk ' 
BEGIN { printf "%s\t%s\r\n","Date","Count" }

{ ++total[$0] #pump dates into associative array }

END {
    idx=1
    for (item in total) {
        d[idx]=item;idx++ # copy the array indices into the contents of a new array
    }
    c=asort(d) # sort the contents of the copy
    for (i=1;i<=c;i++) { # use the contents of the copy to index into the original
        printf "%s\t%2.d\r\n",strftime("%b %e, %Y",mktime(d[i]" 0 0 0")),total[d[i]]
    }
}'

Dennis Williamson 2009-07-09 20:36:48

Very nice! Did you intend to also include this?: BEGIN { printf "%s\t%s\r\n","Date","Count" }

dtjohnso 2009-07-09 21:05:38

Oops, forgot the header. Fixed.

Dennis Williamson 2009-07-09 21:48:27

One other thing, what I really wanted for the output was what I get with strftime("%b %e, %Y"... not strftime("%b %d, %Y"... An easy enough fix though.

dtjohnso 2009-07-10 12:57:16

Fixed .

Dennis Williamson 2009-07-10 15:29:58

ansaurus

tags:

views:

answers:

Humanized dates with awk?

Use awk's sort and date's stdin to greatly simplify the script

related questions