Greetings,
I have a large dataset (1GB of pure compressed text).
I'm rewriting the dataset by deriving new values from information already in the data, for example:
- Turn 2009-10-16 into Friday
- Count the number of times something happens and how long each occurrence lasts
Right now I'm doing all this in Java. I'm wondering if anyone knows of a tool or language that was actually designed for this type of work. It is possible in Java, but I end up writing a lot of boilerplate code.
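To make the two examples above concrete, here is a minimal sketch of what they might look like in plain Java using `java.time`. The `Event` class, the method names and the sample values are hypothetical and chosen only for illustration; it assumes dates arrive as ISO strings and that each occurrence has a start and an end timestamp.

```java
import java.time.Duration;
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.format.TextStyle;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

class RewriteSketch {

    // Hypothetical record shape: an event name plus ISO start and end timestamps.
    static final class Event {
        final String name;
        final LocalDateTime start;
        final LocalDateTime end;

        Event(String name, String start, String end) {
            this.name = name;
            this.start = LocalDateTime.parse(start);
            this.end = LocalDateTime.parse(end);
        }
    }

    // Turn an ISO date such as "2009-10-16" into its English day name.
    static String dayName(String isoDate) {
        return LocalDate.parse(isoDate)
                .getDayOfWeek()
                .getDisplayName(TextStyle.FULL, Locale.ENGLISH);
    }

    // Count how many times each event name occurs.
    static Map<String, Long> countByName(List<Event> events) {
        Map<String, Long> counts = new LinkedHashMap<>();
        for (Event e : events) {
            counts.merge(e.name, 1L, Long::sum);
        }
        return counts;
    }

    // Sum the duration of all occurrences of each event name.
    static Map<String, Duration> totalDurationByName(List<Event> events) {
        Map<String, Duration> totals = new LinkedHashMap<>();
        for (Event e : events) {
            totals.merge(e.name, Duration.between(e.start, e.end), Duration::plus);
        }
        return totals;
    }

    public static void main(String[] args) {
        // Made-up sample data, not taken from the real dataset.
        List<Event> events = Arrays.asList(
                new Event("login", "2009-10-16T09:00:00", "2009-10-16T09:05:00"),
                new Event("login", "2009-10-16T10:00:00", "2009-10-16T10:10:00"),
                new Event("backup", "2009-10-16T23:00:00", "2009-10-16T23:30:00"));

        System.out.println(dayName("2009-10-16"));
        System.out.println(countByName(events));
        System.out.println(totalDurationByName(events));
    }
}
```

Even in this small form, most of the lines are class and map plumbing rather than the transformations themselves, which is the boilerplate I would like to avoid.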
Thanks