I have several Google calendars that I'd like to merge and place on my windows desktop using Samurize. I've tried using Samurize's Page Scraper plugin, but it doesn't appear to be up to the task.
I can get Samurize to run a script and place it's output on the desktop, but I'm not sure what the best tools are to do this.
All the URLs I have are of the form:
http://www.google.com/calendar/feeds/example%40gmail.com/private-REMOVED/basic?futureevents=true&orderby=starttime&sortorder=ascending&singleevents=true
So I could fetch them using curl, but then I need to filter them.
I want something that looks like:
2009 12 02 Event from calendar 1's description
2009 12 03 Event from calendar 2's description
2009 12 04 Event from calendar 1's description
2009 12 05 Event from calendar 3's description
2009 12 06 Event from calendar 1's description
However the dates in the calendar feeds are formatted like this:
<title type='html'>Event from calendar 1's description</title><summary type='html'>When: Fri 5 Dec 2008<br>
So how do I filter out the dates and descriptions, and convert the dates?
(I have cygwin installed so something using perl or sed/awk would be perfect as I'm familiar enough with them that I'd be confident about altering them in future, but I'm open to suggestions.)