ansaurus

Question

feedparser and Google News

Answer 1

+1 A:

First you need to check out RSS Specification. And here is a feed parser. That should get you started.

David Basarab 2009-11-04 02:46:31

Answer 2

+2 A:

Have you examined the feed from Google News?

There is a root element in each feed which contains a bunch of information and the actual entries dict. Here's a dirty way to see what's available:

import feedparser
d = feedparser.parse('http://news.google.com/news?pz=1&amp;cf=all&amp;ned=ca&amp;hl=en&amp;topic=w&amp;output=rss')

print [field for field in d]

From what we can see we have an entries field which most likely contains .. news entries! If you:

import pprint
pprint.pprint(entry for entry in d['entries'])

We get some more information :) That will show you all the fields related to each entry in a pretty printed manner (that's what pprint is for)

So, to fetch all the titles of our news entries from this feed:

titles = [entry.title for entry in d['entries']

so, play around with that. Hopefully that's a helpful start

Bartek 2009-11-04 02:50:01

Humm... I played around with this a bit. Apparently this rss give only a summary, not the full text of the news. :(

Rafael S. Calsaverini 2009-11-04 03:13:31

Answer 3

A:

is there any existing library will do the same function which feed parser is doing?

GoldGod 2009-12-07 18:12:20

ansaurus

tags:

views:

answers:

feedparser and Google News

related questions