views:

93

answers:

2

Does anyone know if there are standards / api to crawl news articles from most of the biggest news sources.

I'm using rss to index them but I would like to classify them with more data than just their titles.

+1  A: 

If I read correctly, you mean Atom?

There's also RSS specifications

Trick
well most of the news sources don't add the main content of their articles in the feeds no ?
mnml
Sadly, no... You'll have to write your own crawler/spyder to get everything.
Trick
A: 

Using Api

http://www.daylife.com

http://spotlight.reuters.com

mnml