ansaurus

Question

How to parse the "<media:group>" using feedparser?

Answer 1

A:

You can parse the feed using

feed = feedparser.parse(your_feeds_url)

and then access your xml elements using either python's attribute access or dictionary-like access on feed and its subelements. The former method won't work for an element name like media:content, so use the latter method.

The rest should become clear after studying the examples at http://www.feedparser.org

jellybean 2010-03-17 12:59:28

I print the content of the feed, it do not contain the information of media:content. I think feedparser skip to parse it.This is the RSS URL: http://www1.voanews.com/templates/Articles.rss?sectionPath=/learningenglish/home

Mingo 2010-03-17 14:48:02

Answer 2

+2 A:

feedparser 4.1 as available from PyPi has this bug.

the solution for me was to get the latest feedparser.py (4.2 pre) from the repository.

svn checkout http://feedparser.googlecode.com/svn/trunk/ feedparser-readonly
cd feedparser-readonly
python setup.py install

now you can access all mrss items

>>> import feedparser  # the new version!
>>> d = feedparser.parse(MY_XML_URL)
>>> for content in d.entries[0].media_content: print content['url']

should do the job for you

captnswing 2010-06-30 14:57:25

ansaurus

tags:

views:

answers:

How to parse the "<media:group>" using feedparser?

related questions