ansaurus

Question

Parsing a blogspot XML file with Nokogiri

Answer 1

A:

Your code works for me. There were some problems with certain versions of Nokogiri.

I get:

 Content
 Content

I'm using nokogiri (1.4.1 x86-mswin32)

Nigel Thorne 2010-07-20 03:42:59

Thanks, Nigel. It turned out that I needed to be very specific with my XPath expressions, or else strip out the unneeded attributes. :D

meilas 2010-07-20 04:02:37

Answer 2

A:

It turns out that I had to delete the attributes on the feed element:

<feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'>

meilas 2010-07-20 04:01:51

Answer 3

A:

I just stumbled on this question. The issue appears to be XML namespaces:

"turns out that i had to delete the attributes for feed"

<feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'>

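XML namespaces complicate accessing nodes because an element in a namespaced document is identified by its namespace as well as its tag name. The feed element above declares a default namespace, so an XPath expression such as //entry, which ignores that namespace, matches nothing. Read the "Namespaces" section of the Nokogiri tutorial "Searching an HTML / XML Document".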

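Nokogiri also has the remove_namespaces! method, which is sometimes a useful way of dealing with the problem but has downsides too: once the namespaces are gone, elements from different namespaces that share a local name can no longer be told apart.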

Greg 2010-10-30 20:47:41
