ansaurus

Question

Which Ruby XML library would you recommend for a 2.4MB XML file?

Answer 1

+1 A:

I have used libXML before for xml parsing, it has a nice API and is fast.

MatthewFord 2008-09-24 10:14:56

Answer 2

A:

Maybe you could distill the XML with a xslt stage prior running in Ruby?

epatel 2008-09-24 10:18:39

I took a course on XSLT a few years back - I still wake up screaming some nights. My ageing brain doesn't equate it with "easy to use" I'm afraid.

Mike Woodhouse 2008-09-24 10:32:01

Ha...I agree, but some people like it.

epatel 2008-09-25 08:31:14

you get a vote for that comment! :)

epatel 2008-09-25 08:33:18

Answer 3

A:

Take the one that offers full XPath support and has some samples that get you started immediately ;)

VVS 2008-09-24 10:28:37

Answer 4

+4 A:

RubyInside had an article about that recently. Check it out.

webmat 2008-09-24 21:30:17

Answer 5

+3 A:

Hpricot is probably the best tool for you -- it is easy to use and should handle 2mg file with no problem.

Speedwise libxml should be the best. I used libxml2 binding for python few months ago (at that moment rb-libxml was stale). Streaming interface worked the best for me (LibXML::XML::Reader in ruby gem). It allows to process file while it is downloading, is a bit more userfriendly than SAX and allowed me to load data from 30mb xml file from internet to a MySQL database in a little more than a minute.

dimus 2008-09-28 21:05:31

Answer 6

+2 A:

Nokogiri wraps libxml2 and libxslt with a clean, Rubyish API that supports namespaces, XPath and CSS3 queries. Fast, too. http://nokogiri.org/

Thomas 2009-09-18 11:40:51

ansaurus

tags:

views:

answers:

Which Ruby XML library would you recommend for a 2.4MB XML file?

related questions