ansaurus

Question

Problem with namespace while parsing XML file in C#

Answer 1

+2 A:

Well, welcome at SO then ;-)

In XML, a namespace declaration is saint. Removing it may well make the XML unusable, so I'd advice against it (and it's a huge task on a 2.8GB file!). Each name should be considered unique as in {namespace}elementname (i.e, both) whenever you deal with XML. Linq to XML accepts namespaces and you should use them:

XNamespace wiki = "http://www.mediawiki.org/xml/export-0.4/";

var text = from el in StreamXmlDocument(filePath)
           where el.Element(wiki + "title").Value.Contains(titleToSearch)
           select (string)el.Element(wiki + "revision").Element(wiki + "text");

(may be ignored, you do this already):
A note on the XML: Linq2XML will load the whole thing in memory, I believe, just like DOM, which will require about 4.5 times the size of the file. This may be problematic. Read this MSDN blog about streaming Linq to XML.

Abel 2010-07-23 16:41:32

Thanks, yes I know about memory issues, that's why I use XmlReader. It reads only one Element at a time to memory :) Thanks for respond. I'll check it now

Ventus 2010-07-23 16:49:32

Great! This works fine. Thanks again :)

Ventus 2010-07-23 16:56:49

Answer 2

+1 A:

I believe you want:

XNamespace ns = "http://www.mediawiki.org/xml/export-0.4/";

var text = from el in StreamXmlDocument(filePath)
           where el.Element(ns+"title").Value.Contains(titleToSearch)
           select (string)el.Element(ns+"revision").Element(ns+"text");

James Curran 2010-07-23 16:44:15

how equal can we be ;-) Just trying to be picky: the last `Element`, you probably want `Element(ns + "text")`

Abel 2010-07-23 16:47:53

D'oh! And I was thinking of using "wiki" for the namespace variable...

James Curran 2010-07-23 16:58:22

ansaurus

tags:

views:

answers:

Problem with namespace while parsing XML file in C#

related questions