ansaurus

Question

PHP - Extract a chunk of XML from a larger XML document

Answer 1

+1 A:

preg_match('`<guest>.*</guest>`is', $xml, $matches);
print_r($matches);

Kamil Szot 2009-12-15 16:35:28

Answer 2

+2 A:

$reader = new XMLReader();
$reader->xml($xml_str);
$reader->read();
$inner = $reader->readInnerXML();

// $inner is your desired xml string.

One advantage of using XMLReader is that it uses less memory than SimpleXML or the DOM classes. Another is that it's very fast.

GZipp 2009-12-15 16:38:57

I thought this would be fastest as well but when I bench marked it against the other solutions it turned out to be the slowest. Using an XML file with a thousand nodes to be selected the other solutions were generally about 60% as long to complete (that simplexmlelement xpath solution averaged 5.8 ms while this XMLReader based solution averaged 10 ms) Maybe I did something wrong. Thanks for the advice, though. Helped me understand the whole thing better.

rg88 2009-12-15 19:48:05

I just tested this myself on a very large file and you're right; it is slower than SimpleXML and DOMXPath, and by about the same ratio as your tests showed. That surprises me, also, since I've found it to be generally faster when retrieving all the data, node by node, from large files.

GZipp 2009-12-15 21:09:02

Answer 3

+2 A:

$string = <<<XML
<?xml version="1.0" encoding="utf-8"?>
<everyone>
  <guest>
    <name>Joseph Needham</name>
    <age>53</age>
  </guest>
  <guest>
    <name>Lu Gwei-djen</name>
    <age>31</age>
  </guest>
</everyone>

XML;

$xml = new SimpleXMLElement($string);
$nodes = $xml->xpath('/everyone/guest');

$result = '';
foreach ( $nodes as $node ) {
  $result .= $node->asXML()."\n";
}
echo $result;
die;

Derek Illchuk 2009-12-15 16:41:45

This was easy to do and was as fast or faster than the other solutions. I appreciate the help.

rg88 2009-12-15 19:42:31

Answer 4

+2 A:

Something like this (using XPath - if you have another way to get a list of the guest elements, you can use that) should do the trick.

$xml = '';
$xpath = new DOMXPath($document);
foreach($xpath->query('//everyone/guest') as $guestNode) {
    $xml .= $document->saveXML($guestNode);
}

BlackAura 2009-12-15 16:44:45

This worked but for some reason I kept getting extra space added to things. I could remove it with trim(), I suppose. Thanks for the advice.

rg88 2009-12-15 19:43:20

ansaurus

tags:

views:

answers:

PHP - Extract a chunk of XML from a larger XML document

related questions