PHP XPath explicit node... | ansaurus

Q:

PHP XPath explicit node...

I am trying to pull one specific table during a "web scrape." I used cURL to fetch the page into $html, which works fine.

I used Firebug to get the exact XPath to the table I need.

Code follows:

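// The DOMDocument constructor takes an XML version and encoding, not markup.
$dom = new DOMDocument();
// Parse the page that was fetched with cURL.
$dom->loadHTML($html);

$xpath = new DOMXPath($dom);
// Absolute path copied from Firebug, including the tbody steps it displays.
$summary = $xpath->evaluate('/html/body/table[5]/tbody/tr/td[3]/table/tbody/tr[8]/td/table');
echo "Summary Length: " . $summary->length;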

When executed, $summary->length is always zero. It doesn't pull that table node.

Any ideas?

+1 A:

Firefox is liable to insert "virtual" tbody elements into tables that don't have them; do those elements exist in the original file?
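To illustrate the point, here is a minimal sketch, assuming the libxml2-based parser behind DOMDocument::loadHTML, which as far as I know does not add tbody elements on its own. The sample markup and variable names are made up for the illustration; they are not from the page in the question.

```php
<?php
// Hypothetical sample markup: a table written without <tbody>.
$html = '<html><body><table><tr><td>cell</td></tr></table></body></html>';

libxml_use_internal_errors(true);   // collect parser warnings instead of printing them
$dom = new DOMDocument();
$dom->loadHTML($html);
libxml_clear_errors();

$xpath = new DOMXPath($dom);

// Path as a browser inspector would report it, with the tbody it inserted.
$withTbody = $xpath->query('/html/body/table/tbody/tr/td');

// Path that matches the markup as it is written in the source.
$withoutTbody = $xpath->query('/html/body/table/tr/td');

echo "With tbody:    " . $withTbody->length . "\n";
echo "Without tbody: " . $withoutTbody->length . "\n";
```

If the first count is zero and the second is not, the tbody steps in the copied path are the likely culprit.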

Rob Kennedy 2009-05-07 20:26:15

No, they don't. But I do see them in Firefox. I have used XPath Checker as well and can see the data I need. But using the same path in my PHP $xpath->evaluate() call never returns data.

2009-05-07 20:46:42

<tr> is not allowed inside <table> directly - there has to be a <tbody> / <thead> / <tfoot>. It's implied if not specified directly. HTML is weird like that... the start and end tags can both be optional!

Greg 2009-05-07 20:55:07

If the tbody elements don't exist in the original file, then they shouldn't be in your PHP XPath query.
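One possible way to apply that is to ask the parsed document whether it contains any tbody at all, and drop the tbody steps from the copied path only when it does not. This is a rough sketch; the sample markup, $browserPath, and the other variable names are hypothetical.

```php
<?php
// Hypothetical sample page: the table is written without <tbody>.
$html = '<html><body><table><tr><td>a</td><td>b</td></tr></table></body></html>';

libxml_use_internal_errors(true);   // collect parser warnings instead of printing them
$dom = new DOMDocument();
$dom->loadHTML($html);
libxml_clear_errors();
$xpath = new DOMXPath($dom);

// Hypothetical path as copied from a browser inspector.
$browserPath = '/html/body/table/tbody/tr/td[2]';

// Ask the parsed document, not the browser, whether any tbody exists.
$hasTbody = $xpath->query('//tbody')->length > 0;

// Drop the tbody steps only when the parsed document has none.
$path = $hasTbody ? $browserPath : str_replace('/tbody', '', $browserPath);

$cells = $xpath->query($path);
echo $path . " matched " . $cells->length . " node(s)\n";
```

The check is all-or-nothing. A page where some tables have a tbody and others don't would need the path adjusted table by table.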

Frank Farmer 2009-05-07 21:01:40

I apologize. The TBODY tags are there. I overlooked them when first looking at the source.
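With the tbody elements confirmed in the source, one way to narrow things down is to evaluate the path one step at a time and see where the match count drops to zero. The following is only a sketch: the sample markup and the path with a deliberately wrong index are hypothetical, and the naive split on "/" assumes no step contains a slash inside a predicate.

```php
<?php
// Hypothetical stand-in for the page; the real $html came from cURL.
$html = '<html><body><table><tbody><tr><td>a</td><td>b</td></tr></tbody></table></body></html>';

libxml_use_internal_errors(true);   // collect parser warnings instead of printing them
$dom = new DOMDocument();
$dom->loadHTML($html);
libxml_clear_errors();
$xpath = new DOMXPath($dom);

// Hypothetical path with one wrong index: the sample row has only two cells.
$path = '/html/body/table/tbody/tr/td[3]';

// Build up the path step by step and record how many nodes each prefix matches.
$report = [];
$prefix = '';
foreach (explode('/', ltrim($path, '/')) as $step) {
    $prefix .= '/' . $step;
    $report[$prefix] = $xpath->query($prefix)->length;
}

foreach ($report as $p => $count) {
    echo $count . "  " . $p . "\n";
}
```

The first prefix that reports zero nodes points at the step that does not match the document as PHP parsed it.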

2009-05-07 21:05:51
