ansaurus

Question

Answer 1

A:

If we remove @, then warnings from loadHTML due to invalid syntax of the html document to load . It may be because of few html tags causing problems.

try to replace

$doc->loadHTML($input);

with

$doc->loadHTML(htmlentities($input));

Hope this helps.

Yogesh 2010-09-25 14:27:43

" LoadHTMLfile: The function parses the HTML document in the file named filename. " File_get_contents loads the webpage into a string. So LoadHTML is the correct function to use.

reggie 2010-09-25 14:32:10

@reggie : i have edited the answer, please check now.

Yogesh 2010-09-25 14:47:36

I'm sorry, but that's just bogus. It would remove all the html tags from the string and would not give the DOM anything to work on.

reggie 2010-09-25 14:51:37

Answer 2

A:

JapanPro 2010-09-25 14:32:41

If you add "var_dump($doc);" to your code, you'll see that $doc is still an empty object. At least it is for me!

reggie 2010-09-25 14:35:38

The error supression is not the problem. The php manual specifically states that a string can be loaded even it is not valid.

reggie 2010-09-25 14:52:47

Answer 3

+1 A:

This is expected behaviour. To see the HTML, use DOMDocument::saveHTML() (or saveXML()).

salathe 2010-09-25 15:16:58

Thanks, I see you are right. So why can't I view the dom document object's contents? Is shows the behavior of a resource?

reggie 2010-09-25 15:20:28

Answer 4

+1 A:

The output is: object(DOMDocument)#3 (0) { }

Yes. That's what a var_dumped DOMDocument looks like.

If you want to look at the HTML representation of the content inside the document, saveHTML() on it. That spits out a cleaned up version of the HTML on Google's home page for me.

bobince 2010-09-25 15:17:52

ansaurus

tags:

views:

answers:

Dom LoadHTML Problem in PHP

related questions