ansaurus

Question

Read external HTML page and then find data within

Answer 1

A:

You are overcomplicating this. Simply load the page content and then search it with a suitable regex using preg_match(). That will do fine:

preg_match('~<tag id="foobar">(?P<content>.*?)</endtag>~is', $input, $matches);
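To show the whole flow, here is a minimal sketch that wraps the same pattern in a function. The helper name `extractBetween`, the sample markup, and the `div` markers are assumptions for illustration and are not from the question. It only works when the opening and closing markers are static strings that appear literally in the page.

```php
<?php
// Hypothetical helper: returns the text between two static markers, or null if not found.
function extractBetween(string $html, string $open, string $close): ?string
{
    // preg_quote() escapes regex metacharacters in the markers; '~' is the delimiter.
    // Flags: i = case-insensitive, s = let '.' match newlines. '.*?' is non-greedy.
    $pattern = '~' . preg_quote($open, '~') . '(?P<content>.*?)' . preg_quote($close, '~') . '~is';

    if (preg_match($pattern, $html, $matches) === 1) {
        return trim($matches['content']);
    }
    return null;
}

// Inline sample standing in for a downloaded page (assumed markup).
$html = '<html><body><div id="foobar"> Hello, world </div></body></html>';

echo extractBetween($html, '<div id="foobar">', '</div>'), "\n"; // prints "Hello, world"
```

For a real page, `$html` could come from `file_get_contents($url)`, which needs `allow_url_fopen` enabled and returns `false` on failure, so check the result before matching.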

Mikulas Dite 2010-05-19 20:50:20

Yes, you could use RegEx to parse HTML, [or not](http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags/1732454#1732454)

hemp 2010-05-19 21:41:57

Everybody knows that HTML is not a regular language. But the question was in fact: I have some text wrapped in static phrases, how do I find it? DOM parsing is much slower than a simple regex (and in PHP it is even worse than in other languages).

Mikulas Dite 2010-05-20 06:51:55

Answer 2

+1 A:

Maybe this could help: http://simplehtmldom.sourceforge.net/
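For comparison, the same kind of lookup can be sketched with a DOM parser. The example below does not use Simple HTML DOM; it swaps in PHP's bundled DOM extension (`DOMDocument` and `DOMXPath`) so that it runs without extra downloads. The helper name `findTextById` and the sample markup are assumptions for illustration, and the `id` value is assumed to contain no double quotes.

```php
<?php
// Hypothetical helper: returns the text content of the first element with the given id, or null.
function findTextById(string $html, string $id): ?string
{
    $doc = new DOMDocument();

    // Real-world HTML is often malformed; collect libxml warnings instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // Assumes $id contains no double quotes, since it is embedded in the XPath expression.
    $xpath = new DOMXPath($doc);
    $nodes = $xpath->query('//*[@id="' . $id . '"]');

    if ($nodes === false || $nodes->length === 0) {
        return null;
    }
    return trim($nodes->item(0)->textContent);
}

// Inline sample standing in for a downloaded page (assumed markup).
$page = '<html><body><div id="foobar"> Hello, <b>world</b> </div></body></html>';

echo findTextById($page, 'foobar'), "\n"; // prints "Hello, world"
```

Unlike a regex, this keeps working when attributes are reordered or the target contains nested tags, at the cost of parsing the whole document.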

Nort 2010-05-19 21:39:08

Answer 3

A:

If you use HTQL COM to query the page, the query is: <dd>1:tx

seagulf 2010-05-21 02:04:55
