ansaurus

Question

Answer 1

A:

// preg_* functions need pattern delimiters; '#' avoids having to escape '/'
$regexp = '#FrmKlant\.aspx.*">(.*)</a>\s(.*)<br>\s(.*)\s\s(.*)</td>#';
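
A minimal sketch of how a pattern like this might be run through preg_match_all. The sample markup is invented for illustration (the real page is not shown here), the pattern is wrapped in '#' delimiters as the preg functions require, and the dot in FrmKlant.aspx is escaped so it only matches a literal dot:

```php
<?php
// Hypothetical sample rows; the real page layout is an assumption.
// Note the two spaces before the city name, which the pattern's \s\s relies on.
$html = '<tr><td><a href="FrmKlant.aspx?id=1">Acme</a> Main Street 1<br> 1234 AB  Amsterdam</td></tr>' . "\n"
      . '<tr><td><a href="FrmKlant.aspx?id=2">Globex</a> Dock Road 7<br> 5678 CD  Rotterdam</td></tr>';

// Same capture groups as above: link text, street, postal code, city.
$regexp = '#FrmKlant\.aspx.*">(.*)</a>\s(.*)<br>\s(.*)\s\s(.*)</td>#';

// PREG_SET_ORDER groups the captures per match: $matches[0] is the first row.
$count = preg_match_all($regexp, $html, $matches, PREG_SET_ORDER);

foreach ($matches as $m) {
    echo $m[1], ' | ', $m[2], ' | ', $m[3], ' | ', $m[4], "\n";
}
```

Because '.' does not match a newline by default, each match stays within one line here. If several rows shared a line, the greedy '.*' would run across them, and lazy quantifiers ('.*?') would likely be needed.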

amphetamachine 2010-06-24 06:28:00

Answer 2

A:

It is usually not a good idea to extract information from HTML/XML with regular expressions. They are not well suited to nested structures. Whatever pattern you come up with will break badly if your "random html" parts are nasty enough, so use regular expressions only if you have very good control over the HTML.

Try a parser instead. (Google turned up http://simplehtmldom.sourceforge.net/, though I have not tried it.)

Jens 2010-06-24 06:28:06

Answer 3

+3 A:

Use PHP's DOM parser

Incomplete example, but something to get you started:

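// Parse the HTML into a DOM tree
$dom = new DOMDocument();
$dom->loadHTML($yourHtmlDocument);

// XPath uses forward slashes; '//' matches at any depth
$xPath = new DOMXPath($dom);
$elements = $xPath->query('//random/td/a'); // Or whatever your real path would be

// Print the text content of every matched node
foreach ($elements as $node) {
  echo $node->nodeValue;
}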
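
For something that runs as-is, here is a small self-contained sketch of the same approach. The sample markup, the XPath expression, and the $rows array are assumptions made for illustration, not taken from the original page:

```php
<?php
// Hypothetical sample markup; the real document structure is an assumption.
$html = '<table>'
      . '<tr><td><a href="FrmKlant.aspx?id=1">Acme</a> Main Street 1<br> 1234 AB Amsterdam</td></tr>'
      . '<tr><td><a href="FrmKlant.aspx?id=2">Globex</a> Dock Road 7<br> 5678 CD Rotterdam</td></tr>'
      . '</table>';

// Collect libxml parse warnings internally instead of printing them.
libxml_use_internal_errors(true);

$dom = new DOMDocument();
$dom->loadHTML($html);
libxml_clear_errors();

$xPath = new DOMXPath($dom);

// Every link directly inside a table cell whose href mentions FrmKlant.aspx.
$links = $xPath->query('//td/a[contains(@href, "FrmKlant.aspx")]');

$rows = [];
foreach ($links as $a) {
    $rows[] = [
        'name' => trim($a->nodeValue),       // link text
        'href' => $a->getAttribute('href'),  // link target
    ];
}

print_r($rows);
```

Filtering on the href attribute inside the XPath keeps the selection independent of how the surrounding table is laid out, which is the main advantage over a regular expression here.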

By the way, look at this.

Ivar Bonsaksen 2010-06-24 06:28:52


preg_match_all question
