ansaurus

Question

How to get this regular expression to match </span>

Answer 1

+2 A:

Don't use regex to parse HTML. Use the DOM instead, in particular DOMDocument's loadHTML method and getElementsByTagName('span'):

    $doc = new DOMDocument();
    $doc->loadHTML($htmlString);
    $spans = $doc->getElementsByTagName('span');
    foreach ($spans as $span) {
        // textContent is the text inside each <span>
        echo $span->textContent, "\n";
    }
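
If only the span with a particular class is wanted, an XPath query can narrow the selection. This is a rough sketch, not part of the original answer: the sample markup in $htmlString is made up, and the exact match on class="q1" assumes the span carries no other classes.

```php
<?php
// Hypothetical input standing in for the asker's HTML.
$htmlString = '<div><span class="q1">first</span><span class="q2">second</span></div>';

// Collect libxml warnings internally instead of printing them;
// fragments and sloppy markup often trigger them.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
$doc->loadHTML($htmlString);
libxml_clear_errors();

// Select only <span> elements whose class attribute is exactly "q1".
$xpath = new DOMXPath($doc);
$texts = [];
foreach ($xpath->query('//span[@class="q1"]') as $span) {
    $texts[] = $span->textContent;
}

print_r($texts);
```

Unlike the regex, this keeps working if the span contains nested tags or line breaks.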

meder 2010-06-20 00:46:49

Answer 2

+2 A:

Don't use regex to parse HTML. Use an HTML parser. See "Robust, Mature HTML Parser for PHP".

Jason 2010-06-20 00:47:21

Answer 3

A:

I think your regex is capturing more than you want because * is greedy: it matches as much as possible. Use *? instead, which matches as little as possible:

    preg_match('/<span class="q1">(.*?)<\/span>/', $gem, $match);
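
To see the difference side by side, here is a small sketch. The contents of $gem are an assumption (the question's actual input is not shown); the sample puts two spans on one line so the greedy pattern has something to overrun.

```php
<?php
// Hypothetical input: two spans on one line.
$gem = '<span class="q1">first</span> and <span class="q2">second</span>';

// Greedy: .* runs on to the LAST </span> in the line.
preg_match('/<span class="q1">(.*)<\/span>/', $gem, $greedy);

// Lazy: .*? stops at the FIRST </span>.
preg_match('/<span class="q1">(.*?)<\/span>/', $gem, $lazy);

echo $greedy[1], "\n"; // first</span> and <span class="q2">second
echo $lazy[1], "\n";   // first
```

Note that . does not match newlines by default, so if the span's content can wrap across lines the pattern would also need the s modifier.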

dvcolgan 2010-06-20 00:48:28

That works, thanks. The reason I don't want to use the DOMDocument class is that it's a very small piece of HTML and this code will only be run once; I'm collecting data to put into a database. No need to complicate things. :)

VIVA LA NWO 2010-06-20 00:52:44
