ansaurus

Question

HTML Parser to Get Content between Elements

Answer 1

+1 A:

If all the text except the event name is always the same, you can do it with just a substring, since the leading and trailing markup will always have the same lengths:

$event_name = substr($current_line, 98, -14);

That'll give you what's left over when you remove the first 98 characters and the last 14.
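To make the offsets less magic, here is a minimal sketch that derives them from the fixed markup rather than hard-coding them. The `$prefix` and `$suffix` strings are assumptions modeled on the sample markup in this thread; if each line also ends with a newline or other trailing character, the suffix is correspondingly longer.

```php
<?php
// Hypothetical fixed markup around the event name (assumed, based on the sample line).
$prefix = '<span class="cell CellFullWidth"><span class="SectionHeader">EVENT</span><br/><div class="Center">';
$suffix = '</div></span>';
$current_line = $prefix . 'Event Name1' . $suffix;

// Derive the offsets from the known markup instead of counting by hand.
$start = strlen($prefix);
$trim  = strlen($suffix);

// A negative length tells substr() to stop that many characters before the end.
$event_name = substr($current_line, $start, -$trim);

echo $event_name, "\n";
```

This only works while the surrounding markup never changes; any extra attribute or whitespace shifts the offsets.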

Chad Birch 2010-03-11 23:44:12

Answer 2

A:

You could use PHP's DOM manipulation functions.

Basically, you'd create a new DOMDocument, load the markup into it with loadHTML() or loadHTMLFile(), and then call $yourDomObject->getElementsByTagName('span') to get all the <span> elements.
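A rough sketch of that approach, assuming the input looks like the sample markup in this thread (the `$content` string and the variable names are made up for illustration):

```php
<?php
// Hypothetical input, modeled on the markup quoted in this thread.
$content = '<span class="cell CellFullWidth"><span class="SectionHeader">EVENT</span>'
         . '<br/><div class="Center">Event Name1</div></span>';

// Collect parser warnings internally instead of printing them for fragment input.
libxml_use_internal_errors(true);

$dom = new DOMDocument();
$dom->loadHTML($content);

// getElementsByTagName() returns a DOMNodeList that can be iterated with foreach.
$classes = [];
foreach ($dom->getElementsByTagName('span') as $span) {
    $classes[] = $span->getAttribute('class');
}

print_r($classes);
```

From each matching element you can then read nodeValue or walk to its children to reach the text you want.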

Josh 2010-03-11 23:47:36

Answer 3

+1 A:

Assuming:

All event names are in divs
The containing div must have the class "Center"
All divs with the class "Center" contain the name of an event

Here goes:

<?php

$content = '
<span class="cell CellFullWidth"><span class="SectionHeader">EVENT</span><br/><div class="Center">Event Name1</div></span>
<span class="cell CellFullWidth"><span class="SectionHeader">EVENT</span><br/><div class="Center">Event Name2</div></span>

';

$html = new DOMDocument();

$html->loadHTML($content);

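$divs = $html->getElementsByTagName('div');

// Initialize the result so print_r() works even when nothing matches.
$events = array();

foreach($divs as $div) {
    // Keep only divs whose class attribute is exactly "Center".
    if($div->getAttribute('class') === 'Center') {
        $events[] = $div->nodeValue;
    }
}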

print_r($events);
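If the divs might carry more than one class (for example class="Center Bold"), the exact string comparison above would miss them. One possible variant, swapping the loop for a DOMXPath query, is sketched below; the class-matching expression is a common XPath 1.0 idiom and not something taken from the question.

```php
<?php
// Same hypothetical sample markup as above.
$content = '
<span class="cell CellFullWidth"><span class="SectionHeader">EVENT</span><br/><div class="Center">Event Name1</div></span>
<span class="cell CellFullWidth"><span class="SectionHeader">EVENT</span><br/><div class="Center">Event Name2</div></span>
';

// Collect parser warnings internally instead of printing them.
libxml_use_internal_errors(true);

$html = new DOMDocument();
$html->loadHTML($content);

// Match any div whose space-separated class list includes "Center".
$xpath = new DOMXPath($html);
$query = '//div[contains(concat(" ", normalize-space(@class), " "), " Center ")]';

$events = array();
foreach ($xpath->query($query) as $div) {
    $events[] = $div->nodeValue;
}

print_r($events);
```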

chelmertz 2010-03-12 00:00:49

+1, nicely implemented

Josh 2010-03-12 00:03:17
