tags:

views:

111

answers:

1

I'm new to Perl and playing around screen scraping and regex. I'm trying to extract the "title" name of the following HTML block:

... title="The Valley Downs Chicago"><img class="vimg120" ...

My simple Perl code to do so is:

@htmlBlocks = split ("margin-bottom:20px",$content);
foreach $item (@htmlBlocks)
{
if (/\stitle="([^"]*)"/six)
{
 print $1;
}
}

but it doesn't print anything. while i'm troubleshooting it, i though i'd ask the experts if you see anything wrong or potentially problematic. thanks so much in advance for your help!

+9  A: 

By default // search uses $_ variable. If you want to search another variable then specify it before =~. Here is example:

@htmlBlocks = split ("margin-bottom:20px",$content);
foreach $item (@htmlBlocks) {
    if ($item =~ /\stitle="([^"]*)"/six) {
        print $1;
    }
}
Ivan Nevostruev
learning a lot! thanks so much!
BeachRunnerJoe