tags:

views:

42

answers:

3

Possible Duplicate:
I'm looking for a regular expression to remove a given (x)HTML tag from a string

I have a long HTML file and I need to remove all the <img /> tags inside it and all the <a><img /></a> anchors.

What I'm thinking of is writing a PHP script that does the job. But each image and link has different number attributes so I don't know how I can do this neatly. Any help would be strongly appreciated.

A: 

try <img[^>]+/>

joni
+1  A: 

Use a DOM parser such as PHP Simple HTML DOM Parser or PHP DOM

webbiedave
A: 

Thanks for the answers guys, this is the final solution that I was looking for.

$text = preg_replace("/<a[^>]+\><img[^>]+\><\/a>\n/i", "", $text);

$text = preg_replace("/<img[^>]+\>/i", "", $text);
Jon Doe
It's easy to construct valid HTML that makes those regexes misbehave.
tchrist