ansaurus

Question

Inserting multiple links into text, ignoring matches that happen to be inserted

Answer 1

+1 A:

Using Regexes to process HTML is always risky business. You will spend a long time fiddling with the greediness and laziness of your Regexes to only capture text that is not in a tag, and not in a tag name itself. My recommendation would be to ditch the method you are currently using and parse your HTML with an HTML parser, like this one: http://simplehtmldom.sourceforge.net/. I have used it before and have recommended it to others. It is a much simpler way of dealing with complex HTML.

SimpleCoder 2010-09-13 21:40:27

I couldn't figure out how that library you mentioned would help me with this specific problem.

mattalexx 2010-09-22 02:20:42

You would have used it to parse the HTML and access the DOM. There, you could perform the operations you want on the DOM explicitly.

SimpleCoder 2010-09-22 16:20:48

Answer 2

A:

I ended up using preg_replace_callback to replace all existing links with placeholders. Then I inserted the new glossary term links. Then I put back the links that I had replaced.

It's working great!

mattalexx 2010-09-22 02:22:07

ansaurus

tags:

views:

answers:

Inserting multiple links into text, ignoring matches that happen to be inserted

related questions