ansaurus

Question

Match only backticks not inside a <code> block with Regex.

Answer 1

+3 A:

I don't think regular expressions are a good tool for this, but it can be done if you assume that the code tags aren't nested:

`(?:(?!</?code>)[^`])*`(?!(?:(?!<code>).)*</code>)

This means:

`(?:(?!</?code>)[^`])*`       : Match something in backticks unless it
                                contains <code> or </code> or a backtick...
(?!(?:(?!<code>).)*</code>)   : unless it is followed by a </code>
                                without a <code> first.

See the regular expression in action at rubular.

Mark Byers 2010-07-10 20:27:55

Perfect. The only change I would make is to replace `.` with `[\s\S]` to handle multi-line `<code>` blocks. I know they aren't the best tool, but I was curious to see how it would be done. No worries, it won't be popping up in a project. =)

Aaron Harun 2010-07-10 20:45:27

Instead of using `[\s\S]` you could simply set the `s` modifier (`PCRE_DOTALL`)

nikic 2010-07-11 08:44:49

ansaurus

tags:

views:

answers:

Match only backticks not inside a <code> block with Regex.

related questions