ansaurus

Question

I have HTML comments being wrapped in <li> and <p> tags :(

Answer 1

A:

Something like this?

$final = preg_replace("/<li><p>(<!--.*-->)<\/p><\/li>/", "$1", $original);

Zed 2009-08-01 19:06:11

Answer 2

A:

EDIT: following Zed's and your comments I've done some testing and this is what you should use:

$final = preg_replace('/<li><p>[\s]*?&lt\;!--(.*?)--&gt\;<\/p><\/li>/m', "<!--$1-->", $z);

Here is a breakdown of the RE:

<li><p>

this is obvious

[\s]*?

because you have a few spaces and a newline between the <li> and the comment. We want to match as little whitespace as possible, so we use the non-greedy *? (it should work with * as well)

&lt\;

the ; is escaped here; it is not a special character in a regex, so the backslash is optional but harmless

!--(.*?)--

again we use *? so that we match only this one comment (otherwise, if the same markup appeared again later on the line, it would match from the first occurrence to the last one)

&gt\;<\/p><\/li>

same as above

/m'

the m (multiline) modifier. I am not sure it is needed: it only changes how ^ and $ match, and \s already matches newlines, but it does no harm and the pattern works with it

Nir Levy 2009-08-01 19:17:29
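
To make the breakdown above concrete, here is a minimal sketch that runs the pattern from this answer against a small sample. The sample markup in $z is hypothetical (it is not taken from the asker's page) and assumes the comment arrives entity-encoded as &lt;!-- ... --&gt;.

```php
<?php
// Hypothetical input: an entity-encoded comment wrapped in <li><p>,
// with a newline and some spaces before it.
$z = "<ul>\n<li><p>\n   &lt;!-- Ingredients --&gt;</p></li>\n<li><p>Flour</p></li>\n</ul>";

// The pattern from the answer, unchanged. The replacement rebuilds a real
// HTML comment around the captured text.
$final = preg_replace('/<li><p>[\s]*?&lt\;!--(.*?)--&gt\;<\/p><\/li>/m', '<!--$1-->', $z);

echo $final, "\n";
// Prints:
// <ul>
// <!-- Ingredients -->
// <li><p>Flour</p></li>
// </ul>
```

Only the list item that contains the encoded comment is unwrapped; the ordinary item is left alone. Note that . does not match a newline by default, so a comment spanning several lines would need the s modifier as well.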

no luck... http://www.whatcouldicook.com/recipes/1177

bluedaniel 2009-08-01 19:22:56

So now we can only hope for not having a > sign in the comment ;)

Zed 2009-08-01 19:32:52

Daniel, it's not working because your HTML source is incorrect. The comments have &lt; and &gt; on the sides instead of < and >.

Zed 2009-08-01 19:33:54

@Daniel, Zed is right, please do not use what I wrote. @Zed, you are right, I should have used the non-greedy *?, but I don't have any way of testing this now. Sorry.

Nir Levy 2009-08-01 19:38:20

so $final = preg_replace("/<li>( wont work?

bluedaniel 2009-08-01 20:44:55

Anyone care to hazard a guess? It shouldn't be too difficult to preg_replace two tags?

bluedaniel 2009-08-02 01:04:21

thank you so much!! What would i do without Stack?

bluedaniel 2009-08-02 10:15:12

Answer 3

A:

@Zed:

Let's be more careful:

$final = preg_replace("/<li><p>(<!--.*?-->)<\/p><\/li>/", "$1", $original);
# prefer .*? over .* unless you specifically want greedy matching
# .*? matches as little as it can
# .* matches as much as it can

even better:

$final = preg_replace("/<li><p>(<!--[^\-\>]+-->)<\/p><\/li>/", "$1", $original);
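# [^\-\>]+ matches a run of characters that are neither - nor >,
# so it needs less backtracking and should run faster,
# but it will not match a comment whose text itself contains - or >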

Just trying to advocate better regex practice. Hope this helps.

vulcan_hacker 2009-08-10 05:01:35
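
The difference between the three variants discussed in this thread (greedy .*, lazy .*?, and the negated character class) can be sketched on a single line of input. The sample string is hypothetical, chosen so that it contains two wrapped comments, one of which has a hyphen in its text.

```php
<?php
// Hypothetical input: two wrapped comments on one line; the second contains a hyphen.
$original = '<li><p><!-- a --></p></li><li><p>keep</p></li><li><p><!-- b-c --></p></li>';

// Greedy: .* runs to the LAST "--></p></li>", so only the outermost wrapper is removed.
$greedy = preg_replace('/<li><p>(<!--.*-->)<\/p><\/li>/', '$1', $original);

// Lazy: .*? stops at the FIRST "--></p></li>", so each comment is unwrapped separately.
$lazy = preg_replace('/<li><p>(<!--.*?-->)<\/p><\/li>/', '$1', $original);

// Negated class: cannot cross a "-" or ">", so the "b-c" comment is not matched at all.
$negated = preg_replace('/<li><p>(<!--[^\->]+-->)<\/p><\/li>/', '$1', $original);

echo $greedy, "\n";  // <!-- a --></p></li><li><p>keep</p></li><li><p><!-- b-c -->
echo $lazy, "\n";    // <!-- a --><li><p>keep</p></li><!-- b-c -->
echo $negated, "\n"; // <!-- a --><li><p>keep</p></li><li><p><!-- b-c --></p></li>
```

On this input only the lazy version unwraps both comments and leaves the ordinary list item intact, which suggests it is the safer default when comment text may contain hyphens.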

Well, the FINAL winning statement was: preg_replace('/<li>[\s]*?!--(.*?)--<\/p><\/li>/m', "", $original); Can it be improved in the same manner?

bluedaniel 2009-08-16 23:00:17

There is no point in using [\s]*?; use \s* instead. Square brackets are for a set of characters, such as whitespace or digits: [\s\d] or [\s0-9]. To match more than a single character, follow the set with + (one or more), * (zero or more), or ? (zero or one). Adding ? after a quantifier, as in *?, makes it match as little as possible. That matters for .*?, but it makes little sense for \s*? here, because the whitespace is followed by fixed text either way. So use \s+ (at least one whitespace character), \s* (zero or more), or \s? (zero or one) here: preg_replace('/<li>\s*!--(.*?)--<\/p><\/li>/m', "", $original);

vulcan_hacker 2009-08-17 05:34:07
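
As a rough check of the simplification suggested above, the sketch below compares three spellings of the pattern on the same input. Both the sample string and the &lt; / &gt; entities inside the patterns are assumptions for illustration; the patterns quoted in the comments above seem to have lost some characters, so this is not a reconstruction of the asker's exact statement.

```php
<?php
// Hypothetical input: whitespace and a newline before an entity-encoded comment.
$original = "<li><p>\n  &lt;!-- note --&gt;</p></li>\n<li><p>Salt</p></li>";

$patterns = [
    '/<li><p>[\s]*?&lt;!--(.*?)--&gt;<\/p><\/li>/m', // lazy character class, with m
    '/<li><p>\s*&lt;!--(.*?)--&gt;<\/p><\/li>/m',    // plain \s*, with m
    '/<li><p>\s*&lt;!--(.*?)--&gt;<\/p><\/li>/',     // plain \s*, no modifier
];

// Delete the wrapped comment with each pattern and collect the results.
$results = [];
foreach ($patterns as $pattern) {
    $results[] = preg_replace($pattern, '', $original);
}

// All three entries are the same string: "\n<li><p>Salt</p></li>"
var_dump($results);
```

All three patterns give the same result on this input, which is consistent with the advice that [\s]*? can be written as \s* and that the m modifier is not doing any work in a pattern without ^ or $.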

Why can't I format my comment? Can the moderators do that?

vulcan_hacker 2009-08-17 05:37:03
