ansaurus

Question

Answer 1

A:

Why don't you write it like this:

$str = 'bcs >Hello >If see below!';
$repstr = preg_replace('/>If see below[^,\.<]*/','',$str);
echo $repstr;

Peter Stuifzand 2009-05-14 13:43:55

Because what I want is to start from the first capitalized character or number after >.

Shore 2009-05-14 13:45:50

Answer 2

A:

This might be a good alternative to what you have. The problem with your regexp is that instead of selecting what you want, you are selecting what you don't want and replacing it with an empty string. The best approach, in my opinion, is to select what you want, which is what the code below does. If the pattern matches, you end up with whatever the first sub-pattern matched; otherwise you get your string back unchanged.

$str = 'bcs >Hello >If see below!';
$repstr = preg_replace('/^([\w]+ >[\w]+).*?see below.*?$/i', '$1', $str);
var_dump($repstr);
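To check the "otherwise you get your string back" behavior, here is a minimal sketch. The second input string is a made-up example that is not part of the question; it is only there to show the no-match case.

```php
<?php
// Same pattern as above: keep only the first sub-pattern when the whole string matches.
$pattern = '/^([\w]+ >[\w]+).*?see below.*?$/i';

// Matching input: the entire string is replaced by capture group 1.
$matched = preg_replace($pattern, '$1', 'bcs >Hello >If see below!');

// Hypothetical non-matching input (no "see below"): preg_replace returns it unchanged.
$unmatched = preg_replace($pattern, '$1', 'bcs >Hello >If nothing here');

var_dump($matched);   // string(10) "bcs >Hello"
var_dump($unmatched); // string(27) "bcs >Hello >If nothing here"
```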

I hope this helps.

partoa 2009-05-14 13:59:03

Sorry, what I want to do is exactly this: replace everything that starts at the "first capitalized character or number after >" and ends with "see below[^,\.<]*" with an empty string.

Shore 2009-05-14 14:35:15

Answer 3

+2 A:

I think the problem is that you're misinterpreting how a non-greedy quantifier acts. Once it's in operation, yes, it stops earlier than it would otherwise. But it isn't aware of what comes before it (or of the text that comes later, either). It's only concerned with its current position. Hence, the regular expression you posted will match all of:

">Hello >If see below!"

Let's see how this works:

/>[A-Z0-9].*?see below[^,\.<]*/

The regex first looks for ">" in "bcs >Hello >If see below!", and finds the first one, which is the one right before "Hello". Ok, let's check the next part of the expression:

[A-Z0-9]

The next char is a H, which matches the pattern [A-Z0-9]. Still good! Next:

.*?

Now we match non-newline chars, as few as possible, until we reach the first point where the rest of the expression, "see below[^,\.<]*", can match. If we had used a plain greedy quantifier, we would match through multiple cases of "see below[^,\.<]*" until we reached the last possible one. (So if your string had continued on and contained other text matching that pattern, it would have captured that as well.) The non-greedy quantifier doesn't mean that your whole pattern will return the smallest possible match of all possible matches in the string. It only dictates how that particular quantifier behaves; the match still starts at the leftmost position where it can succeed, which here is the first ">".
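The walkthrough above can be reproduced with preg_match. This is only a sketch: the $long string is a hypothetical input, invented here to contrast the lazy and greedy quantifiers, and the comma in it is there so that the trailing [^,\.<]* stops after the first "see below".

```php
<?php
$str = 'bcs >Hello >If see below!';

// Lazy .*? still starts at the FIRST ">" that is followed by [A-Z0-9];
// it only stops expanding at the earliest "see below" from that start.
preg_match('/>[A-Z0-9].*?see below[^,\.<]*/', $str, $lazy);
echo $lazy[0], "\n"; // >Hello >If see below!

// Hypothetical longer input with two "see below" occurrences.
$long = 'x >A one see below, >B two see below!';

// Lazy: stops at the first "see below" after the starting ">".
preg_match('/>[A-Z0-9].*?see below[^,\.<]*/', $long, $lazyLong);
// Greedy: runs on to the last "see below" in the string.
preg_match('/>[A-Z0-9].*see below[^,\.<]*/', $long, $greedyLong);

echo $lazyLong[0], "\n";   // >A one see below
echo $greedyLong[0], "\n"; // >A one see below, >B two see below!
```

In both cases the match begins at the first ">"; the quantifier only changes where it ends.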

You might want to try the following pattern then:

/>[A-Z0-9][^>]*?see below[^,\.<]*/
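A short sketch of that pattern applied to the string from the question. Because [^>] cannot step over another ">", an attempt starting at the first ">" fails and the match starts at the second one instead.

```php
<?php
$str = 'bcs >Hello >If see below!';

// [^>]*? may not cross a ">", so the match cannot begin at ">Hello".
$pattern = '/>[A-Z0-9][^>]*?see below[^,\.<]*/';

preg_match($pattern, $str, $m);
echo $m[0], "\n"; // >If see below!

$repstr = preg_replace($pattern, '', $str);
var_dump($repstr); // string(11) "bcs >Hello "
```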

Hopefully this clears it up!

patjbs 2009-05-14 16:33:27

Thank you for your reply, but this won't work for me, because it fails in this case: $str = 'bcs <>Hello <>If <br> see below!'; I want to have 'bcs <>Hello <' after processing.

Shore 2009-05-15 13:06:09

You might try elaborating the context of your question more then, and you might get some better answers.

patjbs 2009-05-15 15:54:15

Here is the solution:

$str = 'bcs <>Hello <>If <br> <br> see below!';
$repstr = preg_replace('/>[A-Z0-9][^>]*(>[^A-Z0-9]*)*see below[^,\.<]*/','',$str);
echo $repstr;

Thank you for your attention on this problem :)
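As a sanity check, the pattern from this comment can be run against the three inputs that appear in the thread. This is a sketch; the $cases array and its expected values are assembled here from the strings quoted above, not taken from the original post.

```php
<?php
$pattern = '/>[A-Z0-9][^>]*(>[^A-Z0-9]*)*see below[^,\.<]*/';

// Input => expected result, using the strings quoted in the thread.
$cases = [
    'bcs >Hello >If see below!'             => 'bcs >Hello ',
    'bcs <>Hello <>If <br> see below!'      => 'bcs <>Hello <',
    'bcs <>Hello <>If <br> <br> see below!' => 'bcs <>Hello <',
];

$results = [];
foreach ($cases as $input => $expected) {
    $results[$input] = preg_replace($pattern, '', $input);
    // Print each result next to whether it equals the expected string.
    var_dump($results[$input], $results[$input] === $expected);
}
```

The group (>[^A-Z0-9]*)* lets the match continue past a ">" only when that ">" is not immediately followed by a capital letter or digit, which is why tags such as <br> are skipped while ">Hello" and ">If" still act as possible starting points.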

Shore 2009-05-15 16:35:24

php non-greedy regex problem
