ansaurus

Question

REGEXP to convert any 3 chars or less word to wordVVV

Answer 1

A:

You're pattern's not detecting or grouping things right.

Use \w for word-characters and standard parenthesis instead of square brackets, and you're not evaluating PHP code in the replacement, you're simply referring to captured text segments, so don't need the e flag:

$pattern = '\b(\w{1,3})\b';
$text = mb_ereg_replace($pattern, '\0VVV', $text, 'm');

Alternatively, use preg_replace with the unicode flag:

$text = preg_replace('/\b\w{1,3}\b/um', '\0VVV', $text)

If you need to cater for arabic and right-to-left characters, you need to us unicode character properties instead of \w and \b (\w doesn't match letters from all languages, and \b only matches between \w\W and \W\w - which are both broken wrt. non-latin languages.)

Try this intead:

$text = preg_replace('/(?

(and again cos I can't tell whether I need to encode < or not)

$text = preg_replace('/(?<!\PL)(\pL{1,3})(?:\PL)/um', '\1VVV', $text);

searlea 2009-09-03 15:33:24

Nop, something is still wrong, check the question, I added an actual code snippet that shows the problem.

Itay Moav 2009-09-03 17:38:35

I've editted the ansewr to include examples using `\pL` and `\PL` with negative look-ahead and look-behind assertions substituting for the latin-only `\b` word-boundary detection.

searlea 2009-09-03 19:33:57

Answer 2

A:

This should match what you want?

\b(?<Match>\w{1,3})\b

Chad 2009-09-03 17:49:52

ansaurus

tags:

views:

answers:

REGEXP to convert any 3 chars or less word to wordVVV

related questions