ansaurus

Question

How to match the character '&' and replace it in php

Answer 1

+2 A:

I think you're looking for html_entity_decode.

Dominic Rodger 2009-07-31 12:01:31

That would turn it in a non-breaking space character, not a space.

David Dorward 2009-07-31 12:03:35

I am in a way looking for just that, but am just worried that some function actually does htmlentities() call for me before returning the output. Is it not a security issue to run html_entity_decode on a string? But I am also interested to do it with some regular expression matching.

2009-07-31 12:11:59

-1 It would convert any character reference and not just ` `.

Gumbo 2009-07-31 14:27:02

@Gumbo - just re-read the question, and I still think (the original question at least) that's what the OP asked for. Maybe I'm being thick though.

Dominic Rodger 2009-07-31 14:56:00

Answer 2

+2 A:

Take a look at html_entity_decode function.

Kuroki Kaze 2009-07-31 12:01:38

That would turn it in a non-breaking space character, not a space.

David Dorward 2009-07-31 12:02:36

You can run over the string and replace U+00A0 with U+0020 afterwards.

Joey 2009-07-31 12:04:23

This was answer to a second question, about other entities :)

Kuroki Kaze 2009-07-31 12:05:12

-1 It would convert any character reference and not just ` `.

Gumbo 2009-07-31 14:20:40

Answer 3

+1 A:

str_replace should replace that part of the text as it doesn't take regular expressions in account, so there is some other problem i guess

dusoft 2009-07-31 12:02:30

Answer 4

A:

I believe the function you're looking for is http://us2.php.net/manual/en/function.urldecode.php urldecode

Rob 2009-07-31 12:03:09

The string is encoded with an HTML entity, not URL encoding. And he asked for a space, not the decoded version of the non-breaking space entity.

David Dorward 2009-07-31 12:04:51

Answer 5

+1 A:

<?php
   $string = "<p>Hello,& n b s p ;world</p>"; # Remove the spaces here - Stackoverflow bug doesn't let me enter the normal string.
   $string = str_replace("& n b s p ;", " ", $string);
   print $string;
?>

This works for me. Perhaps you were expecting it to modify the string in place instead of returning the modified version?

David Dorward 2009-07-31 12:04:03

Tried fixing the source but failed, it appears stackoverflow has a bug!

Dominic Rodger 2009-07-31 12:08:09

Lepidosteus 2009-07-31 14:24:14

@Lepidosteus - That works for normal text but not for code blocks. is rendered as but is rendered as a non-breaking space.

David Dorward 2009-07-31 15:29:31

Answer 6

+4 A:

Are you just doing this?

str_replace("&nbsp", " ", $mystr);

Or do you do this?

$mystr = str_replace("&nbsp", " ", $mystr);

Both str_replace and preg_replace return a value, they don't change the string in-place.

Aistina 2009-07-31 12:05:34

No, i was doing as you have printed, that is, collecting what was returned as output.

2009-07-31 12:10:36

Answer 7

A:

Have you tried:

$text=html_entity_decode(str_replace('& nbsp;',' ',$text));

[remove the space between the ampersand and nbsp: it's due to Stack Overflow's formatting]

It'll swap the no-breaking-spaces with normal spaces and then decode any other remaining html entities.

Richy C. 2009-07-31 13:06:09

Answer 8

A:

What you actually need is an HTML filter based on a proper HTML parser so you can let only specified bits and pieces of HTML be passed through by your script.

Sinan Ünür 2009-07-31 14:06:37

Answer 9

A:

Look at HTML Purifier. Give it a whitelist of allowed tags/attributes, and it will filter everything for you.

Gordon 2009-07-31 14:16:02

Answer 10

A:

Since the trailing semicolon may be obmitted, you might want consider using a regular expression:

preg_replace("/&nbsp[;]?/", " ", $str)

You can replace [;]? by ;?. But Stack Overflow seems to replace &nbsp‍; (this is written with a ZERO WIDTH JOINER U+200D) so I used [;]?.

Gumbo 2009-07-31 14:23:33

ansaurus

tags:

views:

answers:

How to match the character '&' and replace it in php

related questions