ansaurus

Question

Answer 1

A:

If you are using PHP, you just need to use a regular expression to replace anything that matches PHP code.

The following statement will remove the PHP tag:

preg_replace('/^<\?php.*\?\>/', '', '<?php $db1 = new ps_DB() ?><p>Dummy</p>');

If it doesn't find any match, it won't replace anything.

jeph perro 2010-07-15 18:28:34

Answer 2

+2 A:

 <?php
 function filter_html_tokens($a){
    return is_array($a) && $a[0] == T_INLINE_HTML ?
      $a[1]:
      '';
 }
 $htmlphpstring = '<a>foo</a> something <?php $db1 = new ps_DB() ?><p>Dummy</p>';
 echo implode('',array_map('filter_html_tokens',token_get_all($htmlphpstring)));
 ?>

As ircmaxell pointed out: this would require valid PHP!

A regex route would be (allowing for no 'php' with short tags. no ending ?> in the string / file (for some reason Zend recommends this?) and ofcourse an UNgreedy pattern):

preg_replace('/<\?.*?(\?>|$)/', '',$htmlphpstring);

Wrikken 2010-07-15 18:31:19

Just note that you may not get valid HTML out of the regex solution... `<?php $foo='?>'; $bar = 'something';?>foo` will yield `'; $bar='something'; ?>foo`. The sort of it, is there's no perfect solution... Combine each to get a "best"...

ircmaxell 2010-07-15 19:10:29

Indeed, no perfect solution. If the actual problem can be solved higher up so our though up kludges don't have to be used it would be far preferable.

Wrikken 2010-07-15 19:19:05

Answer 3

A:

Well, you can use DomDocument to do it...

function stripPHPFromHTML($html) {
    $dom = new DomDocument();
    $dom->loadHtml($html);
    removeProcessingInstructions($dom);
    $simple = simplexml_import_dom($d->getElementsByTagName('body')->item(0));
    return $simple->children()->asXml();
}

function removeProcessingInstructions(DomNode &$node) {
    foreach ($node->childNodes as $child) {
        if ($child instanceof DOMProcessingInstruction) {
            $node->removeChild($child);
        } else {
            removeProcessingInstructions($child);
        }
    }
}

Those two functions will turn

$str = '<?php echo "foo"; ?><b>Bar</b>';
$clean = stripPHPFromHTML($str);
$html = '<b>Bar</b>';

Edit: Actually, after looking at Wrikken's answer, I realized that both methods have a disadvantage... Mine requires somewhat valid HTML markup (Dom is decent, but it won't parse foo<?php echo $bar). Wrikken's requires valid PHP (any syntax errors and it'll fail). So perhaps a combination of the two (try one first. If it fails, try the other. If both fail, there's really not much you can do without trying to figure out the exact reason they failed)...

ircmaxell 2010-07-15 18:35:23

Good point, with invalid PHP mine would indeed fail. Added it to the answer for good measure.

Wrikken 2010-07-15 19:01:25

ansaurus

tags:

views:

answers:

How to remove php code from a string?

related questions