What is the best practice for repairing malformed XML files with PHP? For example, a CDATA section may contain illegal characters. Should I use regular expressions, or run some Linux command-line tools?

+1  A: 

Tidy

Tidy is a binding for the Tidy HTML clean and repair utility which allows you to not only clean and otherwise manipulate HTML documents, but also traverse the document tree.

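// Specify configuration
$config = array(
    'indent'     => true,   // indent the output for readability
    'input-xml'  => true,   // parse the input as XML rather than HTML
    'output-xml' => true,   // write the result as well-formed XML
    'wrap'       => false,  // do not wrap long lines
);
// Parse and repair the file
$tidy = new tidy();
if (!$tidy->parseFile('sample.xml', $config)) {
    die('Could not read sample.xml');
}
$tidy->cleanRepair();
// Output the repaired document
echo $tidy;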
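
Tidy may not be able to fix every problem, so it can be worth checking the repaired output before using it. The following is only a minimal sketch, assuming the tidy and dom extensions are both enabled; the `$broken` sample string and the UTF-8 encoding are illustrative assumptions, not part of the original question.

```php
<?php
// Hypothetical malformed input, used only for illustration.
$broken = '<root><item>first</item><item>second</root>';

// Repair the string with the same XML options as above.
$tidy = new tidy();
$tidy->parseString($broken, array(
    'input-xml'  => true,
    'output-xml' => true,
    'wrap'       => 0,
), 'utf8');
$tidy->cleanRepair();
$repaired = tidy_get_output($tidy);

// Collect libxml parse errors instead of emitting PHP warnings.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
if ($doc->loadXML($repaired)) {
    echo "Repaired document is well-formed.\n";
} else {
    // Report whatever the XML parser still rejects.
    foreach (libxml_get_errors() as $error) {
        echo 'Line ' . $error->line . ': ' . trim($error->message) . "\n";
    }
    libxml_clear_errors();
}
```

If the check still reports errors, the remaining damage (for example, illegal characters inside CDATA) probably has to be cleaned up separately before parsing.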
Mads Hansen
thanks Mads, exactly what I need :)
Ain