I'm storing a serialized PHP array in MySQL. Its key => value pairs hold the information used to build jQuery UI tabs on a website.
The string stored in the MySQL database looks like:
a:2:{i:0;a:2:{i:1;s:9:"Info";i:2;s:643:"<h2><strong>This section is about foo</strong></h2><p><strong>Lorem ipsum ...";}i:1;a:2:{i:1;s:14:"More Info";i:2;s:465:"<p>Lorem ipsum ...";}}
(This is not a valid serialized array or valid HTML, because I truncated the lengthy content for formatting reasons.)
I would like to feed this content to Sphinx (the full-text indexer) for site search. Sphinx basically grabs the contents of the database and indexes what it finds, subject to the configuration options you specify. Is there a good way to get either MySQL or Sphinx to strip out the serialization information and the HTML tags, so that only the plain text gets indexed?
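To make the goal concrete, here is a rough PHP sketch of the transformation I'm after, in case it has to happen on the application side instead. The function name tabs_to_plain_text is made up for illustration, and the sketch assumes PHP 7+ and that the stored value only ever contains nested arrays of strings. The result would presumably be saved somewhere Sphinx can read it (for example a separate plain-text column), which is also an assumption on my part.

```php
<?php
// Hypothetical helper (not part of any library): turn the serialized
// tabs array into one plain-text string suitable for indexing.
function tabs_to_plain_text(string $serialized): string
{
    // Only arrays and strings are expected, so refuse to instantiate objects.
    // The @ silences the notice/warning PHP emits for malformed input.
    $tabs = @unserialize($serialized, ['allowed_classes' => false]);
    if (!is_array($tabs)) {
        return ''; // malformed or unexpected data
    }

    $parts = [];
    array_walk_recursive($tabs, function ($value) use (&$parts) {
        if (!is_string($value)) {
            return;
        }
        // Add a space before each tag so that text from adjacent
        // elements (e.g. </h2><p>) does not fuse into one word.
        $text = strip_tags(str_replace('<', ' <', $value));
        // Turn entities such as &amp; back into plain characters.
        $text = html_entity_decode($text, ENT_QUOTES | ENT_HTML5, 'UTF-8');
        // Collapse runs of whitespace left behind by the removed tags.
        $text = trim(preg_replace('/\s+/u', ' ', $text));
        if ($text !== '') {
            $parts[] = $text;
        }
    });

    return implode(' ', $parts);
}

// Example with made-up content, built with serialize() so the lengths are valid.
$sample = serialize([
    [1 => 'Info',      2 => '<h2><strong>About foo</strong></h2><p>Lorem &amp; ipsum</p>'],
    [1 => 'More Info', 2 => '<p>Dolor</p>'],
]);
echo tabs_to_plain_text($sample), "\n";
```

This works at write time in PHP, but I'd prefer a way to have MySQL or Sphinx do it at index time so I don't have to keep a second copy of the text in sync.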