This one's driving me nuts . . .
I have a bunch of MS Word files that a client wants displayed on his web site. I've converted them to HTML using "Save as Web Page" -- and yes I know that this produces lousy HTML but other methods I've tried lose the links to the imbedded images.
For the most part, I can use PHP to clean up the display but one item has me completely baffled: All single and double quotes are coming through as various letters with diacritics (accents) and I can't figure out how to detect them and convert them to the correct HTML entities. For example: Õ (O tilde)should be single-quote, Ò (O grave) should be open double-quote, Ó (O acute) should be close double-quote. I've tried htmlentities, iconv and a bunch of other methods with no luck. I welcome suggestions.
Thanks
Mark