views:

205

answers:

3

I have an asp.net page that exports some data to Microsoft Word 2003. The source of the data is what users have typed into an ajax control toolkit HtmlEditor on an input page. All works well unless the user has pasted text from a Word document into the HtmlEditor.

The html that is copied from Word looks like this:

<p class="MsoBodyText" style="margin: 0in 0in 0pt"><font color="#000000"><br />\r\nThe Blah Blah Blah of Southern California’s blah blah qualify for a blah of “Rating” with a “hold” status.&nbsp;</font></p>

When the content is rendered in Word, it looks like this:

The Blah Blah Blah of Southern California’s blah blah qualify for a blah of “Rating†with a “hold†status.

Any help on this? I have no problem when I force the HTML into a div and show it on the page. It's only on the export to Word that it gets messed up. This happens whether I paste the Word text right into the HtmlEditor or use the Paste From MS Word (with cleanup) button.

Thanks. Andrew.

+1  A: 

I never thought I would ever read the phrase "exports some data to Microsoft Word". Fail.

Your program is creating the Word doc programmatically, correct? It looks like you have a binary error on single quotes and double-quotes. How are you creating the Word doc? Interop library?

Yoenhofen
A: 

This is a text encoding problem, and your "html that is copied from Word" is wrong. You've used single and double quotes (ASCII characters 39 and 34, or hex 0x27 and 0x22 respectively), while Word is using smart quotes. They're getting garbled during the copy and paste between Word and the HTMLEditor, and then appearing as the wrong character encoding when pasted back to Word.

If you save the text from the HTMLEditor and look at it with a hex viewer, you'll see the problem immediately.

I can't help you with the "ajax control HTMLEditor" and reconfiguring it to fix this, as I'm not familiar with it.

Ken White
Thanks. I appreciate the help.
andypoi
A: 

Same problem to me. Yes, using c# code and REsponse.Write I am pushing word file from server. When I open it, I am seeing the special characters. These are coming for ul, li bullets.

for character • coming symbols are • And for character - coming symbols are –

Any ideas?

-Praveen.

Rare Solutions