ansaurus

Question

Character Support Issue - How to Translate Higher ASCII Characters to Lower ASCII Characters

Answer 1

A:

How big is the range of these input characters? 256 (i.e. each character fits into a single byte)? If so, it wouldn't be hard to implement a 256-value lookup table. I haven't toyed with BASIC in years, but basically you'd DIM an array of 256 bytes and fill it with translated values, e.g. the entry for 'a' would get 'a' (since it's OK as is), but the 150th entry would get a hyphen.
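
A rough sketch of that lookup table, written in Python rather than BASIC. It assumes the input bytes are Windows-1252, where byte 150 (0x96) is an en dash; the names `TABLE` and `to_plain` are made up for illustration.

```python
# Build a 256-entry lookup table: entry i holds the replacement for byte value i.
# The four overrides use Windows-1252 positions (an assumption about the input).
table = bytearray(range(256))   # identity: every byte maps to itself
table[0x93] = ord('"')          # left smart quote  -> plain double quote
table[0x94] = ord('"')          # right smart quote -> plain double quote
table[0x96] = ord('-')          # byte 150, en dash -> hyphen
table[0x97] = ord('-')          # em dash           -> hyphen
TABLE = bytes(table)


def to_plain(data: bytes) -> bytes:
    """Translate each byte through the 256-entry lookup table."""
    return data.translate(TABLE)
```

Any byte without an override passes through unchanged, so accented letters would still need their own entries if the target is strictly 7-bit.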

Arthur Kalliokoski 2009-08-07 15:40:48

Answer 2

A:

I tried

System.Text.Encoding.ASCII.GetString(System.Text.Encoding.GetEncoding(1251).GetBytes(text))

But what I got was question marks instead of an intelligent translation. That is, the long dash should become a regular dash, and smart quotes should become regular quotes.
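
For illustration, here is the same effect in Python, plus one way to get the "intelligent" translation by spelling the mapping out. This is only a sketch: `PUNCTUATION` is a hypothetical table covering the characters mentioned above, not a complete list.

```python
# Sample text: smart double quotes, an em dash, and an accented e.
text = "\u201cquoted\u201d \u2014 caf\u00e9"

# Encoding straight to ASCII has no notion of a "closest" character,
# so everything outside ASCII collapses to '?'.
lossy = text.encode("ascii", errors="replace").decode("ascii")

# Hypothetical explicit map for the punctuation mentioned above.
PUNCTUATION = str.maketrans({
    "\u2018": "'", "\u2019": "'",   # single smart quotes
    "\u201c": '"', "\u201d": '"',   # double smart quotes
    "\u2013": "-", "\u2014": "-",   # en dash, em dash
})
plain = text.translate(PUNCTUATION)
```

Here `lossy` is `?quoted? ? caf?`, while `plain` keeps the accented letter and only replaces the punctuation.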

Will Rickards 2009-08-07 15:49:40

Answer 3

A:

The code below seems to work for converting long dashes to short dashes and smart quotes to regular quotes, and it matches the content type my HTML pages declare. However, it converts all the accented characters to question marks, which is not what the Text version of the clipboard has. So I'm closer; I just think I have the target encoding wrong.

<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

System.Text.Encoding.ASCII.GetString(System.Text.Encoding.GetEncoding("iso-8859-1").GetBytes(m_arrFolderDesc(intIndex)))

Edit: I found the correct target encoding for my purposes, which is 1252.

System.Text.Encoding.GetEncoding(1252).GetString(System.Text.Encoding.GetEncoding("iso-8859-1").GetBytes(m_arrFolderDesc(intIndex)))
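
A small Python illustration of why 1252 keeps the accented characters where ASCII does not. Note the assumption: Python's codecs do not perform the dash and quote substitution reported above for .NET, so this only shows the accented-letter half of the story.

```python
accented = "caf\u00e9"                  # "cafe" with an accented e
raw = accented.encode("iso-8859-1")     # the accented e becomes byte 0xE9

# Bytes 0xA0-0xFF mean the same thing in both code pages, so accents survive.
as_1252 = raw.decode("cp1252")

# ASCII stops at 0x7F, so byte 0xE9 cannot be decoded and is replaced (U+FFFD).
as_ascii = raw.decode("ascii", errors="replace")

# The two code pages differ only in 0x80-0x9F, where 1252 has printable punctuation.
dash = b"\x97".decode("cp1252")         # em dash, U+2014
ctrl = b"\x97".decode("iso-8859-1")     # C1 control character, U+0097
```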

Will Rickards 2009-08-07 15:59:51

Answer 4

+1 A:

If you convert to a non-Unicode character set, you will lose some characters in the process. If the legacy app reading the data doesn't need to do any string transformations, you might want to consider using UTF-7 and converting it back once it returns to the Unicode world. This will preserve all special characters.
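
A minimal sketch of that round trip, shown in Python for illustration (the sample string is made up). The encoded form contains only 7-bit bytes, so a legacy app that merely stores or passes the data along should leave it intact.

```python
# Smart quotes, an em dash, and an accented e.
original = "\u201csmart\u201d \u2014 caf\u00e9"

encoded = original.encode("utf-7")    # bytes containing only 7-bit values
restored = encoded.decode("utf-7")    # back in the Unicode world
```

The trade-off is that the encoded text is not human-readable wherever it contains special characters, so this only suits a legacy app that treats the data as opaque.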

bdonlan 2009-08-07 17:50:29
