Do you think character encoding is table-looking up? | ansaurus

tags:

encoding

views:

83

answers:

1

Q:

Do you think character encoding is table-looking up?

For all the chararacter encodings that I have seen, they all have a code table, each code point corresponding to a character that should be represented/drawed. It seems to fall into the MVC pattern.

The wierd character are caused because the programs are looking up the wrong table for the given code points.

If so, it will be no matter for me to copy some wierd characters from MS notepad to Ultraedit, and choose a suitable encoding type in Ultraedit to read out them. But I cann't do so, why???? am I wrong?

A:

Well, it is a bit more than simply a code-table, as different encodings use different numbers of bytes per code-point; UTF8, for example, is variant length - making it particularly risky to get wrong.

Ultimately, if you try to open a file with the wrong encoding, then the data should be considered corrupt and suspect.

Marc Gravell 2009-04-14 04:09:38

related questions

C++ strings: UTF-8 or 16-bit encoding?

Can you Distribute a Ruby on Rails Application without Source?

Does C# have an equivalent to JavaScript's encodeURIComponent()?

Best he-aac encoder on linux ?

Changing the default encoding for String(byte[])

How to: Pass an ampersand in a lousy filename to a flash object on a webpage

What is the best way to change the encoding of text in PHP

How to send SOAP requests in ISO-8859-1 with Flex ?

Powershell: Setting Encoding for Get-Content Pipeline

Best practice: escape, or encodeURI / encodeURIComponent

Trying to convert bunch of jpegs into a movie

Problem with unicode String literal in unit test

ASP.NET WebService Returns Gibberish Characters When Throwing Exceptions

JavaFX video encoding

Real-time wmv video encoding in C#

PHP Include function outputting unknown char

Base64 Encoding Image

How can I generate a unique, small, random, and user-friendly key?

Problems while submitting a UTF-8 form textarea with JQuery/AJAX

How do you troubleshoot character encoding problems?

How does Ruby 1.9 handle character cases in source code?

How do you remove invalid hexadecimal characters from an XML-based data source prior to constructing an XmlReader or XPathDocument that uses the data?

Strange characters in PHP

Html entities inside asp.net page

Test serialization encoding