utf-8

HTML form: inputting a UTF-8 string in Firefox and IE

Hi, I built a form here. When I input UTF-8 data into it in Firefox 3.6.8 it looks like this: but it works fine in IE 8. It seems that while typing in (or filling) the input box, the characters are all uppercase, as if you were holding Shift while typing. Does anyone know what the problem with Firefox is? Edit: it is a simple form ...

Character encoding issues and PHP - what encoding is this?

I have a file on my computer that I wanted to copy into a MySQL table using PHP. When I open the file the contents look fine, like normal text - but, when I attempt to read the file using PHP or insert into a MySQL table, I get all sorts of funky characters. I thought perhaps it was a utf-8 issue, so I tried setting the header header('C...

Transliteration from Ethiopic (and others) to ASCII (ሀ -> ha; ü -> ue)

I am not yet very good at reading Amharic (Ge'ez / Ethiopic) letters. If I have a text in Ge'ez (Ethiopian) letters ( http://en.wikipedia.org/wiki/Ge%27ez_language ), I want to transliterate it to ASCII. When I go to http://www.addismap.com/am/ (a webpage in Amharic) with the Lynx text-mode browser, it shows me "edis map: yeedis ebeba kart...

[PHP] reading utf-8 content from mysql table

Hi, I have a MySQL table; its structure is here: Now I have one record in it: I want to read the content of this table and print it as HTML. This is my code: <?php include("config.php"); $global_dbh = mysql_connect($hostname, $username, $password) or die("Could not connect to database"); mysql_select_db...

How to strip out invalid UTF-8 characters in Ruby 1.9

I just upgraded from Ruby 1.8 to 1.9, and most of my text processing scripts now fail with the error invalid byte sequence in UTF-8. I need to either strip out the invalid characters or specify that Ruby should use ASCII encoding instead (or whatever encoding the C stdio functions write, which is how the files were produced) -- how would...

How to detect the character encoding of a file in Java?

Good day. Our application receives files from our users, and those files must be validated to confirm that they use an encoding we support (i.e. UTF-8, Shift-JIS, EUC-JP). Once a file is validated, we also need to save it in our system with its encoding as metadata. Currently, we're using JCharDet (which is a Java ...

Cleaning up UTF-16/CJK characters using PHP?

I have some files on my computer that are in UTF-16, though this seems to be because of errors or corruption of the files rather than intent - they're supposed to be plain English. I uploaded one of these (here). If I leave the encoding in Firefox (View > Character Encoding) at UTF-8 then I get tons of gibberish (see screenshot). If I chan...

SQLite3 on iPhone: Using Non-English Characters

Hi, I'm developing an iPhone application that uses non-English strings (Hebrew) in an sqlite3 DB. However, the application only displays my data when using English strings! Values with Hebrew characters return null. I've tried adding Hebrew strings through sqlite on the Mac, in the terminal - but sqlite won't even recognize the charact...

UTF-8 problem with IE and GET request using PHP

Using JavaScript, I'm setting the src of an iframe to a URL passing a GET variable, URL-encoded with unescape( encodeURIComponent( message ) ). With IE, this works the first time I load the iframe. The UTF-8 characters, as var_dump($_GET) reveals server-side, are okay. But on subsequent calls, all I get is garbage. On the other hand, if I j...

Problem converting ISO8859-1 to UTF-8 in PHP

Hello, I am attempting to convert an ISO-8859-1 string taken from a MySQL database to UTF-8 using PHP. However, when I use the utf8_encode function, it removes almost all of the apostrophes from the string (the exceptions seem to be within HTML fields). Thanks ...

Values in UTF-8 being encoded as NULL in JSON

I have a set of keywords that are passed through via JSON from a DB (encoded UTF-8), some of which may have special characters like é, è, ç, etc. This is used as part of an auto-completer. Example: array('Coffee', 'Cappuccino', 'Café'); I should add that the array as it comes from the DB would be: array('Coffee', 'Cappuccino', 'CafÃ...
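
A string like 'CafÃ...' is typical of UTF-8 bytes being read as if they were Latin-1. The following is only a minimal Python sketch of that effect, not a diagnosis of the question above: the sample word and the choice of Latin-1 as the wrong encoding are assumptions made for illustration.

```python
import json

# Hypothetical sample value; the excerpt above is truncated after 'CafÃ'.
original = "Café"

# Assumption: the UTF-8 bytes were misread as Latin-1 (ISO-8859-1).
utf8_bytes = original.encode("utf-8")      # b'Caf\xc3\xa9'
misread = utf8_bytes.decode("latin-1")     # two characters where 'é' should be

# Reversing the wrong decode step recovers the intended text.
repaired = misread.encode("latin-1").decode("utf-8")

# Correctly decoded text serializes to JSON without becoming null.
encoded = json.dumps(["Coffee", "Cappuccino", repaired], ensure_ascii=False)
print(misread, repaired, encoded)
```

If the round trip through Latin-1 and back restores the expected word, the data was probably decoded with the wrong charset somewhere between the database and the JSON encoder.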

Adding a line break to a UITextView

Hi, I have a UITextView that takes an NSString created with stringWithUTF8String. It gets its values from a database, and I want the text in the database to be rendered with line breaks within the text. I tried using "\n" to do this but it gets rendered as text. Doing this in my information page of the app as straight text worked but I...

Putting UTF-8 string in variable of type string

Well, sorry about the confusing title, but I'm having a slightly annoying problem with character encoding in C#.NET. I have a bunch of classes generated from WSDL files; these classes have methods which take string parameters that are then submitted to a remote web service. This remote web service expects all text input to be UTF-8 encod...

Unicode characters show differently in different browsers

So... I'm still in Unicode hell... New problem... On my computer, everything shows perfectly, in all browsers. On a co-worker's computer, same story: everything is good. Even in elinks and w3m on one of my Linux VPSes, all the exotic diacritics of Lithuanian and Latvian, and the Nordic letters, show perfectly. However, I have had a few ca...

Android newline in my EditText

Hi, I get some data like "line 1 \n line 2" from my remote server using JSON with UTF-8 encoding, and when I try to put the data in my EditText the newline \n doesn't work, despite unescaping the data with URLDecoder.decode() because I use databases. Any suggestions? All the best. ...

How do I get the length of a Perl Unicode string input via Ajax or CGI?

Okay, this should be really simple, but I have searched all over for the answer and also read the following thread: http://stackoverflow.com/questions/1326539/finding-the-length-of-a-unicode-string-in-perl It does not help me. I know how to get Perl to treat a string constant as UTF-8 and return the right number of chars (instead of byt...
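
The character-versus-byte distinction behind this question can be sketched in Python (not Perl, so this is an illustration of the idea only). The sample text is hypothetical, and it is assumed that the request body arrives as raw UTF-8 bytes.

```python
# Hypothetical input, standing in for bytes received from an Ajax/CGI request.
raw = "naïve café".encode("utf-8")

# Counting before decoding gives the number of bytes.
byte_len = len(raw)

# Counting after decoding gives the number of characters (code points).
char_len = len(raw.decode("utf-8"))

print(byte_len, char_len)
```

The two counts differ whenever the text contains non-ASCII characters, so the input has to be decoded before its length is measured in characters.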

PHP: Problems converting "’" character from ISO-8859-1 to UTF-8

I'm having some issues with using PHP to convert ISO-8859-1 database content to UTF-8. I am running the following code to test: // Connect to a latin1 charset database // and retrieve "Georgia O’Keeffe", which contains a "’" character $connection = mysql_connect('*****', '*****', '*****'); mysql_select_db('*****', $connection); mysql_s...
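
One detail that often matters here: "’" has no code point in ISO-8859-1, while Windows-1252 places it at byte 0x92. The Python sketch below shows the difference, under the assumption (not confirmed by the excerpt) that the stored bytes are really Windows-1252.

```python
# Assumption: the database holds the Windows-1252 byte 0x92 for the apostrophe.
raw = b"Georgia O\x92Keeffe"

# Decoded as ISO-8859-1, byte 0x92 becomes an invisible C1 control character.
as_latin1 = raw.decode("latin-1")

# Decoded as Windows-1252, byte 0x92 becomes U+2019 (right single quotation mark).
as_cp1252 = raw.decode("cp1252")

# Re-encoding the correctly decoded text gives valid UTF-8.
as_utf8 = as_cp1252.encode("utf-8")
print(repr(as_latin1), as_cp1252, as_utf8)
```

If the second decode produces the expected apostrophe, converting from Windows-1252 rather than ISO-8859-1 is likely the appropriate source encoding.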

How to handle user input of invalid UTF-8 characters?

I'm looking for a general strategy/advice on how to handle invalid UTF-8 input from users. Even though my webapp uses UTF-8, somehow some users enter invalid characters. This causes errors in PHP's json_encode() and overall seems like a bad idea to have around. W3C I18N FAQ: Multilingual Forms says "If non-UTF-8 data is received, an e...
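
Two common options are to reject invalid input or to substitute a replacement character. Here is a minimal Python sketch of both, not a recommendation for the PHP application in the question; the helper names `is_valid_utf8` and `scrub_utf8` are hypothetical.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def scrub_utf8(data: bytes) -> str:
    """Decode as UTF-8, replacing each invalid sequence with U+FFFD (hypothetical helper)."""
    return data.decode("utf-8", errors="replace")


# Usage: a lone 0xFF byte can never appear in valid UTF-8.
print(is_valid_utf8("£5".encode("utf-8")))  # valid input
print(is_valid_utf8(b"ok \xff"))            # invalid input
print(scrub_utf8(b"ok \xff"))
```

Rejecting keeps bad data out entirely, while replacing preserves the rest of the submission at the cost of losing the offending bytes.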

Encoding issue: £ pound symbol appearing as <?> symbol

My database field is set to utf8_general_ci and my website's encoding is UTF-8. The £ symbol is coming up as a black diamond with a question mark through the center. I tried changing it to &pound; in the database and it just outputted I tried a string replace: $row['Information'] = str_replace("£", "&pound;", $row['Information']); ...

C++, JsonCpp, libcurl and UTF-8 woes

I had some problems making libcurl work with C++ JsonCpp library and, after a lot of research, I came up with this code: int post(const string& call, const string& key, const string& value) { // (...) char* char_data=NULL; struct curl_slist *headers=NULL; headers = curl_slist_append(headers, "Content-Type: application/json; c...