utf-8

Different between Mysql Collation "Utf-8-general-ci" & Utf-8-unicode-ci ?

Dear All, What is the Different between Mysql Collation "Utf-8-general-ci" & Utf-8-unicode-ci ? regards, koko ...

MySql not storing foreign characters correctly

On SQLITE this way fine. I switch to mysql and instead of クイン i get ???. The column is TEXT or sometimes VARCHAR(255) (i believe its VARCHAR(255) in this case). How do i have mysql save the characters properly? ...

broken UTF-8 String ruby

While reading a file I get broken UTF-8 String error whenever I have the following in my file través if I change it to normal e then it works. Whats the way to fix this? error only happens if I do line.lstrp or any other function. Just printing the lines is ok. problem even happens when I try to match the string with regex. ...

How do I change a MySQL table to UTF-8?

I know there are many settings for a language for a table and a database. I already created the database. I believe when I created it, it was default/LATIN. I want to change everything-I mean...both the table and the database, to UTF-8. How can I do that? thanks. ...

How can I convert data encoded in WE8MSWIN1252 to utf8 for use in Python scripts?

This data comes from an Oracle database and is extracted to flatfiles in encoding 'WE8MSWIN1252'. I want to parse the data and do some analysis. I want to see the text fields but do not need to publish the results to any other system so if some characters do not get converted perfectly I do not have a problem with that. I just do not w...

Working with utf-8 files in Eclipse.

Quite straight forward question. Is there a way to configure Eclipse to work with text files encoded with utf-8 with and without the BOM? So far I've used eclipse with utf-8 encoding and it works, but when I try to edit a file generated by another editor that includes the BOM, Eclipse doesn't handle it properly, it 'shows an invisible c...

i18n / Markdown - Does Markdown support internationalization?

I'm building a CMS which needs to manage content in english, chinese, and spanish at a minimum. Do most markdown implementations handle UTF-8 encoded text? Is the Markdown language designed to be used with non-english languages? I'm currently using Markdown Extra by Michel Fortin. ...

How Do I Properly Format Characters From eBay and Amazon Catalog?

I am using the Amazon and eBay API via PHP5 and Curl to get product listings for a client. When I ask the API to give me a description or a category for a product, I'm sometimes getting something that looks like this: Estée Lauder When I go look at the XML download from the API, it shows this entry inside: <categoryName>Estée Laude...

Want to show <embed> and <object> tags from YUI editor as a text rather then a video.

I am using YUI rich text editor on my website (php/mysql), so that a user may enter textual matter/articles through it. But if a user copies and paste some embed code in the textarea, from any video sites like youtube, it should get saved as a text block and not as a playing video when showing the text content on the browser. Now YUI aut...

Saving FileSystemObject as UTF

how can i save the file in utf-8? Dim FSO, File Set FSO = Server.CreateObject("Scripting.FileSystemObject") Set File = FSO.OpenTextFile(Path,2,true,-1) File.Write(xml1) File.Close Set File = Nothing Set FSO = Nothing ...

This is my current way of writing to a file. However, I can't do UTF-8?

f = open("go.txt", "w") f.write(title) f.close() What if "title" is in japanese/utf-8? How do I modify this code to be able to write "title" without having the ascii error? Edit: Then, how do I read this file in UTF-8? ...

php reading mysql bit field returning weird character

I am using mysql_fetch_assoc($query), one of the bit field returns out to be , which is supposedly to be true. The problem is that I also need to output this to xml and it's an illegal xml character. the charset for the db table is utf-8. why does this happen? ...

How do I ignore the UTF-8 Byte Order Marker in String comparisons?

I'm having a problem comparing strings in a Unit Test in C# 4.0 using Visual Studio 2010. This same test case works properly in Visual Studio 2008 (with C# 3.5). Here's the relevant code snippet: byte[] rawData = GetData(); string data = Encoding.UTF8.GetString(rawData); Assert.AreEqual("Constant", data, false, CultureInfo.InvariantCu...

trouble with utf-8 chars & apache2 rewrite rules

I see the post http://stackoverflow.com/questions/2565864/validating-utf-8-in-htaccess-rewrite-rule and I think that is great, but a more fundamental problem I am having first: I needed to expand to handle utf-8 chars for query string parameters, names of directories, files, and used in displays to users etc. I configured my Apache...

how to properly display utf encoded characters on my utf-8 encoded page?

Hi guys I'm retrieving emails and some of my emails have utf encoded text. However even though my page is encoded as utf 8 - in some places when I try to out put utf text I get funny characters like : =?utf-8?B?Rlc6INqp24zYpyDYotm+INin2LMg2YXYs9qp2LHYp9uB2bkg2qnbjCDZhtmC?= =?utf-8?B?2YQg2qnYsdiz2qnYqtuSINuB24zaug==?= Whereas in othe...

QueryString encoding of non-ANSI characters in ASP.Net

I'm passing "Malmö" as a Request.QueryString parameter to a page. However, the code sees it as "Malm�" meaning that string comparison fails. All globalization settings are set to UTF-8 in web.config. Am I missing something? Edit: The querystring looks like this http://localhost/PageName/?courseKommun=Malm%F6 ...

Repair bad character due to encoding problem

Hi all, Recently we had an encoding problem in our system : If we had the string "æ" in our db ,it became "æ" on our web pages. Now this problem is solved, but the problem is that now we have a lot of "æ" in our database : users didn't see and validate pre-filled form with these characters. I found that If you read in utf 8 C3A6 yo...

Servlet receiving data both in ISO-8859-1 and UTF-8. How to URL-decode?

I've a web application (well, in fact is just a servlet) which receives data from 3 different sources: Source A is a HTML document written in UTF-8, and sends the data via <form method="get">. Source B is written in ISO-8859-1, and sends the data via <form method="get">, too. Source C is written in ISO-8859-1, and sends the data via <a...

Search tamil fonts in mysql and php

Hi, How to search tamil fonts and retrieve data from mysql and show the result using php? ...

How to decode Unicode escape sequences like "\u00ed" to proper UTF-8 encoded characters?

Is there a function in PHP that can decode Unicode escape sequences like "\u00ed" to "í" and all other similar occurrences? I found similar question here but is doesn't seem to work. ...