encoding

Encoding problem in phpMyAdmin. Specifically with records containing russian text.

I'm making small module/plugin for my future CMS/Framework. I wanted it from begining to be with multi-language support. I set my new database to utf8_unicode_ci (I read that it is more accurate, then utf8_general_ci) Set my files to UTF-8 without BOM Every page has in head Content-Type: text/html; charset=utf8 When I register new us...

Sitemap Encoding Woes

Hi, I'm having real trouble understanding the specification and guidelines on how to properly escape and encode a URL for submission in a sitemap. In the sitemap.org (entity escaping) examples, they have an example URL: http://www.example.com/ümlat.php&q=name Which when UTF-8 encoded ends up as (according to them): http://www.e...

How is the blob of text encoded? Base64? Something else?

I am having a string like this H0TCxoL9HSXXwlwXgBJAAAaiAAACBAW0AQMDAAEBBAIA what format is this? Below are some more strings. I thought as base64 but when i decode that i get strings like D????&??_?P@ H0TCxoL9HSbXwl+NUBBAAHISAABIVFRQLzEuMSAyMDAgT0sNCkRhdGU6IE1vbiwgMjEgSnVuIDIwMTAgMDk6NTI6NTMgR01UDQpTZXJ2ZXI6IE1pY3Jvc29mdC1JSVMvNi4wDQpQM...

Create an utf-8 csv file in Python.

I can't create an utf-8 csv file in Python. I'm trying to read it's docs, and in the examples section, it says: For all other encodings the following UnicodeReader and UnicodeWriter classes can be used. They take an additional encoding parameter in their constructor and make sure that the data passes the real reader or wri...

Visual Studio 2010 changes files to wrong encoding

I've been annoyed by this for a long time now. Somehow Visual Studio 2010 (VS2008 too IIRC) changes the encoding of my files from "Unicode (UTF-8 with signature) - Codepage 65001" to "Western European (Windows) - Codepage 1252". I have a faint idea that it's either ReSharper or VisualSVN, that's doing the character encoding changes, but...

How to view the XML documents sent to Solr

We're having problems with UTF-8 in Solr, and need to debug the documents that are sent for indexing. Can we do this somehow? Searched all logs I've found, enabled debug="1" in the app XML in the tomcat6 / Catalina directory. Even tried Wireshark, but no dice. Please please! Everything looks good on the PHP side, and this has been work...

Can a base64 encoded string contain whitespace?

Might a base64 encoded string contain whitespace? Specifically, could it contain whitespace at the end of the string? PS. I'm thinking about the whole "MySQL will trim trailing whitespace when storing strings in VARCHAR fields" here ;-) ...

Ruby - How to unpack a binary string into a normal string?

I'm opening a CSV file and reading values from it using File.open(filename). So I do something like this: my_file = File.open(filename) my_file.each_line do |line| line_array = line.split("\t") ratio = line_array[1] puts "#{ratio}" puts ratio.isutf8? end The issue I'm having is the values in line_array seem to be in a strange for...

What type of encoding is being used?

I am currently making a program in which one of its functions is to extract the HTML part of a Multipart email. I have accomplished that task fine however there is a type of encoding on some of the characters that I can't seem to figure out e.g. ',' into '=2C' ';' into '=3B' '=' into '=3D' and it also puts random '=' all over the pla...

How to encode h.264 video from webcam flash client

Hi Is there anyway to send Adobe Air based client can send webcam data via H.264 encoded video to the server - FMS or Red5 . Pls give us any advice . Thanks !! ...

What is the character encoding that could match this conversion: From "§" To "Ç"?

The line bellow is as an example of one of many files with wrong character encoding that I have; REAPRESENTA§AO VIA DTENTRY The correct presentation should be this: REAPRESENTAÇAO VIA DTENTRY There's more characters with wrong encoding. How do I correct this? ...

Extract Chinese text from Query string

Hi All, I need to extract chinese characters from the query string in a ASP.NET web application. When I tried it, I get "????" instead of the actual text. I know I need to decode it with UTF-8 but its doesnot work. I have used String text = System.Web.HttpUtility.UrlDecode(Request.QueryString["text"], System.Text.Encoding.UTF8); ...

Problem with i18n in JSF 2.0 (ukrainian & russian)

Hi! Goal: I want to use jsf`s i18n Scenario: creating resource bundle (utf-8) file info: file -I ./messages.properties ./messages.properties: text/plain; charset=utf-8 using it by faces-config: <application> <locale-config> <default-locale>uk_UA</default-locale> <supported-locale>en_US</supported...

Java and SEO friendly URLs: ©reate ╨ a valid http URL from a string composed by special caracters

I'm trying to extract SEO friendly URLs from strings that can contain special characters, letter with accents, Chinese like characters, etc. SO is doing this and it's translating this post title in java-and-seo-friendly-urls-reate--a-valid-http-url-from-a-string-composed-by-s I'm trying to do this in Java. I'm using this post solut...

How do I find the character encoding of a ms access database?

How do I find out what character encoding the tables in my MS Access 2003 database have? For example: Windows-1252 ISO 8859-1 US-ASCII ...

HTML5/H.264 format for desktop sharing

I am trying to live stream my desktop over http to one or more users. The immediate goal is just to share the desktop in real time in read only mode for several users. In future we might allow the users to control the desktop as well by mouse or keyboard. I have these queries in this context- Is HTML5 + H.264 right for this or do I ne...

Best HTML encoder for Delphi?

Seems like my data is getting corrupted when using HTTPapp.HTMLEncode( string ): String; HTMLEncode( 'Jo&hn D<oe' ); // returns 'Jo&am' This is not correct, and is corrupting my data. Does anyone have suggestions for VCL components that work better? Other than spending my time encoding all the cases http://en.wikipedia.org/wiki/Lis...

Encoding mailto link with body text

I'm trying to create a mailto link using PHP. Basically my function gets the body text from database and then creates the html tag like this: <a href="mailto:?subject=sample&body=sometexthere">send</a> well, the problem is that my body text may contain non standard characters, like accents and so, so i need to encode the body text bef...

How to encode strings like edit does in C#?

I am writing C# application that need to print data to POS STAR printer using RawPrinterHelper. My printing works fine except when I sending characters like ŽĆČĐŠ. Then I get wrong data printed out. Until now my research give me following results. If I in PowerShell open good old edit and in txt file write my characters (ŽĆČĐŠ) and s...

How to convert string with ISO-8859-1 characters in Oct to normal form in Java

I have string with ISO-8859-1 characters in Oct (\350, ...). How to convert them to normal form, for example "\350" -> "è" in Java? ...