encoding

UTF8 encoded text cones in a TDesC8 variable from Xml-parser in Symbian

Hi! I'm using Carbide.c++ 2.0 to create a S60 application that consumes my own webservice. I've used the Yahoo! Image Search example as a starting point and data is shipped back and forth nicely. Problems occur when there are non-english characters in the reply so the text presented to the user in the listbox is incorrect. The XML-docume...

How to write a global save hook for Emacs?

I usually have to deal with a lot of people sending me Windows files and I'm tired of having to convert them by hand, usually by doing: C-x RET f unix RET Is there any way to add a global hook (for all modes) to automatically do this on saves? ...

Ruby Email Client Recommendation

We are writing an email web client in Ruby to handle (potentially international) emails. I am looking for a high-level email library that supports retrieving emails, parsing email raw, decoding MIME, and converting input into UTF-8. Is there a library that you can recommend? ...

How to store arbitary text (including '\') in the Windows registry

I am storing some short user data strings in the Windows registry. This worked fine until one of my users tried to store a string containing a '\' character. This was then interpreted as a registry path delimiter. What is the best practice for avoiding this problem? Some sort of simple encoding? Note/ I am using the Qt QSetting class t...

Replace broken encoding

Is there a way in Zend Studio (6/7)/Eclipse to replace all broken characters, when you've accidentally saved a file with the wrong encoding? ...

What is proper encoding for converting a string to a byte array

I am having some sort of problem with encoding in my ASP.NET HTTPHandler, which uploads a file. The file content is passed in a hidden form variable from a ColdFusion web page which is using something called "ToBase64". In ColdFusion, the code used to place the file content into a form is as follows: <cffile action="readBinary" file="#...

Hibernate Oracle and Character Encoding

Im having an issue at the moment where my testing environment is not persisting the trademark character "™" in the database. When I enter in the form data, then submit it, I can see that on the server side, the request has the correctly encoded ™ character, but then when the call to hibernate's "saveOrUpdate()" method is called, the data...

Encoding apostrophe

i am building up a string on the server that is getting put into a javascript variable on the client. what is the best of encoding this to avoid any issues right now on the server i am doing something like this: html = html.Replace("'", "&#39;"); but i assume there is a more elegant fool proof way of doing stuff like this. ...

Mail problem with Hungarian characters

Hi, I have a multipart mail which is processed using async read on sockets. The problem is that instead of hungarian diacritics is displayed something like =ED which is the code in hexa of character. I suppose that is some encoding pb. I've tried several encoding from iso-8859-2 to utf7. This is a part from mail header: X-MS-Has-Attac...

Smart HTML encoding

I'm looking for the best way to do some sort of "smart" HTML encoding. For instance: From: <a>Next >></a> to: <a>Next gt;gt;</a> From: <p><a><b><< Prev</b></a><br/><a>Next >></a></p> to: <p><a><b>&lt;&lt; Prev</b></a><br/><a>Next gt;gt;</a></p> So only the non XML / HTML part of the text would be encoded as if HtmlEncode is called. A...

Unicode utf-8/utf-16 encoding in Python

In python: u'\u3053\n' Is it utf-16? I'm not really aware of all the unicode/encoding stuff, but this type of thing is coming up in my dataset, like if I have a=u'\u3053\n'. print gives an exception and decoding gives an exception. a.encode("utf-16") > '\xff\xfeS0\n\x00' a.encode("utf-8") > '\xe3\x81\x93\n' print a.encode("utf-8...

Sending pound signs from Delphi to C# web service

I am sending a large string from Delphi 5 to a C# web service, and I'm having lots of trouble with Pound (£) signs. I URLEncode the string from the Delphi side (which seems to convert them to '%A3'). When it reaches the C# web services it appears as '�'. I have tried changing the encoding of the string on the C# side by using a StreamRea...

Handling Character Encoding in URI on Tomcat

On the web site I am trying to help with, user can type in an URL in the browser, like following Chinese characters, http://localhost:8080?a=测试 On server, we get GET /a=%E6%B5%8B%E8%AF%95 HTTP/1.1 As you can see, it's UTF-8 encoded, then URL encoded. We can handle this correctly by setting encoding to UTF-8 in Tomcat. However...

Encoding problem - Win Xp vs Win 2003

There are 2 machines, one with Xp one with Win 2003. Both are Japanese versions! If you get the available encodings (via Regional Settings or programatically, Encoding.GetEncodings() ) this happens: on Xp, the resulting list is in Latin characters on Win 2003, the list is in Japanese characters Some guy at work though to persist t...

Searching a Unicode file using Python

Setup I'm writing a script to process and annotate build logs from Visual Studio. The build logs are HTML, and from what I can tell, Unicode (UTF-16?) as well. Here's a snippet from one of the files: c:\anonyfolder\anonyfile.c(17169) : warning C4701: potentially uninitialized local variable 'object_adrs2' used c:\anonyfolder...

Combining semacodes and steganography?

This is an idea I have been thinking about, but I do not quite know how to solve it. I would like to know if any solutions like this exists out there, or if you guys have any idea how this could be implemented. Steganography Steganography is basically the art of hiding messages. In modern days we do this digitally by e.g. modifying the...

Character Support Issue - How to Translate Higher ASCII Characters to Lower ASCII Characters

So I have an ASP.Net (vb.net) application. It has a textbox and the user is pasting text from Microsoft Word into it. So things like the long dash (charcode 150) are coming through as input. Other examples would be the smart quotes or accented characters. In my app I'm encoding them in xml and passing that to the database as an xml p...

Encoding a date into an ASCII barcode

I am investigating encoding of date information into a linear barcode that must be as short as possible. One idea we have had is encoding date information rather than using the ASCII characters for the date numerals. The earliest date to be encoded is 01-JAN-2009 Date must be encoded using printable ASCII characters ...

Linux vs. Windows: How does the console render unicode characters?

This is quite a low-level (low in the sense of "closer to the metal") question. I was wondering if any of you could point me to documentation, explanations, etc. of how, upon receiving a Unicode character (or any character code, but I'm particularly interested in the Unicode Standard) the console in Windows, good ol' cmd.exe (using, say...

Encoding in Streamreader in my silverlight application

Having trouble geting the encoding right in my silverlight application. I need support for western europe letters like æ,ø,å,â and so(Latin1??). But I can't get it right. What should be instead of SOMEENCODINGHERE? did try Encoding enc = Encoding.GetEncoding("Latin1"); but no names I used as param was recognized =( . If I use Encoding.U...