unicode

Is there a way to detect an alphanumeric Unicode symbol?

I have a Unicode string consisting of letters, digits and punctuation marks. Ho can I detect characters that are digits and letters (not necessarily ASCII) with a C++ standard library or Win32 API? ...

Unicode lowercase characters?

I read up someplace, that there are characters other than A-Z that have a lowercase equivalent, in Unicode. Which could these be, and why would any other character need an upper and lower case? ...

Loading Unicode Characters from XML and push into Form via AJAX

How can I load Unicode Characters (specifically a trademark symbol) from XML and push it into a Form via AJAX? When I push formelement.value += "\u2122" it works fine, but when I place "\u2122" into XML, load it via AJAX, and push it to the form element it does not work. Any ideas? Thanks ...

Python String Cleanup + Manipulation (Accented Characters)

I have a database full of names like: John Smith Scott J. Holmes Dr. Kaplan Ray's Dog Levi's Adrian O'Brien Perry Sean Smyre Carie Burchfield-Thompson Björn Árnason There are a few foreign names with accents in them that need to be converted to strings with non-accented characters. I'd like to convert the full names (...

How to insert a unicode character in a rich text box?

I'm writing a WPF application with a RichTextBox and a Toolbar (a VERY simple RichText Editor). Anyway, I want to put 6 unicode characters in the ToolBar that can be clicked and will be inserted at whatever point to the RichTextBox. This seems like something that should be fairly simple to do (without a lot of codebehind). Is it, or do ...

fopen non-ascii character error

i cannot use fopen on files includes in their name some characters (example : ş, ç, ı) how can i use fopen on these files ? i'm using vc++ 6 (i have to) and c language. when i was trying to use _wfopen it's never open any file. ...

Converting Greek characters to Unicode

Is there any easy way of converting a windows-1252 string into a Unicode one? ...

Python IRC bot and encoding issue

Currently I have a simple IRC bot written in python. Since I migrated it to python 3.0 which differentiates between bytes and unicode strings I started having encoding issues. Specifically, with others not sending UTF-8. Now, I could just tell everyone to send UTF-8 (which they should regardless) but an even better solution would be tr...

MySQL C# Text Encoding Problems

I have an old MySQL database with encoding set to UTF-8. I am using Ado.Net Entity framework to connect to it. The string that I retrieve from it have strange characters when ë like characters are expected. For example: "ë" is "ë". I thought I could get this right by converting from UTF8 to UTF16. return Encoding.Unicode.GetString(...

How do I get my form submission to understand unicode characters?

When some users of my ASP.NET app enter the following text Je bénéficie d’un coaching I get the following value in my Request.Form: Je b�n�ficie d�un coaching At first my page had the encoding set to Latin1, and I thought that changing it to Unicode (UTF8) using the following meta tag would help: <meta http-equiv="Content-...

"Delphi Fundamentals" in Delphi 2009

Hello, Has anybody used/converted "Delphi Fundamentals" in Delphi 2009? - http://fundementals.sourceforge.net/ I'm using Dictionaries (cArrays.pas,cDictionaries.pas,cStrings.pas,cTypes.pas) in my project and now i have some troubles on upgrading code. I'll be highly obliged if anybody can convert the above mentioned units in Delphi 200...

using pyodbc on linux to insert unicode or utf-8 chars in a nvarchar mssql field

I am using Ubuntu 9.04 I have installed the following package versions: unixodbc and unixodbc-dev: 2.2.11-16build3 tdsodbc: 0.82-4 libsybdb5: 0.82-4 freetds-common and freetds-dev: 0.82-4 I have configured /etc/unixodbc.ini like this: [FreeTDS] Description = TDS driver (Sybase/MS SQL) Driver = /usr/lib/odbc/libt...

Importing D7 DPR in Delphi 2009: "Invalid character in text content"

I'm trying to open old projects generated in Delphi 7 with Delphi 2009. On some projects, this causes the following error message from the IDE: "An invalid character was found in text content" The affected projects are working fine in the old IDE. How can I solve this? ...

Reading a Unicode file in C and passing contents as ASCII via sockets

Hey, I've being trying to figure this out, but nothing seems to work. We have an application that reads thousands of transactions files using the normal "fopen fgets etc", which we parse using normal C functions "strstr, strchr, etc" and return back a normalized char *. However, now we need to read some files that are in Unicode (from ...

How to call Microchip PIC USB DLL with Delphi 2009

Not a question, just a statement to help anyone else who might spend hours getting the Microchip PIC USB DLL to work properly with Unicode. It expects several strings, and although these are simple PAnsiChar, getting the right combination of DLL call convention took me ages. There are a lot of people on the 'net using Delphi (non-Unicod...

Is it possible to convert UTF32 text to UTF16 using only Windows API ?

I'm trying to find converting UTF-32 text to/from any code page is possible using the Windows API alone. I cannot used CLR to do this task. The Code page identifiers page at Microsoft at http://msdn.microsoft.com/en-us/library/dd317756(VS.85).aspx lists UTF-32 as being available to only managed applicatiosn. ConvertStringTo/FromUnicode...

Would std::basic_string<TCHAR> be preferable to std::wstring on Windows?

As I understand it, Windows #defines TCHAR as the correct character type for your application based on the build - so it is wchar_t in UNICODE builds and char otherwise. Because of this I wondered if std::basic_string<TCHAR> would be preferable to std::wstring, since the first would theoretically match the character type of the applic...

How to get string Objects instead Unicode ones from JSON in Python?

I'm using Python (Python 2.5.2 on Ubuntu 8.10) to parse JSON from (ASCII encoded) text files. When loading these files with json (simplejson), all my string values are cast to Unicode objects instead of string objects. The problem is, I have to use the data with some libraries that only accept string objects. Is it possible to get stri...

Joomla to Static HTML website

Hi, I have a Hindi magazine website hosted on Joomla. Though helpful from publishing point of view the site was a maintenance nightmare. Joomla is so much susceptible to hacker attacks. My host will often shut down my site due to bots attacking my website. Recently I relaunched the site as a new Wordpress based site on a different name....

Handling Unicode surrogate values in Java strings

Consider the following code: byte aBytes[] = { (byte)0xff,0x01,0,0, (byte)0xd9,(byte)0x65, (byte)0x03,(byte)0x04, (byte)0x05, (byte)0x06, (byte)0x07, (byte)0x17,(byte)0x33, (byte)0x74, (byte)0x6f, 0, 1, 2, 3, 4, 5, 0 }; String sCompressedBytes = ...