unicode

Unicode sql query in cx_Oracle

I have the following: ora_wet = oracle_connection() cursor = ora_wet.cursor() sqlQuery = u"SELECT * FROM web_cities WHERE cty_name = 'София'" cursor.execute(sqlQuery) sqlResult = cursor.fetchone() When I do this I get a TypeError: expecting None or a string on line 18, which is the cursor.execute(sqlQuery) call. If I make the query non-uni...

Outputting a Unicode character in C#

I'm new to programming and self-taught. I'm trying to output the astrological symbol for Taurus, which is supposed to be U+2649 in Unicode. Here is the code I'm using... string myString = "\u2649"; byte[] unicode = System.Text.Encoding.Unicode.GetBytes(myString); Console.WriteLine(unicode.Length); The result I'm getting is the numbe...
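The snippet in this excerpt prints the length of a byte array, not the symbol. A rough Python sketch of the same distinction follows, assuming only that System.Text.Encoding.Unicode is UTF-16 little-endian; the variable names are illustrative and not from the question.

```python
# One character (U+2649, TAURUS) versus its UTF-16LE byte encoding.
symbol = "\u2649"

# Encoding turns the one-character string into raw bytes.
encoded = symbol.encode("utf-16-le")

print(len(symbol))   # number of characters in the string
print(len(encoded))  # number of bytes in the encoded form

# Decoding the bytes gives back the original character.
print(encoded.decode("utf-16-le") == symbol)
```

Printing the string itself, rather than the length of its encoded bytes, is what would show the symbol, provided the console font and output encoding support it.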

Unicode to a Form with Javascript

Why, when I set a form field to the numeric character reference of a Unicode character with JavaScript, is the character not converted to its correct representation, although it works when I set the form directly from the HTML? <html> <head> <script language="javascript"> function test() { document.InputForm.TextBox.value = ...

Double byte Unicode shown as 2 characters by classic ASP

Problem with Unicode in ASP. These two addresses are created using the same function from a single DB entry. The only difference that I can see is that the top one is written to the page directly, while the bottom one is written via JS in an iframe to the parent page. This is what classic ASP outputs, and it is wrong: Fxxxx Åsbrink RSG connexion AB Baggäng...

"Lenient" regex matching of similar characters in C#/.Net

Is there a way to get .Net to positively match strings, even if some characters are not exactly the same? Examples of characters that should be considered to be similar could be: 'a'/'á' and 'í'/'i'. The Chrome browser find-as-you-type recognizes these characters as being equivalent. ...

DB2 Connect - Unicode Support from VB Application

I am trying to insert the '€' character into a DB2 database. The database runs on z/OS (v8.0) and my DB2 client version is 9.1 FP5. I am doing this over an ODBC connection via ADODB from Visual Basic or C# code, but a junk character gets inserted instead of the '€' symbol. Is there any option to set the CodePage 1252 at connection level...

TNTUnicode components problem

I downloaded the TNT Delphi Unicode components and tried to install them. I use CodeGear Delphi 2007. Then I opened TntUnicodeVcl_D70.dpk and tried to build it. The following message appeared: The following changes must be made before this package can be compiled. Choose OK to apply these changes and continue compiling Remove TntAc...

How to store unicode data in a format that doesn't support utf-8

Okay, here's yet another character encoding question, demonstrating my ignorance of all things Unicode. I am reading data out of Microsoft Excel .xls files and storing it in ESRI shapefiles (.shp). For versions of Excel > 5.0, text in Excel files is stored as Unicode. However, Unicode (and specifically UTF-8 support for shapefiles is i...

Unicode Regular Expressions - Fails at 343 characters

Hi, I am using the regular expression below to weed out any non-Latin characters. I found that if I use a string longer than 342 characters, the function fails, everything aborts, and the website connection is reset. I narrowed it down to the \p{P} Unicode character property, which matches any punctuation character. Does ...

php encoding problem with unicode

Sorry, this is a rookie question, but I never had to handle this problem, I think. How can I get an HTML entity out of something like \u00e4, which stands for &auml; (ä)? Additional information (why I want to do that ^^): I have backslashes in the string for escaping reasons. When I call stripslashes I get something like "u00e4". To find...
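The conversion asked about here can be sketched in Python rather than PHP. This is only an illustration: the function name `escapes_to_entities` is hypothetical, and it assumes the input still contains the backslash form `\uXXXX` with exactly four hex digits.

```python
import re
from html.entities import codepoint2name

# Matches a literal backslash, "u", and four hex digits, e.g. \u00e4
_ESCAPE = re.compile(r"\\u([0-9a-fA-F]{4})")

def escapes_to_entities(text: str) -> str:
    """Replace \\uXXXX escapes with named or numeric HTML entities."""
    def replace(match):
        code_point = int(match.group(1), 16)
        name = codepoint2name.get(code_point)
        # Use the named entity when one exists, else a numeric reference.
        if name:
            return "&%s;" % name
        return "&#%d;" % code_point
    return _ESCAPE.sub(replace, text)

print(escapes_to_entities(r"M\u00e4dchen"))
```

Doing the replacement before any slash stripping keeps the `\u` marker intact, which is what makes the escapes findable.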

Byte to String Python 3 Websockets

Hey. I'm converting a Python 2.6.5 application to Python 3.1. I'm trying to convert a byte sequence received through socket.recv() into a string, by doing the following: str(temp, 'UTF-8', 'ignore') The problem is that unknown characters, such as \x00 and \xff (WebSockets characters), are removed ("ignored"). I do want to convert the byte sequence to a ...
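Two possible directions, sketched in Python 3. Both function names are hypothetical, and the 0x00 ... 0xFF framing is an assumption inferred from the bytes mentioned in the excerpt, not something the question confirms.

```python
def bytes_to_text_lossless(data: bytes) -> str:
    """Map every byte 0x00-0xFF to the code point of the same value.

    Latin-1 never fails and never drops bytes, so encoding the result
    back with latin-1 restores the original byte sequence.
    """
    return data.decode("latin-1")

def unframe(data: bytes) -> str:
    """Strip an assumed 0x00 ... 0xFF frame and decode the payload as UTF-8."""
    if data[:1] == b"\x00" and data[-1:] == b"\xff":
        data = data[1:-1]
    return data.decode("utf-8", "replace")

frame = b"\x00h\xc3\xa9llo\xff"
print(repr(bytes_to_text_lossless(frame)))
print(unframe(frame))
```

The first function keeps the marker bytes but does not interpret the UTF-8 payload. The second interprets the payload and removes the markers first.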

Broken encoding after postback

I have a query string with a parameter value that contains the Norwegian character å encoded as %e5. The page contains a form with an action attribute which is automatically filled by ASP.NET. When the URL is output into said attribute, it is printed with a full two-byte encoding: %u00e5. When posting back this seems to be ok when debug...

How do you convert posted "english" characters from international PCs in ASP.NET? (ex 2205)

I have a WebForm search page that gets occasional hits from international visitors. When they enter text, it appears to be plain ASCII a-z, 0-9, but the characters are printed in bold and my "is this text" logic can't handle the input. Is there any easy way in ASP.NET to convert Unicode characters that equate to A-Z, 0-9 into plain old text? Th...
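One possibility is that the visitors are typing compatibility variants of ASCII, such as fullwidth forms. The excerpt does not confirm this, so treat it as an assumption. If it holds, Unicode compatibility normalization (NFKC) folds those variants back to plain letters and digits. A Python sketch of the idea, with a hypothetical function name:

```python
import unicodedata

def fold_compatibility_forms(text: str) -> str:
    """Apply NFKC so compatibility variants become their plain equivalents."""
    return unicodedata.normalize("NFKC", text)

# Fullwidth "Abc123" (U+FF21, U+FF42, U+FF43, U+FF11..U+FF13)
fullwidth = "\uff21\uff42\uff43\uff11\uff12\uff13"
print(fold_compatibility_forms(fullwidth))
```

Text that is already plain ASCII passes through unchanged, so the fold can be applied to every input before the "is this text" check.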

Python unicode string with UTF-8?

I'm getting back from a library what looks to be an incorrect unicode string: >>> title u'Sopet\xc3\xb3n' Now, those two hex escapes there are the UTF-8 encoding for U+00F3 LATIN SMALL LETTER O WITH ACUTE. So far as I understand, a unicode string in Python should have the actual character, not the UTF-8 encoding for the character...
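A minimal sketch of one common repair, written for Python 3 (the excerpt uses Python 2 syntax). It assumes the library decoded UTF-8 bytes as Latin-1, which is consistent with the value shown. The function name `repair_mojibake` is hypothetical.

```python
def repair_mojibake(text: str) -> str:
    """Undo a wrong Latin-1 decode of UTF-8 data.

    Encoding with latin-1 turns each code point U+0000..U+00FF back into
    the byte of the same value; those bytes are then decoded as UTF-8.
    Strings that do not fit this pattern are returned unchanged.
    """
    try:
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text

title = "Sopet\xc3\xb3n"
print(repair_mojibake(title))
```

This treats the symptom only. If the library can be told the correct input encoding, that is the cleaner fix.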

Displaying other language characters in PyQt

Is there a way to display other language characters in PyQt4? And if there is, what's the approach/direction that I should take? Thanks in advance. ...

Unicode Woes! Ms-Access 97 migration to Ms-Access 2007

The problem is categorized in two steps: Problem Step 1. The problem boils down to this: the Access 97 db contains XML strings that are encoded in UTF-8. So I created a patch tool to convert the XML strings separately from UTF-8 to Unicode. In order to convert UTF8 strin...

Python unicode problems on Windows XP

I have the following Django view code that generates a CSV response from a database view: def _get_csv_stats(request, **filterargs): result = GlobalStats.objects.select_related().filter(**filterargs).values_list('user__username', 'user__first_name','use...

BeautifulSoup doesn't give me Unicode

I'm using Beautiful Soup to scrape data. The BS documentation states that BS should always return Unicode, but I can't seem to get Unicode. Here's a code snippet: import urllib2 from libs.BeautifulSoup import BeautifulSoup # Fetch and parse the data url = 'http://wiki.gnhlug.org/twiki2/bin/view/Www/PastEvents2007?skin=print.pattern' dat...

Goal: How to handle the Unicode characters present in the file name.

My goal is to handle the Unicode characters present in the file name, such that each Unicode character gets replaced by an underscore. Any help would be deeply appreciated ...
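A small Python sketch of that replacement. It assumes "Unicode characters" here means anything outside the ASCII range; the excerpt does not name a language or define the term, and the function name `ascii_safe_filename` is hypothetical.

```python
import re

# Any single character outside the 7-bit ASCII range.
_NON_ASCII = re.compile(r"[^\x00-\x7f]")

def ascii_safe_filename(name: str) -> str:
    """Replace each non-ASCII character in a file name with an underscore."""
    return _NON_ASCII.sub("_", name)

print(ascii_safe_filename("r\u00e9sum\u00e9.txt"))
```

Different names can collapse to the same result (for example, two names that differ only in their accented letters), so a real rename would need a collision check.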

Replace national characters with ASCII equivalent.

Is there any lib that can replace national characters with ASCII equivalents, like: "Cześć" to: "Czesc"? I can of course create a map: {'ś':'s', 'ć': 'c'} and use some replace function. But I don't want to hardcode all equivalents into my program, if there is some function that already does that. ...
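One approach that avoids a hand-written map is to decompose each character and drop the combining marks. Below is a sketch using Python's standard `unicodedata` module; the function name `strip_accents` is illustrative, and the excerpt does not say which language the asker uses.

```python
import unicodedata

def strip_accents(text: str) -> str:
    """Decompose characters (NFKD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

print(strip_accents("Cze\u015b\u0107"))
```

This only handles letters that have a decomposition. Characters such as 'ł' (U+0142) have none and pass through unchanged, so a small explicit map may still be needed for those.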