unicode

Unicode Code Point for Command Key Combinations

Hi. Can someone please tell me how to determine the Unicode code point of a multi-key combination that includes the "command" key? For example, if a user presses the "command" key and "1" key on the keyboard at the same time, what is the Unicode character representation for that? Maybe I'm searching on the wrong thing, but I am not...

Unicode Character Corrupting Text

I have a weird situation that has been puzzling me for weeks. My project, Textual, is an IRC client for Mac OS X. It works great except one problem. DCC SEND messages via the DCC protocol do not work properly. For some reason the first section of the IP address sent for which connections will be established is always cut off. For example...

LaTeX Question - Accents on characters

I refuse to believe that no one on Stack Overflow can help me! Tone marks above Chinese characters in LaTeX / Combining accents in Unicode. My aim is to put tone marks above Chinese characters in LaTeX, and Google does not seem to turn up the answer. Is it possible to use combining accents with Chinese characters or can they only b...

text with unicode escape sequences to unicode in python

Possible Duplicate: Conversion of strings like \\uXXXX in python Hi, suppose I have the string test '\\u0259' Note the escaped backslash. How do I convert it to the respective unicode string? ...
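
A minimal Python 3 sketch of one way to do this conversion, assuming the input is pure ASCII apart from the escape sequences (the `unicode_escape` codec treats its input bytes as Latin-1, so non-ASCII input would need different handling). The function name `unescape_unicode` is hypothetical, chosen only for illustration:

```python
def unescape_unicode(text):
    """Turn literal \\uXXXX sequences in an ASCII string into real characters."""
    # The unicode_escape codec decodes bytes, so encode the string first.
    # encode("ascii") raises UnicodeEncodeError if the input is not pure ASCII.
    return text.encode("ascii").decode("unicode_escape")


sample = "test \\u0259"  # a literal backslash followed by u0259
print(unescape_unicode(sample))
```

Note that this codec also interprets other backslash escapes such as `\n`, which may or may not be wanted.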

Getting the unicode value of a char in VB

How can I get the Unicode value of a char? For example, I know that I can do this with ASCII: i = Asc("a") // i == 97 (correct) What if I have a Unicode char though? i = Asc("•") // i == 149 (incorrect... should return 8226) Obviously the second example doesn't work since that character is not in the ASCII set. Is there an equival...
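
The two numbers in the question can be reproduced outside VB. The sketch below is in Python 3, not VB, and assumes the value 149 comes from the Windows-1252 code page, where the bullet is byte 0x95. It shows only the difference between a code point and a code-page byte, not the VB call itself:

```python
bullet = "\u2022"  # the bullet character from the question

code_point = ord(bullet)                 # Unicode code point
cp1252_byte = bullet.encode("cp1252")[0]  # byte value in Windows-1252

print(code_point)   # 8226
print(cp1252_byte)  # 149
```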

Character encoding problem

Greetings, I'm developing a project in C++ where I want to use characters like á é õ and ┌ ─ ┐ │ to draw a couple of nice frames. My question is what I should change in my code/project settings, since without any modifications the console just prints pseudo-random characters. I know that the above characters are defined ...

can't encode single quote (') using django's render_to_string

hello, i have a problem with django's render_to_string and encoding single quotes. ... = render_to_string('dummy.txt', {'request':request, 'text':text,}, context_instance=RequestContext(request))) why are only these quotes translated to '#39;' and all other special characters not? ...

Problem parsing unicode escape in a Java 6 String literal...?

Why does this compile in java 6 (Sun 1.6.0_16): System.out.println("\u000B"); ... but not this: System.out.println("\u000A"); On this program: public class Test { public static void main(String argv[]) { System.out.println("\u000A"); } } I get a Test.java:3: unclosed string literal System.out.println("\u000A"); What's ...

Unicode and console interpretation

I print some characters from a wide UTF-8 range to standard output in a Java application. My console is configured for UTF-8 support. My problem is that sometimes, when I decide to print 10 characters for example, I see fewer than 10 characters. I think this is because the console interprets some characters. ...

Add accent marks with custom colors in .Net's RichTextBox

Using the Text property of a WinForms RichTextBox, you can show a letter with an accent above it, in such a way that the letter is one character and the accent is another. That is, although it visually looks like one character, it actually consists of two. In Unicode terms, the letter is called a 'base character', while the accent is called 'c...

Objective C, unescape unicode characters

Does anyone have better solution or ideas than http://stackoverflow.com/questions/2099349/using-objective-c-cocoa-to-unescape-unicode-characters-ie-u1234 for unescaping unicode characters in NSString? ...

Removing non-english words from a sentence in python

Hi, I have written code which sends queries to Google and returns the results. I extract the snippets (summaries) from these results for further processing. However, sometimes these snippets contain non-English words, which I don't want. For example: /\u02b0w\u025bn w\u025bn unstressed \u02b0w\u0259n w\u0259n/ I only want the "uns...
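
One rough way to approach this, sketched in Python 3.7+ under the assumption that "non-English" can be approximated by "contains non-ASCII letters". That is a crude proxy: it would also drop English words such as "café". The helper name `keep_ascii_words` is hypothetical:

```python
def keep_ascii_words(snippet):
    """Keep only whitespace-separated tokens made of ASCII letters."""
    kept = []
    for token in snippet.split():
        # Trim surrounding punctuation such as the slashes in the example.
        word = token.strip("/.,;:!?()[]\"'")
        if word and word.isascii() and word.isalpha():
            kept.append(word)
    return " ".join(kept)


sample = "/\u02b0w\u025bn w\u025bn unstressed \u02b0w\u0259n w\u0259n/"
print(keep_ascii_words(sample))  # prints: unstressed
```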

Delphi XE Unicode dealing with nchar(1) fields

Hello I'm using a dbGo TADODataset in Delphi XE to read data from a SQL Server 2008 table. All the string fields are nvarchar or nchar. The nvarchar fields map to TWideStringField with no problems. The problem is with an nchar(1) field, this maps to a TStringField fine but not to a TWideStringField - when I configure the field to be T...

Java and unicode

Hi, I'm outputting a Java string using the format test\u00F3here. Instead of 'testóhere', only 'óhere' is output. 'test' is being removed from the string, and I don't know why this is occurring. Is it something to do with escaping the Unicode format? Thanks ...

Java regex match characters outside Basic Multilingual Plane

How can I match characters (with the intention of removing them) from outside the Unicode Basic Multilingual Plane in Java? ...
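
The question is about Java, where strings are sequences of UTF-16 code units and the details differ. The underlying idea (match the code point range U+10000 to U+10FFFF) can be illustrated with a Python 3 sketch. The names `NON_BMP` and `strip_non_bmp` are made up for this example:

```python
import re

# Code points U+10000..U+10FFFF lie outside the Basic Multilingual Plane.
NON_BMP = re.compile("[\U00010000-\U0010FFFF]")


def strip_non_bmp(text):
    """Remove every character whose code point is above U+FFFF."""
    return NON_BMP.sub("", text)


print(strip_non_bmp("a\U0001F600b"))  # prints: ab
```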

Inserting Unicode characters in SQL

Hi, I am trying to get Unicode strings into an SQL*Plus: Release 10.2.0.2.0 database but am having difficulties. If I use SQL*Plus and copy and paste the insert statement into the database, any special characters are inserted as ? or something like that. I then try to call an SQL file that has been encoded as UTF-8 and the outcome is th...

Problem with TextBox when typing Unicode text in Silverlight

Hi all. I am writing a Silverlight application. The UI language is Armenian. I have added a Unicode font to my application. In TextBlocks, text displays normally. But if I type text in a TextBox, it shows some symbols instead of the text I typed. If I copy/paste Unicode Armenian text, the TextBox shows it normally. The only problem is with typi...

trouble using xhtml2pdf with unicode

I've been trying to convert Hebrew html files without success; the Hebrew characters show up in the output PDF as black rectangles regardless of any encoding I tried. I tried some unicode test files included in the pisa distribution: pisa-3.0.33\test\test-unicode-all.html and \test-bidirectional-text.html . I ran xhtml2pdf from the c...

Cleaning and stripping of strings/HTML - Python

Hi folks, I have a set of questions to which I do not have answers. 1) Stripping lists of string input: 'item1, item2, \t\t\t item3, \n\n\n \t, item4, , , item5, ' output: ['item1', 'item2', 'item3', 'item4', 'item5'] Anything more efficient than doing the following? [x.strip() for x in l.split(',') if x.strip()] 2) Clea...
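
For the first part, the comprehension in the question calls `strip()` twice per item. A small variant that strips each piece only once could look like the sketch below. The function name `split_items` is hypothetical, and whether this is measurably faster is not something the excerpt establishes:

```python
def split_items(raw):
    """Split on commas, trim whitespace, and drop empty pieces."""
    # The generator strips each piece once; the list keeps the non-empty ones.
    stripped = (part.strip() for part in raw.split(","))
    return [item for item in stripped if item]


raw = "item1, item2, \t\t\t item3, \n\n\n \t, item4, , , item5, "
print(split_items(raw))
```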

Decoding Java's JSON Unicode values with PHP

In the past, I have seen different JSON-encoded values for the same string depending on the language used. Since the APIs were used in a closed environment (no 3rd parties allowed), we made a compromise and all our Java applications manually encode Unicode characters. LinkedIn's API is returning "corrupted" values, basically the ...