unicode

Facelets charset problem

Hi! In my earlier post there was a problem with JSF charset handling, but also the other part of the problem was MySQL connection parameters for inserting data into db. The problem was solved. But, I migrated the same application from JSP to facelets and the same problem happened again. Characters from input fields are replaced when in...

Unicode symbols coming wrong

Obviously, there must be something stupid I'm doing. The Unicode chart for subscripts and superscripts says #00B2 is superscript 2, but I get scrambled output. 0078 is x, but I get N, and 0120 is x. Am I reading the wrong manual? EDIT $x = 'N'; print html_entity_decode($x, ENT_NOQUOTES, 'UTF-8') . "\n"; ...
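
One possible reading of the symptoms above is a mix-up between decimal and hexadecimal notation: the code charts list code points in hexadecimal, so 0078 (hex) is x, while 78 read as decimal is N, and 120 (decimal) is x again. The question is about PHP; the following is only a minimal Python sketch of that arithmetic, not a diagnosis of the truncated code.

```python
import html

# Code points in the Unicode charts are written in hexadecimal.
hex_78 = chr(0x78)            # U+0078 LATIN SMALL LETTER X
hex_b2 = chr(0xB2)            # U+00B2 SUPERSCRIPT TWO

# The same digits read as decimal select different characters.
decimal_78 = chr(78)          # "N"
decimal_120 = chr(120)        # "x" (120 decimal == 0x78)

# Numeric character references: "&#...;" is decimal, "&#x...;" is hexadecimal.
from_decimal_ref = html.unescape("&#78;")    # "N"
from_hex_ref = html.unescape("&#x78;")       # "x"
superscript_two = html.unescape("&#xB2;")    # same character as chr(0xB2)
```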

Reading non-English HTML pages with C#

I am trying to find a string in Hebrew in a website. The reading code is attached. Afterward I try to read the file using StreamReader, but I can't match strings in other languages. What am I supposed to do? // used on each read operation byte[] buf = new byte[8192]; // prepare the web page we will be asking for HttpWebRe...

How to use Unicode in C++?

Assume a very simple program that: asks for a name; stores the name in a variable; displays the variable's content on the screen. It's so simple that it is the first thing one learns. But my problem is that I don't know how to do the same thing if I enter the name using Japanese characters. So, if you know how to do this in C++, please ...

How do I convert filenames from unicode to ascii

I have a bunch of music files on an NTFS partition mounted on Linux that have filenames with Unicode characters. I'm having trouble writing a script to rename the files so that all of the file names use only ASCII characters. I think that using the iconv command should work, but I'm having trouble escaping the characters for the 'mv' com...
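
As an alternative to iconv, a common way to reduce names to ASCII is to decompose accented characters and drop whatever is not ASCII. The sketch below is a Python illustration of that idea only; `to_ascii` is a hypothetical helper name, and characters with no ASCII decomposition (CJK, for example) are simply removed, which may produce empty or colliding names.

```python
import unicodedata


def to_ascii(name: str) -> str:
    """Return an ASCII-only approximation of a file name (hypothetical helper)."""
    # NFKD splits "é" into "e" plus a combining accent.
    decomposed = unicodedata.normalize("NFKD", name)
    # Dropping non-ASCII bytes removes the combining accents.
    return decomposed.encode("ascii", "ignore").decode("ascii")


print(to_ascii("Déjà Vu.mp3"))  # Deja Vu.mp3
```

The result could then be passed to `os.rename`, after checking that the new name is non-empty and not already taken.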

Is Embed Resource a good approach for a read only xml database?

I have an open source application (here). This application takes a character or a sentence and gives some Unicode information about it. I use the Unicode Character Database provided by Unicode.org, which is an XML document (130MB). At first I embedded this XML in my DLL, but I don't know whether that is a good approach, because DLL size growth just ...

How to set UCS2 in numpy?

I'm trying to build numpy 1.2.1 as a module for a third-party python interpreter (custom-built, py2.4 linux x86_64) so that I can make calls to numpy from within it. Let's call this one interpreter A. The thing is, the system-wide python interpreter (also py2.4, let's call it B) from the vendor is built with --enable-unicode=ucs4, while...

How to deal with Polish characters while using regex?

I have a street name such as KRZYWOŃ ANIELI. What should my regex be to allow this kind of expression? Currently I have a simple one which uses /^[a-zA-Z ]+$/ Kindly advise. ...
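
The pattern above rejects Ń because `[a-zA-Z]` covers only ASCII letters. The question does not say which regex engine is in use, so the following is just a Python sketch of one Unicode-aware equivalent: in Python 3 string patterns, `[^\W\d_]` matches any letter, including Polish ones. `STREET_NAME` is a hypothetical name.

```python
import re

# "Letters and spaces only", with letters taken from all of Unicode.
# [^\W\d_] means: a word character that is neither a digit nor "_".
STREET_NAME = re.compile(r"^(?:[^\W\d_]| )+$")

accepts_polish = bool(STREET_NAME.match("KRZYWO\u0143 ANIELI"))  # Ń is U+0143
accepts_ascii = bool(STREET_NAME.match("Main Street"))
rejects_digits = not STREET_NAME.match("KRZYWO\u0143 12")
```

Engines that support Unicode properties can express the same idea with `\p{L}`; Python's built-in `re` module does not support that syntax.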

Getting readable diff displays in Mercurial on Unicode files (MS Windows)

I'm trying to store some Windows PowerShell scripts in a Mercurial repository. It seems the PowerShell editor likes to save files as UTF-16 Unicode. This means that there are lots of \0 bytes, which is what Mercurial uses to distinguish between "text" and "binary" files. I understand that this makes no difference to how Mercurial stores...

pyODBC and Unicode Problem

Hey guys, I'm working with pyODBC to communicate with a MS SQL 2005 Express server. The table to which I'm trying to save the data consists of nvarchar columns. query = u"INSERT INTO tblPersons (name, birthday, gender) VALUES('" query = query + name + u"', '" query = query + birthday + u"', '" query = query + gender + u"')" cur.exe...

regex unicode character in vim

I'm being an idiot. Someone cut and pasted some text from Microsoft Word into my lovely HTML files. I now have these Unicode characters instead of regular quote symbols (i.e. quotes appear as <92> in the text). I want to do a regex replace but I'm having trouble selecting them. :%s/\u92/'/g :%s/\u5C/'/g :%s/\x92/'/g :%s/\x5C/'/g .....

What's the Unicode code point for [ \u8D27 ] ?

I want to find out if a Chinese character can be displayed. The Unicode for it is "\u8D27"; how do I use the Java Font method canDisplay? It takes an int, but "8D27" is not an integer. How does it work? Do I need another method to translate "8D27" to an int and then use canDisplay? If so, how do I translate it? Edit: To be more precise, how ...
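
The digits in "\u8D27" are a hexadecimal number, so the code point is already an integer once it is read in base 16. The question is about Java; the lines below are only a Python sketch of that conversion, which in Java would correspond to something like `Integer.parseInt("8D27", 16)`.

```python
# Parse the four hex digits as a base-16 integer.
code_point = int("8D27", 16)

# The same value written as a hex literal and taken from the character itself.
same_as_literal = code_point == 0x8D27
same_as_char = code_point == ord("\u8D27")

print(code_point)  # 36135
```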

VB.NET - Convert Unicode in one TB to Shift-JIS in another TB

Trying to develop a text editor, I've got two textboxes, and a button below each one. When the button below textbox1 is pressed, it is supposed to convert the Unicode text (intended to be Japanese) to Shift-JIS. The reason why I am doing this is because the software VOCALOID2 only allows ANSI and Shift-JIS encoding text to be pasted in...

Python's string.translate() doesn't fully work?

Given this example, I get the error that follows: print u'\2033'.translate({2033:u'd'}) C:\Python26\lib\encodings\cp437.pyc in encode(self, input, errors) 10 11 def encode(self,input,errors='strict'): ---> 12 return codecs.charmap_encode(input,errors,encoding_map) 13 14 def decode(self,input,errors='...
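
Two details in the example above look suspicious, independent of the console encoding error in the traceback: `'\2033'` is an octal escape (`\203`) followed by the digit 3 rather than U+2033, and the key `2033` is decimal rather than hexadecimal. A minimal sketch in Python 3 syntax (the question uses Python 2.6) of what was probably intended:

```python
# U+2033 DOUBLE PRIME, written with a \u escape and a hexadecimal key.
double_prime = "\u2033"
translated = double_prime.translate({0x2033: "d"})  # "d"

# What the original literal and key denote instead:
octal_literal = "\2033"    # "\x83" followed by "3", two characters
decimal_key = 2033         # 0x7F1, a different code point
unchanged = octal_literal.translate({decimal_key: "d"})  # nothing is replaced
```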

How to write to a file in Unicode in Vb.Net

How should I modify the following Vb.Net code to write str to the file in unicode? Do I need to convert str to Unicode before writing to the file? Using sw As StreamWriter = New StreamWriter(fname) sw.Write(str) sw.Close() End Using ...

VBA or Vb Scripting: how to write to a file in Unicode using FileSystemObject

How would you use FileSystemObject to write a string to a file in Unicode? ...

Are there standard ways and guidelines on extending Unicode with custom user-defined character sets?

In a linguistic project of mine I need to use symbols not contained in Unicode. I can draw the font, but would like to avoid overwriting characters defined by the standard. Are there any standard solutions for such cases, such as reserved ranges? I'd need common Unicode-aware software to handle it seamlessly. ...

Printing Unicode in eclipse Pydev console and in Idle

My configuration: Win7 + Python 2.6 + Eclipse + PyDev. How do I enable Unicode print statements in the PyDev console in Eclipse and in the IDLE Python GUI? Example print statement: print(u"שלום עולם") This comes out as: ùìåí òåìí ...

What does unicode character &#10; represent?

The unicode is &#10; and it's being used in an XML document. ...
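
`&#10;` is a decimal numeric character reference for code point 10 (U+000A), the line feed character. A small Python sketch showing how an XML parser resolves it; the `note` element is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical one-element document containing the reference.
element = ET.fromstring("<note>first&#10;second</note>")

text = element.text           # the reference is replaced by "\n"
lines = text.split("\n")      # ["first", "second"]
is_line_feed = chr(10) == "\n"
```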

Nvarchar + PDF encoding problem in Reporting Server 2005

I have a Report running on SQL 2005, say there are 2 fields I got from the database to show on the report: (A) =Field!FullName.Value (defined as varchar(250) in database) (B) =Field!Description.Value (defined as nvarchar(max) in database) When I export it to PDF, it shows up properly in Acrobat Reader. "FullName" (A) can be copi...