questions about unicode

How to Show Eastern Letter(Chinese Character) on SQL Server/SQL Reporting Services?

I need to insert chinese characters in my database but it always show ???? .. Example: Insert this record. 微波室外单元-Apple Then it became ??? Result: ??????-Apple I really Need Help...thanks in regard. I am using MSSQL Server 2008 ...

sql-server

unicode

Remove null characters from WideString in Delphi 2006

I have a WideString variable containing some data but when the string was assigned some extra nulls where added at more or less random places in the data. I now need to strip these nulls out of the variable. If it had been a string I would have checked each Char to see if Char(x) = 0 but as this is a WideString I dont think this work? Ho...

delphi

unicode

What's the unicode for '¿' ?

Google hasn't been a help. ...

unicode

Sorl - unicode normalize

Let's say I have document indexed with apache Solr that contains this string - "Klüft skräms inför" I want to be able to find it with search using this keyword "kluft" (note the "u"-"ü") is there a way to do this ? ...

unicode

solr

Django/python is converting my post data from JavaScript

When I post a JSON string to Django by Ajax, it converts it into an invalid JSON format. Specifically, if I look in the post data in Firebug I am sending: info {'mid':1,'sid':27,'name':'aa','desc':'Enter info' } Yet when I access it in the django request I am seeing: u'{\'mid\':1,\'sid\':27,\'name\':\'aa\',\'desc\':\'Enter Info\'}...

Serializing wchar_t* in a struct with TPL

Hello all, I'm trying to use tpl to serialize structs that contain wchar_t* strings. The code I have looks like this, and it's not working: #include <stdio.h> #include <locale.h> #include <string.h> #include <wchar.h> #include "tpl.h" struct chinese_t { wchar_t *chars; }; int main() { tpl_node *tn; struct chinese_t cstr; cstr...

Unicode characters show differently in different browsers

So... I'm still in unicode hell... New problem... On my computer, everything shows perfectly. In all browsers. On a co-workers computer, same story. Everything is good. Even in elinks and w3m on one of my Linux VPS'es all the exotic diacritics of Lithuanian and Latvian, and nordic letters, shows perfectly. However, I have had a few ca...

PHP construct a Unicode string?

Given a Unicode decimal or hex number for a character that's wanting to be output from a CLI PHP script, how can PHP generate it? The chr() function seems to not generate the proper output. Here's my test script, using the Section Break character U+00A7 (A7 in hex, 167 in decimal, should be represented as C2 A7 in UTF-8) as a test: <?ph...

php

unicode

How do I get the length of a Perl Unicode string input via Ajax or CGI?

Okay, this should be really simple, but I have searched all over for the answer and also read the following thread: http://stackoverflow.com/questions/1326539/finding-the-length-of-a-unicode-string-in-perl It does not help me. I know how to get Perl to treat a string constant as UTF-8 and return the right number of chars (instead of byt...

How to serialize a string with double byte characters?

I want to serialize a string (Korean string) containing double byte characters. Do I need to do something special? I am able to serialize char * strings by escaping special characters like \n,\b,\f,\t etc. I am using C++ on Windows without any libraries. ...

c++

windows

unicode

Writing a better natural sort (than mine)

I added an answer to this question here: Sorting List<String> in C# which calls for a natural sort order, one that handles embedded numbers. My implementation, however, is naive, and in lieu of all the posts out there about how applications doesn't handle Unicode correctly by assuming things (Turkey test anyone?), I thought I'd ask for ...

Problem with regional characters (polish) in php application and ms sql server

I have a problem with regional string characters inserted to MS SQL Server database. There is a PHP application that connects with mssql server and inserts some data. But instead of inserting characters such as: ą, Ą, ć, Ł, ź (and so on - btw. these are polish regional characters), when inserted into mssql table they appear as a, A, c,...

C++ Unicode Bullet Point

Hi stackoverflow! I am trying to insert the Unicode character U+2022 (bullet •) in my C++ application. I can't figure out how to convert that U+2022 to a char/string for use in std::string constructor... char bullet = char(0x2022); mPassword.SetText( std::string(mText.length(), bullet) ); This one doesn't work. Hope you can help !!...

Strings in Python 3

Hi: I am programing VIX API from python 2.5, but now I want to port the code to python 3.2 This funtion opens the virtual machine: self.jobHandle = self.VixLib.vix.VixVM_Open(self.hostHandle, "C:\\MyVirtualMachine.vmx", None, None)e...

How do I encode a Binary blob as Unicode blob?

I'm trying to store a Gzip serialized object into Active Directory's "Extension Attribute", more info here. This field is a Unicode string according to it's oM syntax of 64. What is the most efficient way to store a binary blob as Unicode? Once I get this down, the rest is a piece of cake. ...

What's the best sequence to compress binary data, encode it, and convert to string?

I'm trying to store a gzipped binary serialized object into Active Directory's "Extension Attribute", more info here. This field is a Unicode string according to it's oM syntax of 64. I'm saving the binary object into AD's Unicode format like this: byte[] bytes = ... // This is my blob System.Text.Encoding.Unicode.GetString(bytes); ...

QChar to wchar_t

Hello, I need to convert a QChar to a wchar_t I've tried the following: #include <cstdlib> #include <QtGui/QApplication> #include <iostream> using namespace std; int main(int argc, char** argv) { QString mystring = "Hello World\n"; wchar_t myArray[mystring.size()]; for (int x=0; x<mystring.size(); x++) { my...

Unicode in CSV file?

I need to generate a CSV file. Maybe i am 'doing it wrong' because i am dumping the file with my own code instead of using a lib but anyways. It looks like i have everything right. Quotes, commas and everything seems to be escaped perfectly. It was rather easy. The problem is i am using unicode strings to test and they come out as ????....

.net

unicode

csv

See the exact byte sequence of an R string?

How can I get the byte sequence of a particular UTF-8 string? I'm seeing what looks like some bug in the regex engine which is only triggered in some edge cases, and I'd like to know exactly what data it's working on. ...

r

unicode

string

How to Determine "Lowest" Encoding Possible?

Scenario You have lots of XML files stored as UTF-16 in a Database or on a Server where space is not an issue. You need to take a large majority of these files that you need to get to other systems as XML Files and it is critical that you use as little space as you can. Issue In reality only about 10% of the files stored as UTF-16 ne...