codepages

Codepages and encodings

Hi, Before anyone recommends that I do a google search on this, I have. I just need a bit more clarity around what codepages and encodings. If I use UTF8 encoding, and use an italian code page and then a french code page, does this mean ill get different characters even though the bytes havent changed? ...

PHP, MSSQL2005 and Codepages

I have a php script which accesses a MSSQL2005 database, reads some data from it and sends the results in a mail. There are special characters in both some column names (I know, it's terrible) and in the fields itself. When I access the script through my browser (webserver iis), the query is executed correctly and the contents of the...

How can I figure out what code page I am looking at ?

I have a device with some documentation on how to send it text. It uses 0x00-0x7F to send 'special' characters like accented characters, euro signs, ... I am guessing they copied an existing code page and made some changes, but I have no idea how to figure out what code page is closest to the one in my documentation. In theory, this s...

Tool to convert code source from a codepage to UTF-8?

I'm working on an open source project. The original project contains comments in russian and is using codepage 1251. I'm using codepage 1252 and the russian comments aren't displayed correctly in Visual Studio Express 2008, not nice but anyway I can't read russian. Someone using codepage 950 (traditional chinese) tried to compile the pro...

What is Codepage 0?

I'm using the Delphi function StringCodePage I call it on a string returned by a COM function (Acrobat Annotation getContents - see my other posts) and it returns 0. What is 0? Ansi? Thanks ...

Convert from Codepage 1252 (Windows) to Java, in Java

Hi! I have some strings in Java (originally from an Excel sheet) that I presume are in Windows 1252 codepage. I want them converted to Javas own unicode format. The Excel file was parsed using the JXL package, in case that matter. I will clarify: apparently the strings gotten from the Excel file look pretty much like it already is some...

code page values

apart from 1252 what other code page values can be used with getencoding() in vb.net? System.Text.Encoding.GetEncoding(1252).Getstring() ...

string encodings in python

Hello. In python, strings may be unicode ( both utf-16 and utf-8 ) and single-byte with different encodings ( cp1251, cp1252 etc ). Is it possible to check what encoding string is? For example, time.strftime( "%b" ) will return a string with text name of a month. Under MacOS returned string will be utf-16, under Windows with English ...

What codepage/charset should be used to interpret data coming from an MVS system into a Java Environment?

I've come into an interesting problem (as is often the case in interacting with legacy systems). I'm working on an application (which currently runs on a x86 Linux or Windows system) that can receive requests from a variety of systems, one of them being an MVS system. I am attempting to determine which codepage/charset I should be us...

Which codepage for html for eurpean languages? UTF-8 vs codepage 1252 in VS 2008

Hi there, I have been developing with some software for javscript that has a parser that reads in my JS and creates a compressed version, it kept failing as by default vs 2008 saves its files in UTF8 from what i see.. there was like a hidden couple of characters at the start of the file.. Forcing a save as 1252 sorted the issue, my que...

.Net unicode problem, vb6 legacy

Hi, I have a decryption routine in VB6. I now want the same decryption in C#. The strings that need decryption are in unicode, so I use Encoding.Unicode.GetString to read the input in C#. The input now looks exactly the same as in VB6. The first few characters in the loop are decrypted ok! Then I encounter a difference... The program ...

ANSI or OEM Codepage when using MME and DirectMusic?

Hello, I noticed that when reading MIDI port names from MME, the names are multi-byte strings encoded using the ANSI Codepage, which my app uses by default. When receiving those names from the DirectMusic driver, the names are wide-character strings encoded with the OEM Codepage. See this article by Raymond Chen for a quick refresher on...

HtmlElementEventArgs KeyPressedCode Confusion

I'm using the following code to decide if a '.' (full stop) has been entered into a webbrowser control: private void body_KeyUp(object sender, HtmlElementEventArgs e) { if (e.KeyPressedCode == '.') { // Do something } } According to msdn KeyPressedCode returns an ASCII value. What I get ...

How to get the code page of the current keyboard layout?

My non-Unicode application needs to be able to process Unicode keyboard input (WM_CHAR/etc.), thus receive the 8-bit character code then internally convert it to Unicode. 9x-compatibility is required, so using most Unicode APIs is not an option. Currently it looks at the language returned by PRIMARYLANGID(GetKeyboardLayout(0)), and loo...

Sharepoint Changing URL Special characters

I have a URL stored in a Sharepoint 'Link to a document' that contains the Nordic Character 'Ø' as hex(%D8). Sharepoint is encoding the % symbol, (thus making it %25D8). If I enter the URL directly into an address bar, it works fine. If I don't URL encode it, I still get the same problem. Anybody got any ideas? ...

How do you specify a Java file.encoding value consistent with the underlying Windows code page?

I have a Java application that receives data over a socket using an InputStreamReader. It reports "Cp1252" from its getEncoding method: /* java.net. */ Socket Sock = ...; InputStreamReader is = new InputStreamReader(Sock.getInputStream()); System.out.println("Character encoding = " + is.getEncoding()); // Prints "Character encoding = Cp...

What's the code page of utf8?

My cmd promt's default code page is 936. I need to change it to utf8. chcp 65001 The above doesn't work,what's the correct one? ...

Dummy's guide to Unicode

Could anyone give me a concise definitions of Unicode UTF7 UTF8 UTF16 UTF32 Codepages How they differ from Ascii/Ansi/Windows 1252 I'm not after wikipedia links or incredible detail, just some brief information on how and why the huge variations in Unicode have come about and why you should care as a programmer. ...

Creating tar archive with national characters in Java

Hi, Do you know some library/way in Java to generate tar archive with file names in proper windows national codepage ( for example cp1250 ). I tried with Java tar, example code: final TarEntry entry = new TarEntry( files[i] ); String filename = files[i].getPath().replaceAll( baseDir, "" ); entry.setName( new String( filename.getBytes...

Is codepage 65001 and utf-8 the same thing?

<%@LANGUAGE="VBSCRIPT" CODEPAGE="65001"%> <!--#include file="conn.asp"--> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"&gt; <html xmlns="http://www.w3.org/1999/xhtml"&gt; <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> Is the above c...