charset

Is there an upside down caret character?

I have to maintain a large number of classic ASP pages, many of which have tabular data with no sort capabilities at all. Whatever order the original developer used in the database query is what you're stuck with. So I want to to tack on some basic sorting to a bunch of these pages, and I'm doing it all client side with javascript. I ...

How to get rid of weird characters in my RSS feed?

Hi, I've created a utf8 encoded RSS feed which presents news data drawn from a database. I've set all aspects of my database to utf8 and also saved the text which i have put into the database as utf8 by pasting it into notepad and saving as utf8. So everything should be encoded in utf8 when the RSS feed is presented to the browser, howe...

What is the best way to determine the correct Charset for a given LCID at runtime in VB6?

I am displaying Japanese characters in a VB6 application with the system locale set to Japan and the language for non Unicode programs as Japanese. A call to GetACP() correctly returns 932 for Japanese. When I insert the Japanese strings into my controls they display as “ƒAƒtƒŠƒJ‚̏—‰¤” rather than “アフリカの女王”. If I manually set the Fon...

Getting the € with htmlentities

I am really trying to show what htmlentities gives me, but it doesn't give & euro; for the € character. I am trying echo htmlentities(htmlentities("LISBOA-VIENA DESDE 99€ TAXAS INCLUÍDAS, RESERVE JÁ",ENT_COMPAT,ISO-8859-1),ENT_COMPAT,ISO-8859-1); echo '<br>'; echo htmlentities(htmlentities("LISBOA-VIENA DESDE 99€ TAXAS INCLUÍDA...

How can I determine what the alphabet for a locale is in java ?

I would like to determine what the alphabet for a given locale is, preferably based on the browser Accept-Language header values. Anyone know how to do this, using a library if necessary ? ...

What are all of the allowable characters for people's names?

There are the standard A-Z, a-z characters, but also there are hyphens, em dashes, quotes, etc. Plus, there are all of the international characters, like umlauts, etc. So, for an English-based system, what's the complete set? What about sets for other languages? What about UTF8, UTF16, etc? Bonus question: How many name fields are nee...

Usable charset for Moldava

Hi! Does anybody knows which charset is used in Moldava. We've to prepare our software (and database) for Moldava. ;-) I guess UTF-8 should work, shouldn't it? Thank you in advance. Best regards Thomas ...

Java App : Unable to read iso-8859-1 encoded file correctly.

I have a file which is encoded as iso-8859-1, and contains characters such as ô . I am reading this file with java code, something like: File in = new File("myfile.csv"); InputStream fr = new FileInputStream(in); byte[] buffer = new byte[4096]; while (true) { int byteCount = fr.read(buffer, 0, buffer.leng...

Java : How to determine the correct charset encoding of a stream

With reference to the following thread: http://stackoverflow.com/questions/498636/java-app-unable-to-read-iso-8859-1-encoded-file-correctly What is the best way to programatically determine the correct charset encoding of an inputstream/file ? I have tried using the following: File in = new File(args[0]); InputStreamReader r = ne...

Jetty Response with no Charset

Hi, I'm using Jetty to test a webservice we have and I am trying to get it to respond with no charset under the content-type header. Does anyone know how to do this? I've tried intercepting the Response and setting the CharacterEncoding to null or "" but that gives Exceptions. I am using Jetty 6.1.6. ...

How to properly link to files with accents in their names in a LAMP envirionment

I have made a php script that iterates through files in a directory and outputs links to them. The file names are in Spanish so some contain characters like á, é, etc. My script works fine in my dev machine which is windows+apache+php, however it does not work in my hosting's server which is linux+apache+php, it does not find the file. ...

HTML entities and charset in IE

I am displaying html entity &#10003; (a check mark: ✓) in an html document that uses iso-8859-1 for the character set. In Firefox, it displays as a check mark. In IE, it displays as a square box. Switching to UTF-8 doesn't seem to make a difference. Is there a reliable way to display these entities in IE 6 & 7 without using images? ...

jQuery AJAX Character Encoding Problem

Hi everyone, I'm currently coding a French website. There's a schedule page, where a link on the side can be used to load another day's schedule. http://aquate.us/film/horaire.html (At the moment, only the links for November 13th and November 14th work) Here's the JS I'm using to do this: <script type="text/javascript"> function l...

phusion passenger charset

Hi All, I just installed Apache 2, Phusion Passenger and Rails 2.2.2 on Centos 5.2 64 bit server. My mysql database is latin1, and my setup works on php server, and my rails setup is also working besides character encoding with the data from database. I have set meta http-equiv="Content-Type" content="text/html;charset=iso8859-1" (omi...

Problem with CHARSET in MySQL create table

Hi this is my code and it returns "You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near 'VALUE, chapter VARCHAR(100) CHARACTER SET utf8 COLLA' at line 2" CREATE TABLE IF NOT EXISTS texts ( id SERIAL DEFAULT VALUE, ...

Detect whether charset exists in python

Is it possible to check in Python whether a given charset exists/is installed. For example: check('iso-8859-1') -> True check('bla') -> False ...

Broken accent characters when Copy / Paste into ASP .Net

I am copy pasting from an MS word document into an ASCX file. When I view the ascx file, the accented characters appear normally. BUT, when the page is rendered through my ASP.net application, the accented characters are broken: Une promenade dans un verger ensoleillé, un peau de pêche délicatement parfumée… Les plaisirs du pr...

Converting webpages from UTF-8 to ISO-8859-1 in linux

Anyone have a neat trick on how to convert a number of php and html files from UTF-8 to ISO-8859-1 in linux (Ubuntu)? ...

What codepage/charset should be used to interpret data coming from an MVS system into a Java Environment?

I've come into an interesting problem (as is often the case in interacting with legacy systems). I'm working on an application (which currently runs on a x86 Linux or Windows system) that can receive requests from a variety of systems, one of them being an MVS system. I am attempting to determine which codepage/charset I should be us...

Can I use Unicode to decode an HTTP request?

I understand that the default encoding of an HTTP Request is ISO 8859-1. Am I able to use Unicode to decode an HTTP request given as a byte array? If not, how would I decode such a request in C#? EDIT: I'm developing a server, not a client. ...