utf-8

(PHP) rawurlencode/decode seems to encode '£' sign as 'Â£' (%C2%A3 instead of %A3)

So, I've run into a problem with PHP's rawurlencode function. All text fields in our web app are of course converted before being processed by the web-server, and we've used rawurlencode for this. This works fine with almost every character I've found, except for the "£" sign. Now, there is no reason for our users to ever enter a pound s...
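The two byte sequences in the title correspond to two different character encodings of the same character. The following is a minimal sketch in Python (not the asker's PHP code), using the standard `urllib.parse` functions, of how "£" percent-encodes under UTF-8 versus Latin-1, and how reading the UTF-8 bytes as Latin-1 produces the extra "Â". The variable names are illustrative only.

```python
from urllib.parse import quote, unquote

pound = "£"  # U+00A3 POUND SIGN

# Percent-encode the same character under two different byte encodings.
utf8_form = quote(pound, encoding="utf-8")      # two bytes: C2 A3
latin1_form = quote(pound, encoding="latin-1")  # one byte:  A3

print(utf8_form)    # %C2%A3
print(latin1_form)  # %A3

# Decoding the UTF-8 bytes as if they were Latin-1 yields mojibake.
mojibake = unquote(utf8_form, encoding="latin-1")
print(mojibake)     # Â£

# Decoding with the matching encoding round-trips cleanly.
round_trip = unquote(utf8_form, encoding="utf-8")
print(round_trip)   # £
```

If this reading is right, %C2%A3 is a correct UTF-8 encoding of "£", and the stray "Â" only appears when the receiving side assumes a single-byte encoding.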

Problems with UTF-8 accent Characters in JQuery form submission using Internet Explorer

Hi, I am using JQuery to submit a form with $.ajax() and then get the params in a php page to store them in a Database. My Pages are UTF8 encoded. Everything works well when I use Firefox. However, with Internet Explorer, when the data has accent characters in it, it discards these completely. I am using : $.ajaxSetup({ scriptCharset:...

Spring MVC: How to store € character?

Hi there, I am using Spring 3 MVC and I have setup a form to capture input from a user. This form includes a textarea for a description String in my model object, Event. My corresponding controller looks like this: @RequestMapping(value = "/admin/event/{eventId}/edit", method = RequestMethod.POST) public String updateEvent(@ModelAtt...

Why does the Java ecosystem use different character encodings throughout their software stack?

For instance class files use CESU-8 (sometimes also called MUTF-8), but internally Java first used UCS-2 and now it uses UTF-16. The specification about valid Java source files says that a minimal conforming Java compiler only has to accept ASCII characters. What's the reason for these choices? Wouldn't it make more sense to use the sam...

Grails encoding problem Gsp. Help!

I'm having trouble with Groovy Server Pages (GSP), where UTF-8 characters are not displayed properly and show up as "?". This occurs inside Grails tags and expressions like "${}". Example: <p>Não</p> --> output browser: Não - OK <g:link title="Avaliação Relatório"....> output browser: Avaliaç?o Relat?rio ${value? "Sim" : "Não"} output browser: N?o The gsp already...

ï»¿ encoding issue

I'm developing a website using PHP and these strange chars "ï»¿" appear in my page, right at the top of it. My code is this: <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><?php echo '';?> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta http-eq...

XML file encoding format "utf-8" VS "UTF-8"?

Hello. When the XML file is converted to ASCII, the three characters "utf" and "UTF" have different values. <?xml version="1.0" encoding="utf-8"?> <?xml version="1.0" encoding="UTF-8"?> I tried creating a new XML file with VS2005, and it generates the lowercase utf-8 form by default. Which one is the more standard form? Thanks. ...
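As far as I know, the XML 1.0 specification recommends that processors match encoding names case-insensitively, so both declarations should be treated the same. A small Python sketch of that behaviour, using the standard `codecs` and `xml.etree.ElementTree` modules (the sample documents are made up for illustration):

```python
import codecs
import xml.etree.ElementTree as ET

# Two hypothetical documents that differ only in the case of the encoding name.
lower = b'<?xml version="1.0" encoding="utf-8"?><word>\xc3\xb3bito</word>'
upper = b'<?xml version="1.0" encoding="UTF-8"?><word>\xc3\xb3bito</word>'

# Python resolves both spellings to the same codec.
same_codec = codecs.lookup("utf-8").name == codecs.lookup("UTF-8").name

# The parser decodes the element text identically in both cases.
lower_text = ET.fromstring(lower).text
upper_text = ET.fromstring(upper).text

print(same_codec)                # True
print(lower_text == upper_text)  # True
```

Other parsers may behave differently, so this only shows what one implementation does.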

How do I remove ï»¿ from the beginning of a file?

I have a CSS file that looks fine when I open it using gedit, but when it's read by PHP (to merge all the CSS files into one), this CSS has the following chars prepended to it: ï»¿ PHP removes all whitespace, so a random ï»¿ in the middle of the code messes up the entire thing. As I mentioned, I can't actually see these chars when I ope...
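Characters prepended like this are typically a UTF-8 byte order mark: the three bytes EF BB BF, which display as "ï»¿" when read as Latin-1. Assuming that is the cause here, a minimal Python sketch (not the asker's PHP merge script) of detecting and stripping it could look like the following; `strip_utf8_bom` and the sample CSS are hypothetical names for illustration.

```python
import codecs

def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 byte order mark, if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data

# Hypothetical file contents: a BOM followed by ordinary CSS.
css_bytes = codecs.BOM_UTF8 + b"body { margin: 0; }"

cleaned = strip_utf8_bom(css_bytes)
print(cleaned)  # b'body { margin: 0; }'

# The BOM bytes read as Latin-1 give the three visible characters.
as_latin1 = codecs.BOM_UTF8.decode("latin-1")
print(as_latin1)  # ï»¿

# The 'utf-8-sig' codec drops a leading BOM while decoding.
text = css_bytes.decode("utf-8-sig")
print(text)  # body { margin: 0; }
```

Stripping at read time, before files are concatenated, keeps a BOM from ending up in the middle of the merged output.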

RSS reader Error : Input is not proper UTF-8 when use simplexml_load_file()

Hi all, I'm using the simplexml_load_file method to parse a feed from an external source. My code looks like this: $rssFeed['DAILYSTAR'] = 'http://www.thedailystar.net/latest/rss/rss.xml'; $rssParser = simplexml_load_file($url); The output is as follows : Warning: simplexml_load_file() [function.simplexml-load-file]: http://www.thedailysta...

html entities decoding in php

I seem to be completely unable to get around utf-8 character encoding. So I'm exporting content from a database as a utf-8 xml file. The software I am importing into is quite strict about character encoding, so I can't just put everything in CDATA tags. There's a whole bunch of weird characters, e.g. , already in the data. These are...

[php] replace \n with actual new line character code

Hi all, I'm pulling content from a DB that has been sanitized using mysql_real_escape_string. Accordingly the new line characters now appear as "\n". The issue is that this content is displayed to users inside a < pre > tag so I cannot replace \n with < br/> for instance. I suppose I could replace \n with the actual utf8 character code...

PHP UTF-8 to GB2312

Part of our web app has a little Ajax method that will load a page in an iFrame or allow you to download it. We store a bunch of search results from search engines and we have a script that opens the file containing our info and the search html. We strip out the stuff we don't need from the top (our info) and then we serve that up either by ec...

Wordpress COMMENTS IN URDU (other language)

I am working on a WordPress project. My database is in utf general. Everything seems fine on the database side, and the rest of the website is working well except the comments. When I display comments, WordPress just displays symbols rather than words. Can anyone help me? ...

How to write a check in python to see if file is valid UTF-8?

As stated in the title, I would like to check whether a given file object (opened as a binary stream) is a valid UTF-8 file. Anyone? Thanks ...
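One possible approach, sketched here under the assumption that the file may be too large to read in one go, is to feed the stream through Python's incremental UTF-8 decoder in chunks and treat any `UnicodeDecodeError` as invalid. The function name `is_valid_utf8` and the chunk size are illustrative choices.

```python
import codecs
import io

def is_valid_utf8(stream, chunk_size=65536):
    """Return True if the binary stream contains only well-formed UTF-8."""
    decoder = codecs.getincrementaldecoder("utf-8")(errors="strict")
    try:
        while True:
            chunk = stream.read(chunk_size)
            if not chunk:
                break
            # The decoder buffers incomplete multi-byte sequences across chunks.
            decoder.decode(chunk)
        # final=True raises if the stream ended in the middle of a sequence.
        decoder.decode(b"", final=True)
    except UnicodeDecodeError:
        return False
    return True

# Usage with in-memory streams standing in for files opened in "rb" mode.
good = io.BytesIO("óbito".encode("utf-8"))
bad = io.BytesIO("óbito".encode("latin-1"))
print(is_valid_utf8(good))  # True
print(is_valid_utf8(bad))   # False
```

For small files, `data.decode("utf-8")` inside a `try` block does the same job with less code. The incremental decoder mainly helps when a multi-byte character may straddle a chunk boundary.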

PHP + MySQL encoding issues for Portuguese (PT-Br)

I'm developing a website in Brazilian Portuguese and I'm facing some really annoying encoding problems. Words that should be written this way: óbito are being written this way: �bito I have noticed that while the texts are still at the database they are ok. But when I use echo with PHP, the encoding problem comes up. List of things I ...

Migrating data between two MySQL with different character_set%, messed up with utf8

Migrating Data from MySQL server1 to MySQL server2 server1 Ver 14.12 Distrib 5.0.51a, for debian-linux-gnu (x86_64) using readline 5.2 mysql> SHOW VARIABLES LIKE 'character_set%'; +--------------------------+------------------------------------------+ | Variable_name | Value | +------------...

Java + Mysql UTF8 Problem

Hi, as the title says, I have a problem between Java and MySQL. The MySQL DB, tables, and columns are utf8_unicode_ci. I have an application that takes some input from an XML file, then composes the query... public String [] saveField(String xmltag, String lang){ NodeList nodo = this.doc.getElementsByTagName(xmltag); String [] pos = ...

NSXMLParser with UTF8 Data

I'm having a really hard time trying to figure this out and would really appreciate any help. I'm trying to parse a utf 8 string with NSXMLParser but it won't work. here is my string <?xml version="1.0" encoding="UTF-8"?><host><type>mac</type><port>62181</port><address>192.168.1.159</address><hostname>Samuel’s%20Mac%20Book</hostname><...

CSV, DictWriter, unicode and utf-8

Hi, I am having problems with the DictWriter and non-ascii characters. A short version of my problem: #!/usr/bin/env python # -*- coding: utf-8 -*- import codecs import csv f = codecs.open("test.csv", 'w', 'utf-8') writer = csv.DictWriter(f, ['field1'], delimiter='\t') writer.writerow({'field1':u'å'.encode('utf-8')}) f.close() Giv...

How can I force XDocument to output "UTF-8" in the declaration line?

The following code produces this output: <?xml version="1.0" encoding="utf-16" standalone="yes"?> <customers> <customer> <firstName>Jim</firstName> <lastName>Smith</lastName> </customer> </customers> How can I get it to produce encoding="utf-8" instead of encoding="utf-16"? using System; using System.Collections.Generic; ...