accents

Outputing json with well formed accents

Hello, I have an anoying problem that is giving me a hard time these days... I would like to develop a few webservices for my own usage and currently i am fighting with my damn french accents to be rendered correctly in my json outputs. Here is my scenario: I retrieve a number of lines from my database that i put in a dict. What i want...

How to know if a string contains accents

How to know if a string contains accents? ...

How to remove accent characters from an InputStream

I am trying to parse a Rss2.0 feed on Android using a Pull parser. XmlPullParser parser = Xml.newPullParser(); parser.setInput(url.open(), null); The prolog of the feed XML says the encoding is "utf-8". When I open the remote stream and pass this to my Pull Parser, I get invalid token, document not well formed exceptions. When I sa...

Erlang - Eccentricity with accented characters and string literal

Hey, I am trying to implement a function to differentiate between french vowels and consonnants. It should be trivial, let's see what I wrote down : -define(vowels,"aeiouyàâéèêëôù"). is_vowel(Char) -> C = string:to_lower(Char), lists:member(C,?vowels). It's pretty simple, but it behaves incorrectly : 2> char:is_vow...

Handling different non-accented versions of Umlaut characters

The German accented Umlaut characters “ö”, “ä” and “ü” are often replaced with non-accented versions when users type, often for convenience when they do not have the correct keyboard. With most accented characters there is a particular non-accented version that most people use. The accented “è”, for instance, is always replaced with a s...

PHP include html page charset problem

Hello, after querying a mysql db using the code below i have generated an html file: $myFile = "page.htm"; $fh = fopen($myFile, 'w') or die("can't open file"); fwrite($fh, $row['text']); fclose($fh); On the msql db the text is encoded using utf8_general_ci. But i need to include it in a php web page as shown below: <?includ...

Transforming some special caracters (é,è,...) into "e"

Hello, I have a question related to Java in my android application. I have a list with some Belgian cities with accentued ( is that the right english word?) characters: (Liège,Quiévrain,Franière, etc.) and I would like to transform these special characters to compare with a list containing the same names in upper case, but without thes...

accent not showing in jQuery load

Hello, I'm working on a small site that I would like to change a little bit so it is more dynamic. The index .html is something like <html> <head> <meta http-equiv="content-type" content="text/html; charset=iso-8859-1"> ... </head> <body> <div id="page"> <div id="header"></div> <div id="main"> <div i...

Flex - compare string without considering the accents

Hi, I have a compare routine ... but I need it to understand that when i search a "e" I also search for "é" or "è". Is there an esay way to do that or do I really need to search and replace every accentued caractere before comparing ? Thanks ...

Java - getting rid of accents and converting them to regular letters

Is there a better way for getting rid of accents and making those letters regular apart from using String.replaceAll() method and replacing letters one by one? Example: Input: orčpžsíáýd Output: orcpzsiayd It doesn't need to include all letters with accents like the Russian alphabet or the Chinese one. ...

PHP: Fixing encoding issues with database content - removing accents from characters

Hi everyone, I'm trying to make a URL-safe version of a string. In my database I have a value medúlla - I want to turn this into medulla. I've found plenty of functions to do this, but when I retrieve the value from the database it comes back as medúlla. I've tried: Setting the column as utf_8 encoding Setting the table as utf_8 ...

How to convert an accented character in an unicode string to its unicode character code using Python?

Just wonder how to convert a unicode string like u'é' to its unicode character code u'\xe9'? Thank you for your help. ...

Saving accented characters from PHP script to Oracle DB

Hello, i'm trying to save accented chars (èòàèì) to an Oracle DB in a VARCHAR2 field; i've put <html> <head> <meta http-equiv="Content-type" value="text/html; charset=utf-8"> </head> <body> <?php header('Content-type: text/html; charset=utf-8'); .... //and here i make the insert into the DB: $str=utf8_encode("JeanPièrre"); // or...

Accent insensitive search in InnoDB MySQL table!

Hello, I am working on a simple search script that looks through two columns of a specific table. Essentially I'm looking for a match between either a company's number or their name. I'm using the LIKE statement in SQL because I am using InnoDB tables (which means no fulltext searches). The problem is that I am working in a bilingual e...

Delphi: CDO.Message encoding problems

Hi! We wrote a Delphi program that send some informations with CDO. In my Win7 machine (hungarian) the accents are working fine. So if I sent a mail with "ÁÉÍÓÖŐÚÜŰ", I got it in this format. I used iso-8859-2 encoding in the body, and this encode the subject, and the email addresses to (the sender address is contains name). I though...

Reading Text with Accent - Python

I did some script in python that connects to GMAIL and print a email text... But, often my emails has words with "accent". And there is my problem... For example a text that I got: "PLANO DE S=C3=9ADE" should be printed as "PLANO DE SAÚDE". How can I turn legible my email text? What can I use to convert theses letters with accent? Tha...

Case Sensitive Accent Folding in Javascript

You wrote this code: accentsTidy = function(s){ var r=s.toLowerCase(); r = r.replace(new RegExp("\\s", 'g'),""); r = r.replace(new RegExp("[àáâãäå]", 'g'),"a"); r = r.replace(new RegExp("æ", 'g'),"ae"); r = r.replace(new RegExp("ç", ...

How can I make a regular expression which takes accented characters into account?

I have a JavaScript regular expression which basically finds two-letter words. The problem seems to be that it interprets accented characters as word boundaries. Indeed, it seems that A word boundary ("\b") is a spot between two characters that has a "\w" on one side of it and a "\W" on the other side of it (in either order), ...

Char with accent to char without accent in C

Hey guys. simple question: how to remove accents from a char? Like ã -> a, and é -> e. I asked in another question how to convert utf-8 to ascii but this is unnecessary, since I only need to treat those situations. I tried: char comando; if( comando == 'ç' || comando == 'Ç') { comando = 'c'; return comando; } But ...

Case and accent insensitive 'like' comparison oracle without altering session on oracle

Hi all I need to be able to do 'like' queries across several languages, so that a umlaut, a and A are treated the same and so on. I'm on 10gr2 of oracle and I can't alter the session. I've been trying things like nls_upper(col_name,'NLS_SORT=BINARY_AI') like nls_upper('%fur%','NLS_SORT=BINARY_AI') but I'm not having any joy. Whatever...