non-ascii-chars

using non-latin characters in a URL

I'm working on a site which the client has had translated into Croatian and Slovenian. In keeping with our existing URL patterns we have generated URL rewriting rules that mimic the layout of the application, which has led to many non-ASCII characters in the URLs. Examples: š ž č. Some links are triggered from Flash using getURL...

bash: how do I write non ascii chars using echo?

Is there an escape sequence, such as \012 or something like that? I want to append non-ASCII chars to a file using echo ?? >> file ...

How to Make Emacs Display Chinese Characters

I often use simple Chinese phrases like "你好" to test that my code can handle non-ascii characters. Whenever I enter Chinese characters directly into Emacs, they just come out as question marks. Emacs can sometimes display characters properly if I open a premade text file but not always. For example, if I create a simple text file in No...

Easy Q: UnicodeEncodeError: 'ascii' codec can't encode character

Hi, I'm trying to pass big strings of random html through regular expressions and my Python 2.6 script is choking on this: UnicodeEncodeError: 'ascii' codec can't encode character I traced it back to a trademark superscript on the end of this word: Protection™ -- and I expect to encounter others like it in the future. Is there a modu...

Easy Python Q: UnicodeEncodeError: 'ascii' codec can't encode character ???

Hi, I'm trying to pass big strings of random html through regular expressions and my Python 2.6 script is choking on this: UnicodeEncodeError: 'ascii' codec can't encode character I traced it back to a trademark superscript on the end of this word: Protection™ -- I do not need to capture the non-ascii stuff, but it is a nuisance and I...
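
A minimal sketch of one common way around this error, assuming the HTML is already held as a Unicode string: encode to ASCII with an explicit error handler instead of the default strict one. The helper name `to_ascii` and its `keep_as_entities` flag are hypothetical, chosen only for illustration.

```python
def to_ascii(text, keep_as_entities=True):
    """Return an ASCII-only version of a Unicode string (hypothetical helper)."""
    # "xmlcharrefreplace" turns each non-ASCII character into a numeric
    # character reference such as &#8482; while "ignore" silently drops it.
    errors = "xmlcharrefreplace" if keep_as_entities else "ignore"
    return text.encode("ascii", errors).decode("ascii")


sample = "Protection\u2122"          # the word followed by the trademark sign
print(to_ascii(sample))              # Protection&#8482;
print(to_ascii(sample, False))       # Protection
```

Keeping the character as a numeric reference preserves it for a browser; dropping it suits the case where the non-ASCII content is not needed at all.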

Unicode support in Web standard fonts

I need to decide whether to render geometric symbols in a web GUI (e.g. arrows and triangles for buttons, menus, etc.) as Unicode symbols (MUCH easier and color-independent) or GIF/PNG files (lots of hassle I would like to avoid). However, I have seen Windows clients that have trouble displaying even advanced punctuation symbols declare...

Convert Hi-Ansi chars to Ascii equivalent (é -> e) in Delphi(2007)

Is there a routine available in D2007 to convert the characters in the high range of the ANSI table (>127) to their equivalent ones in pure ASCII (<=127) according to a locale (codepage)? I know some chars cannot translate well but most can, esp. in the 192-255 range: À -> A à -> a Ë -> E ë -> e Ç -> C ç -> c – -> - (that can be trickier...
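
The mapping asked for here can be illustrated in Python rather than Delphi (a swapped-in language, not a D2007 routine): decompose each character with Unicode normalization, then discard whatever is not ASCII. The function name `fold_to_ascii` is hypothetical. This sketch does not handle characters with no decomposition, such as the en dash, which are simply dropped.

```python
import unicodedata


def fold_to_ascii(text):
    """Strip accents by decomposing characters and dropping non-ASCII parts."""
    # NFKD splits "é" into "e" plus a combining acute accent (U+0301).
    decomposed = unicodedata.normalize("NFKD", text)
    # The combining marks are non-ASCII, so "ignore" removes them.
    return decomposed.encode("ascii", "ignore").decode("ascii")


print(fold_to_ascii("ÀàËëÇç"))   # AaEeCc
```

A locale-aware table would be needed for cases like the dash, where the desired ASCII replacement is a matter of convention rather than decomposition.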

Non-ascii characters in velocity templates are broken when displayed

Hi! I have non-ascii chars in velocity template files. And when processed they are garbled. The files are saved in UTF-8 encoding and response header contentType is also set to text/html;charset=UTF-8. What else can be done? ...

highlighting non-ascii text

As a non-techie, I would really appreciate your help on this. I have some files (html) that need to be in pure ascii form to be properly processed. Since these files are produced by humans, every so often non-ascii characters sneak in. Often it is a stray " (curled variety) or something similar that is difficult to find and need to b...
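
One way to locate such characters, sketched in Python under the assumption that the file has already been read into a string: walk each line and report the line number, column, and character for anything outside the ASCII range. The name `find_non_ascii` is hypothetical.

```python
def find_non_ascii(text):
    """Return (line, column, character) for every non-ASCII character."""
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for col, ch in enumerate(line, start=1):
            if ord(ch) > 127:          # anything beyond 7-bit ASCII
                hits.append((line_no, col, ch))
    return hits


sample = "ok\nsay \u201chi\u201d"      # second line contains curled quotes
for line_no, col, ch in find_non_ascii(sample):
    print(f"line {line_no}, column {col}: {ch!r} (U+{ord(ch):04X})")
```

Printing the code point alongside the position makes it easier to tell visually similar characters apart, such as straight and curled quotes.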

mysql query to find every occurrence of a non-ascii character in a table column and replace it with its HTML equivalent

I have a column that contains HTML strings and has characters like ® and ™ in it. I want to replace all such characters with their HTML equivalent. Is this possible? ...

InvalidAttributeValueException when using non-ascii characters in JNDI

Hi, I have some trouble using JNDI since it accepts only 7-bit encoded parameters. I am trying to change an LDAP entry using JNDI with the following code: Attribute newattr = new BasicAttribute("userpassword", password); ModificationItem[] mods = new ModificationItem[1]; mods[0] = new ModificationItem(DirContext.REPLACE_ATTRIBUTE, n...

Handling Special char such as ^ÛY, ^ÛR in java

Hi, Has anybody encountered special char such as ^ÛY, ^ÛR ? Q1. How do I do an ftp of the files containing these chars? The chars are not seen once I do a ftp on AIX (bi or ascii) and hence I am unable to see my program to replace these, working. Q2. My java program doesn't seem to recognise these or replace these if I search for th...

Doctrine Searchable with non-ASCII characters

Hi, I have text in Turkish: "selam günaydın". Doctrine Searchable converts it to these keywords in the table: -selam -guenaydin. So "günaydın" was saved in the table as the keyword "guenaydin", and when somebody searches for "günaydın" he gets nothing. What can I do? ...

Help Replacing Non-ASCII character in Python

I have a bunch of HTML files I downloaded using the HTTPLIB2 package in Python. '&nbsp;' are showing as 'Â '. <font color="#ff0000">02/12/2004Â </font> is showing while <font color="#ff0000">02/12/2004&nbsp;</font> is the desired format. How do I replace the 'Â ' with '&nbsp;' in Python? Thanks a lot! ...
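
A likely explanation, stated here as an assumption rather than a diagnosis: the page is UTF-8, where a non-breaking space is the two bytes C2 A0, and reading those bytes as Latin-1 yields "Â" followed by a non-breaking space. The sketch below decodes the raw bytes as UTF-8 first and then substitutes the entity. The name `fix_nbsp` and the sample bytes are hypothetical.

```python
def fix_nbsp(raw_bytes):
    """Decode UTF-8 bytes and write non-breaking spaces as &nbsp;."""
    text = raw_bytes.decode("utf-8")
    return text.replace("\u00a0", "&nbsp;")


# Sample bytes, assuming the server sent UTF-8.
raw = b'<font color="#ff0000">02/12/2004\xc2\xa0</font>'

print(raw.decode("latin-1"))   # the garbled form, with a stray "Â"
print(fix_nbsp(raw))           # the desired form, with &nbsp;
```

If the downloaded files really are in another encoding, the `decode` argument would need to change accordingly.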

Non-english alpha-numerics in a text file

C# WinForm application EDIT: It appears there's concern about foreign language compatibility. This is a non-issue. The card game I'm making this utility for is primarily in English. In the future I may support other languages, but everything will still be keyed off the English names, which are a primary key in both the program and the...

Java (Groovy) LDAP Non-ASCII Characters

Hi, I've been googling my ass off without any success. So I came here. Hope someone can help me out! I'm using Groovy to do some LDAP searches but when a person has å, ä or ö in their names I get for example Jens R√§fteg√•rd. I've tried: new String(attrs.get("name")?.get(0)?.getBytes(), "UTF-8") And even tried: def charsets = Charset....

Regex subbing out 'section character' in java

I'm running a series of regex substitutions (i.e. String.replaceAll calls) to convert all the special characters in a text file to XML parseable special characters. For example: string_out = string_out.replaceAll("&", "&amp;"); I've hit a stumbling block replacing the 'section character' that is, this little squiggle: § For starters,...

how to remove spurious non ascii characters, but keep spaces and newlines?

I have some text files that contain some non ASCII characters, I want to remove them, however keep the formatting characters. I tried $description = preg_replace('/[\x00-\x1F\x80-\xFF]/', '', $description); However that appeared to strip newlines and other formatting out and also had problems with some Hebrew which then converted th...
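
The range `\x00-\x1F` in the pattern above includes tab (`\x09`), line feed (`\x0A`), and carriage return (`\x0D`), which would explain the lost newlines. A sketch of the same idea in Python (not PHP, and not the asker's code): whitelist tab, newline, carriage return, and printable ASCII, and work on a decoded string so multi-byte characters are removed whole instead of byte by byte. The name `strip_non_ascii` is hypothetical.

```python
import re

# Keep tab, LF, CR and printable ASCII (space through tilde); drop the rest.
_UNWANTED = re.compile(r"[^\x09\x0A\x0D\x20-\x7E]")


def strip_non_ascii(text):
    """Remove non-ASCII and control characters, keeping basic whitespace."""
    return _UNWANTED.sub("", text)


sample = "caf\u00e9\tbar\n\x01baz \u05e9"
print(repr(strip_non_ascii(sample)))   # 'caf\tbar\nbaz '
```

Stating what to keep rather than what to remove tends to be easier to audit, since the allowed set is small and explicit.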

php csv - print non ascii characters

Hi! I'm trying to create a CSV file using PHP. How can I print non-ASCII characters? Thanks in advance, Best regards! ...

POSTing XML with non-ASCII characters

I'm trying to patch RestSharp for it to be able to POST XMLs with non-ASCII characters as POST request body. Here's how it gets written: private void WriteRequestBody(HttpWebRequest webRequest) { if (HasBody) { webRequest.ContentLength = RequestBody.Length; var requestStream = webRequest.GetRequestStream(); ...