character-encoding

Trouble encoding a u umlaut with in a .Net http handler

I have a JavaScript request going to a ASP.Net (2.0) HTTP handler which passes the request to a java web service. In this system special characters, such as those with an accent do not get passed on correctly. E.G. Human input: Düsseldorf becomes a JavaScript asynch request to http://site/serviceproxy.ashx?q=D%FCsseldorf, which is val...

Displaying International Text

I am looking to create an ASP.net page that will have a control like GridView or Repeater and the data to be displayed in this page can be either unicode or Utf-8 . I am really struggling to display languages like Hebrew and some asian languages. How do I show any type of language on the ASP.net page?? I have tried the meta tag option ...

Best Resource for Character Encodings

I'm searching for a document (not printed) that explains in details but still simply the subject of character encoding. ...

Setting the default Java character encoding?

How do I properly set the default character encoding used by the JVM (1.5.x) programmatically? I have read that -Dfile.encoding=whatever used to be the way to go for older JVMs... I don't have that luxury for reasons I wont get into. I have tried: System.setProperty("file.encoding", "UTF8"); And the property gets set, but it doesn't...

Character encoding problem - PHP output, read by .NET, via HttpWebRequest

I have a PHP script (running on a Linux server) that ouputs the names of some files on the server. It outputs these file names in a simple text-only format. This output is read from a VB.NET program by using HttpWebRequest, HttpWebResponse, and a StreamReader. The problem is that some of the file names being output contain... unusual c...

StreamReader problem - Unknown file encoding (western iso 88591)

When reading data from the Input file I noticed that the ¥ symbom was not being read by the StreamReader. Mozilla Firefox showed the input file type as Western (ISO-8859-1). After playing around with the encoding parameters I found it worked successfully for the following values: System.Text.Encoding.GetEncoding(1252) // (western iso 8...

Streaming files and character sets problem

I have made some code which exports some details of a journal article to a reference manager called Endnote The format of which is a list of items like below (an author): %A Schortgen Frédérique Unfortunately, I am having some encoding problems somewhere, as when endnote opens the file, this is what it makes of the above author: ...

How to get &nbsp to behave properly using HTML Purifier?

I am using HTML Purifier in my PHP project and am having trouble getting it to work properly with user input. I am having users enter in HTML using a WYSIWYG editor (TinyMCE), but whenever a user enters in the HTML entity   (non-breaking space) it gets saved into the database as this weird foreign character (Â). However, the thing...

Unicode appnames in Django

Hi, I live in Norway, and when i make Django apps i would like to be able to name my apps with characters like "æøå", these characters work fine in unicode, but when i try to use these characters in app names, or in fields display text i get an error. Even better, i would like to name my apps by the english convention, but have somethi...

How to remove (?) while we convert our byte content from unicode to ansi character

I need to convert the unicode characters to ansi characters byte[] encode = Encoding.Convert(Encoding.Unicode, Encoding.Default, report); I use this piece of code. While I am viewing this I found that extra ? character is added in the first part ?FF EE 20 12 ...

How can I find Non-Ascii characters in MySQL

I'm working with a MySQL database that has some data imported from Excel. The data contains non-ascii characters (em dashes, etc.) as well as hidden carriage returns or line feeds. Is there a way using MySQL to find these records? ...

What is meant by htmlencode and urlencode ?

What is meant by HTMLEncode in ASP.net. How we use this HTMLEncode to controls. What is meant by urlEncode and in what way it differenitates from HTMLEncode? ...

Getting the € with htmlentities

I am really trying to show what htmlentities gives me, but it doesn't give & euro; for the € character. I am trying echo htmlentities(htmlentities("LISBOA-VIENA DESDE 99€ TAXAS INCLUÍDAS, RESERVE JÁ",ENT_COMPAT,ISO-8859-1),ENT_COMPAT,ISO-8859-1); echo '<br>'; echo htmlentities(htmlentities("LISBOA-VIENA DESDE 99€ TAXAS INCLUÍDA...

Why does Python's string.printable contains unprintable characters?

I have two String.printable mysteries in the one question. First, in Python 2.6: >>> string.printable '0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!"#$%&\'()*+,-./:;<=>?@[\\]^_`{|}~ \t\n\r\x0b\x0c' Look at the end of the string, and you'll find '\x0b\x0c' sticking out like a sore-thumb. Why are they there? I am usin...

Detecting spanish accents in chars and converting them to non accented chars

I have a large CSV with a list of words from a spanish dictionary. I wish to take each one and put them in a collection belonging to the letter they are in. i.e abertura goes in A However, spanish vowels can have accents in them like the world "ácido" In this case I still want it to go in my "A" collection. I realise of course, I ...

Converting letters to their greek equivalent in Javascript

I have some JSON data from a web service which gives me data like the following blah blah <greek>a</greek> I need to be able to convert what is inside the greek tags into their symbol equivalent, using javascript. Any ideas? ...

How can I tell whether data in table has incorrect encoding?

I have a couple tables that are set to the latin1 character set but I suspect have been erroneously been inserted with some values that are actually encoded using utf8. MySQL makes this a little more complicated because it silently converts everything based on your connection settings. How can I test my hypothesis that there are some u...

Display ñ on a C# .NET application

I have a localization issue. One of my industrious coworkers has replaced all the strings throughout our application with constants that are contained in a dictionary. That dictionary gets various strings placed in it once the user selects a language (English by default, but target languages are German, Spanish, French, Portuguese, Man...

Read Csv file encoding error

I am using the following method for reading Csv file content: /// <summary> /// Reads data from a CSV file to a datatable /// </summary> /// <param name="filePath">Path to the CSV file</param> /// <returns>Datatable filled with data read from the CSV file</returns> public DataTable ReadCsv(string filePath) { ...

Possible Encoding Issue Reading HTM File using .Net Streamreader

I have an HTML file with a ® (copyright) and ™ (trademark) symbol in the text. These are just two among many other symbols. When I read the html file into a literal control it converts the symbols to something else. The copyright symbol converts to � (open box in ff) The trademark symbol converts to ™ (as expected) If (System.IO.File.E...