character-encoding

SVN and accented characters

I have a subversion repository that I administer. When I try to add a file which has accented (non-ASCII) characters in its name, the server refuses the command. The server is running on Ubuntu, the client is Eclipse on Windows. I could not find anything in the SVN FAQ about accented characters in filenames, nor in the SVN Book. I know ...

odd .bat file behavior

I have a bat file with the following contents: set logfile= D:\log.txt java com.stuff.MyClass %1 %2 %3 >> %logfile% when I run the bat file though, I get the following: C:\>set logfile= D:\log.txt C:\>java com.stuff.MyClass <val of %1> <val of %2> <val of %3> 1>>D:\log.txt The parameter is incorrect. I'm almost positive the "...

Problem when reading a txt file in Adobe AIR

I've been working on an app running in a browser and I would like to try to convert it to an AIR one. In my web app, after loading the document, I am loading a text file in a string with jquery. var info; $.get('media/info.txt',function(data) { info=data; alert("the info is "+info.length+" bytes long"); }); In my AIR app, after...

Problem transmitting null character over sockets

I am writing a small Java server, and a matching client in C++, which implement a simple IM service over the STOMP protocol. The protocol specifies that every frame (message that passes between server and client, if you will) must end with a null character, which in code I refer to as '\0', both in Java and in C++. However, when I tran...

How to keep special characters when running ./mysqldump?

How do you keep the special characters when you run mysqldump? On the database it appears correctly when i do a query. But on the file after i export, all special characters look like this: 'R. António Lt1, 1º Dtº' ...

Change from HTML character refrences to utf-8 in a bash script ie. &#257; becomes ā

How would you go about translating a document that contains the following character references to their actual readable characters in a bash script? &#257; &#225; &#462; &#224; &#275; &#233; &#283; &#232; &#299; &#237; &#464; &#236; &#470; &#472; &#474; &#476; &#252; &#470; &#472; &#474; &#476; &#252; These change in order to ā á ǎ à ...

IE munging pound symbol

I have a html form which goes of to do all sorts of strange back end things. This works fine in firefox. and in most cases it works fine in IE However the (pound sterling) £ sign causes problems, and seems to get munged in the submit. The forms is something like this <form action="*MyFormAction*" accept-charset="UTF-8" method="post"...

What is the prefered encoding to store resource text files that need to be translated?

And why? what character encoding trouble have you had and how did you solve it? ...

Codeigniter Character Encoding Issues

Dear StackOverflow Community, I am running Codeigniter. I have a few simple pages which have no DB data, are just static loaded as following from the controller. I have set everything possible to UTF-8 - I have checked headers which result to UTF8 - however it still parses wrong (these characters: ���). function index(){ $this->l...

Displaying Hebrew text in ASP

I am working on ASP application that reads data from sql server and displays it in a table. All my Hebrew text is replaced with "?????". I installed and Configured Hebrew Font in Regional options. I have set the appropriate Charset in Head,Meta tag I am able to see proper Hebrew text in sqlServer2005ManagementSudioExpress. I have prope...

Convert cs file encoding

I recently got asked to review a Visual Studio solution. One of the first things I did was run SourceMonitor on it. But when I tried to analyze it, I got an error that said SourceMonitor could not parse Unicode or UTF-encoded files. Is there a quick was to convert all of the .cs file in a solution to ASCII encoding? ...

setting utf8 with mysql through php

I have the following very simple code, which retrieves utf8 formatetd data, such as containing umlauts from a mysql database, which may or may not be set as utf8. If I use either of the commented out approaches to ensure that utf8 data is returned, the data will NOT be returned as utf8, however if I leave them off, the data will be displ...

How to deal with special characters in ASP.NET's HyperLink.NavigateUrl?

I am currently having troubles figuring out how to handle a filepath to be (dynamicly) passed out to a HyperLink control's NavigateUrl property. Let's say that I'm trying to refer to a file named jäynä.txt at the root of C:. Passing "file:///C:/jäynä.txt" result to a link to file:///C:/jäynä.txt, as does HttpUtility.UrlPathEncode("fil...

Converting ú to u in javascript

How would I convert ú into u in javascript. I might possibly need it for other non-english characters too. ...

System.IO.StreamWrite and Spanish Characters

I need to write the following string to a txt a File: SEGS,AUS1,1,0,0,712205,584,8659094,2,NUÑEZ FELIX ARTURO,584 I when I use: using (System.IO.StreamWriter sw = new System.IO.StreamWriter(@fileSobrantes, true)) { sw.WriteLine("SEGS,AUS1,1,0,0,712205,584,8659094,2,NUÑEZ FELIX ARTURO,584"); } I get this in the file SEG...

.NET: How do I tell if an encoding supports all the chars in my string?

I've got lots of text that I need to output, which includes all sorts of characters from many languages. Sometimes I need to output the text in character encodings other than Unicode (eg, Shift-JIS, or ISO-8859-2), in order to match the page it's going to. If the text has characters that the encoding can't handle (eg, Japanese character...

How to convert UTF-8 character to ISO Latin 1?

I need to convert a UTF-8 trademark sign to a ISO Latin 1, and save it into database, which is also ISO Latin 1 encoded. How can I do that in java? I've tried something like String s2 = new String(s1.getBytes("ISO-8859-1"), "utf-8"); but it seems not work as I expected. ...

How can I convert non-ASCII characters encoded in UTF8 to ASCII-equivalent in Perl?

I have a Perl script that is being called by third parties to send me names of people who have registered my software. One of these parties encodes the names in UTF-8, so I have adapted my script accordingly to decode UTF-8 to ASCII with Encode::decode_utf8(...). This usually works fine, but every 6 months or so one of the names contai...

Broken accent characters when Copy / Paste into ASP .Net

I am copy pasting from an MS word document into an ASCX file. When I view the ascx file, the accented characters appear normally. BUT, when the page is rendered through my ASP.net application, the accented characters are broken: Une promenade dans un verger ensoleillé, un peau de pêche délicatement parfumée… Les plaisirs du pr...

utf-8 vs Unicode

I have heard conflicting opinions from people - according to wikipedia, see here there are the same thing. Are they? can someone clarify? ...