encoding

Why is .NET unable to decode the string "Hi%E1" when Java can?

My problem is that the .NET Http/Uri libraries cannot decode or unescape this character sequence: "Hi%E1". Neither Uri.UnescapeDataString nor HttpUtility.UrlDecode can do it. Although I have a workaround for this problem ( http://stackoverflow.com/questions/1221849/url-decoding-confusion ), I would like to understand why ...
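
A minimal sketch of one plausible reading of the problem, written in Python rather than .NET or Java. It assumes the cause is that %E1 is the single byte 0xE1, which is "á" in ISO-8859-1 but an incomplete sequence in UTF-8; the truncated excerpt does not confirm this.

```python
from urllib.parse import unquote, unquote_to_bytes

escaped = "Hi%E1"

# Percent-decoding to raw bytes always works: %E1 is just the byte 0xE1.
raw = unquote_to_bytes(escaped)

# Interpreting that byte as ISO-8859-1 (latin-1) gives "á".
as_latin1 = unquote(escaped, encoding="latin-1")

# Interpreting it as UTF-8 (the default) fails, because 0xE1 starts a
# three-byte sequence that is never completed. With the default
# errors="replace", the byte becomes U+FFFD.
as_utf8 = unquote(escaped)

print(raw, as_latin1, repr(as_utf8))
```

A decoder that assumes UTF-8 has no valid character to produce here, while one that is told (or guesses) ISO-8859-1 does.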

'ASCII' to Unicode error in Python when attempting to read a latin-1 encoded string

Hi all, I'm having a problem when trying to apply a regular expression to some strings encoded in latin-1 (ISO-8859-1). What I'm trying to do is send some data via HTTP POST from a page encoded in ISO-8859-1 to my Python application and do some parsing on the data using regular expressions in my Python script. The web page uses jQuer...

Inconsistent get_class_methods vs method_exists when using UTF8 characters in PHP code

I have this class in a UTF-8 encoded file called EnUTF8.Class.php: class EnUTF8 { public function ñññ() { return 'ñññ()'; } } and in another UTF-8 encoded file: require_once('EnUTF8.Class.php'); require_once('OneBuggy.Class.php'); $utf8 = new EnUTF8(); //$buggy = new OneBuggy(); echo (method_exists($utf8, 'ñññ'))...

How to encode/decode escape sequence characters in Python

How can I encode/decode an escape sequence character such as '\x13' in Python into a character that is valid in RSS or XML? The use case: I am getting data from arbitrary sources and building an RSS feed from that data. The data source sometimes contains escape sequence characters, which break my RSS feed. So how can I sanitize the input data with e...
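
One possible approach, sketched here under the assumption that dropping the offending characters is acceptable: remove every code point that falls outside the character ranges XML 1.0 allows. The helper name `strip_invalid_xml_chars` is hypothetical and not taken from the question.

```python
import re

# XML 1.0 permits only tab, LF, CR, and the ranges U+0020-U+D7FF,
# U+E000-U+FFFD, and U+10000-U+10FFFF. Everything else (for example
# the control character '\x13') is matched by this negated class.
_INVALID_XML_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml_chars(text: str) -> str:
    """Return text with characters that XML 1.0 forbids removed."""
    return _INVALID_XML_CHARS.sub("", text)


cleaned = strip_invalid_xml_chars("feed\x13 title\twith tab")
print(repr(cleaned))
```

Characters such as `<` and `&` are legal XML characters and are left alone here; they still need the usual entity escaping when the feed is written.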

php remove/identify this symbol �

EDIT: Ok I have some data (A ton of data) being pulled from a MySQL DB Table, nothing special about how the data is entered. When parsing the data and re-displaying it to Firefox this symbol � shows up. When I compare it to the DB entry it looks like a space (Nothing special). I'm using all the default PHP/MySQL settings. Doing a var_d...

Algorithms: data binarization

I have a huge dataset with words word_i and weights weight[i,j], where weight is the "connection strength" between words. I'd like to binarize this data, but I want to know if there is any existing algorithm to make binary code of each word in such a way that the Hamming distance between the codes of the words correlates with this weig...

Ruby IRB output is messed up in the console on Windows 7

I am getting very strange outputs from my IRB console. Here it is: irb(main):001:0> File.dirname(__FILE__) => ←[0;31m"←[0;0m←[0;36m.←[0;0m←[0;31m"←[0;0m Seems like an encoding issue, right? I am not sure why this is happening. Any ideas how to fix it? ...

Make a custom string encoder in .NET

I know .NET supports base64 encoding of byte arrays, but I thought that I could save even more space if I used a larger set of characters. I read somewhere that Unicode supports thousands of different characters, so why not use base1024 encoding, for example? And if this is possible, can you give some guidelines on how to implement it? Tha...

What is ViewState? How is it encoded? Is it encrypted? Who uses ViewState?

What is ViewState? How is it encoded? Is it encrypted? Who uses ViewState? ...

Change File Encoding to utf-8 via vim in a script

Hi, I just got knocked down after our server was updated from Debian 4 to 5. We switched to a UTF-8 environment and now we have problems getting the text displayed correctly in the browser, because all files are in non-UTF-8 encodings like iso-8859-1, ascii, etc. I tried many different scripts. The first one I tried is "iconv". That o...

Changing character sets on a live MySQL database

I currently have a bunch of tables using the latin1 charset in a MySQL 5.1.x DB. Problem is, we recently had a bunch of users trying to input text using UTF-8 encoding, and that seemed to break things. Is it safe to blindly update the table's character set? What are some best practices (besides obviously backing everything up) for a sit...

ActiveRecord pulls records out in ASCII encoding in Ruby 1.9

Hello, I'm trying to migrate my app to Ruby 1.9, however ActiveRecord keeps retrieving records out of my MySQL database with an ASCII encoding, causing "incompatibility between utf-8 and ASCII" like errors. I've tried setting the "encoding: utf-8" in the database.yml file, and I've also tried putting " #coding: utf-8 " at the top the err...

Twitter posting tweet in url

Hello everyone, I'm trying to use a 'tweet this page' function in a flash piece using the url format 'http://www.twitter.com/home?status=go+to+this+page+http://www.example.com/#/page10'. The flash project has deep-linked urls using SWFAddress. This requires the urls to have the pound character ( http://www.example.com/#/page10 ). I ...

How to store custom objects in NSUserDefaults

Alright, so I've been doing some poking around, and I realize what my problem is, but I don't know how to fix it. I have made a custom class to hold some data. I make objects of this class, and I need them to last between sessions. Before, I was putting all my information in NSUserDefaults, but this isn't working. -[NSUserDefaults setObje...

Escaping non-ASCII characters (or how to remove the BOM?)

I need to create an ANSI text file from an Access recordset that outputs to JSON and YAML. I can write the file, but the output is coming out with the original characters, and I need to escape them. For example, an umlaut-O (ö) should be "\u00f6". I thought encoding the file as UTF-8 would work, but it doesn't. However, having looked a...
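
A minimal sketch of the two operations named in the title, written in Python rather than Access/VBA (a substitution, not the asker's environment). It assumes the input arrives as UTF-8 bytes that may start with a BOM; the sample data is made up for illustration.

```python
import json

# Hypothetical input: UTF-8 bytes with a leading byte-order mark.
raw = b"\xef\xbb\xbfBj\xc3\xb6rn"

# The "utf-8-sig" codec decodes UTF-8 and drops a leading BOM if present.
text = raw.decode("utf-8-sig")

# json.dumps escapes every non-ASCII character as \uXXXX by default
# (ensure_ascii=True), so the result is safe to write as plain ASCII.
escaped = json.dumps({"name": text}, ensure_ascii=True)

print(escaped)
```

Because the escaped output contains only ASCII characters, it reads the same whether the file is later opened as ANSI, ISO-8859-1, or UTF-8.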

HttpServletRequest - SetParameter

I know that I can use HttpServletRequest.getParameter() to get the URL parameter values. Is there an equivalent method with which I can set/replace the value? ...

IE7 iso-8859-1 encoding

I'm noticing that when posting a form through IE7, accented characters are getting messed up as soon as the request hits the server. But in FF3 it works just fine. The page encoding is set to ISO-8859-1. Is there any particular place I should start investigating? Thanks in advance! EDIT: The behavior I'm noticing is specifically a ü ch...

Is there any non-GPL open-source C/C++ H264 encoding library?

Could someone please point me to an open-source H264 encoding library (written in C/C++) that is not licensed under the GPL? It is for a non-commercial app, of course. I just do not want to open its sources. ...

XML column in SSIS has byte-order-mark

I'm using an oledb data source in an SSIS package to pull a column from a database. The column is XML data type. In SSIS, it is automatically recognized as data type DT_NTEXT. It's going to a script component where I'm trying to load it into a System.Xml.XmlDocument. This is the code that I'm using to get the xml data into a string: Sys...

Defining the character encoding of a JavaScript source file

I would like to print a status message to my German users, which contains umlauts (ä/ü/ö). I would also like them to be in the source file rather than having to download and parse some extra file just for the messages. However, I can't seem to find a way to define the encoding of a JS source file. Is there something like HTML's http-equiv?...