I've got a bunch of data which could be mixed characters, special characters, and 'accent' characters, etc.
I've been using PHP's iconv with translit, but noticed today that a bullet point gets converted to 'bull'. I don't know which other characters like this fail to get converted or deleted.
$, *, %, etc do get removed.
Basically what...
Hi folks,
This question has come up before, but the answers given didn't help me.
I have text input boxes which display the " symbol in place of the @ and vice-versa.
When I test locally, my @ and " appear correctly; the problem arises only when I upload to the server.
I read that you can have "Use device fonts" set for text...
So, I'm using SharePoint's SPUtility.SendEmail() to send an email with non-ascii characters in the subject line. The problem is that the Icelandic character 'ð' is scrambled to '?'. This only happens in the subject line, the message body is fine.
The problem does not seem to be with the email client, since the problem appears both in Gm...
Hello, I'm using a JSON object to store different bits of content, one of which can contain JavaScript- and HTML-relevant content (like quotes, semicolons, tags, etc.) which, without encoding, can break the page. To work around this I'm using:
"content":"<%=StringEscapeUtils.escapeHtml(StringEscapeUtils.escapeJavaScript(content))%>"
(I'm usi...
I have a project where I scrape some data from an HTML file, put it into a MySQL database, and then read it out again and display it to the user. Whenever the servlet inserts something into the DB, the "£" sign is inserted as a � and is displayed that way when read back out.
The application is written in Java (with Spring) and JPA ...
I've got a database where, for efficiency, I've put the data into the DB in HTML-encoded form.
I do maintenance on the data, and then move it into production via an 'into outfile', so it ends up in a text file.
The special characters don't make it across cleanly, and the output comes out garbled.
Is there a way to maintain t...
I am using Java NIO for my socket connections, and my protocol is text based, so I need to be able to convert Strings to ByteBuffers before writing them to the SocketChannel, and convert the incoming ByteBuffers back to Strings. Currently, I am using this code:
public static Charset charset = Charset.forName("UTF-8");
public static Cha...
Update: Apparently these are control characters, not Unicode characters.
I'm trying to parse an XML file which has an odd character in it that makes it invalid and is causing my tools (Firefox, Nokogiri) to complain.
Here's what the character looks like in Firefox, and what it looks like when I copy and paste it into Textmate (I'm on O...
I need to replace Microsoft Word version of single and double quotations marks (“ ” ‘ ’) with regular quotes (' and ") due to an encoding issue in my application. I do not need them to be HTML entities and I cannot change my database schema.
I have two options: to use either a regular expression or an associative array.
Is there a bett...
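For what it's worth, one possible approach to the quote replacement described above is a plain character translation table. This is only a sketch in Python (the question does not say which language the application uses), and `SMART_QUOTES` and `normalize_quotes` are made-up names for illustration:

```python
# Hypothetical helper: map Word's curly quotes to plain ASCII quotes.
SMART_QUOTES = str.maketrans({
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
})


def normalize_quotes(text: str) -> str:
    """Return text with curly quotes replaced by straight ones."""
    return text.translate(SMART_QUOTES)


print(normalize_quotes("\u201cIt\u2019s fine\u201d"))
```

A translation table does the whole replacement in a single pass and leaves every other character untouched, so it needs neither a regular expression nor a loop over an array.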
LPCTSTR Machine=L"Network\\Value";
char s[100]="Computer\\";
strcat(s,(const char*)Machine);
printf("%s",s);
Here I get the output Computer\N only, but I expect the output Computer\Network\Value.
Please suggest a solution.
...
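A likely explanation for the Computer\N output, assuming a Windows build with UNICODE defined (so that LPCTSTR is a 16-bit, little-endian wide string): every ASCII character in the wide string is followed by a zero byte, and strcat stops copying at the first zero byte it meets. The sketch below only imitates that byte layout in Python; it is not the original C code, and the variable names are made up:

```python
# Assumption: wchar_t is 16-bit little-endian, as on Windows.
# L"Network\\Value" in memory, including the wide NUL terminator:
wide = "Network\\Value".encode("utf-16-le") + b"\x00\x00"
print(wide[:6])  # b'N\x00e\x00t\x00'

# strcat treats the buffer as a narrow string, so it copies bytes
# only up to the first zero byte, which comes right after 'N'.
narrow_view = wide.split(b"\x00", 1)[0]

result = b"Computer\\" + narrow_view
print(result.decode("ascii"))  # Computer\N
```

If that assumption holds, the cast to `const char*` is the problem. The usual ways out are to keep both strings wide (for example `wcscat` with a `wchar_t` buffer) or to convert the wide string first, for instance with `WideCharToMultiByte`.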
Hi there,
I'm having a strange encoding issue converting an NSString to a UTF-8 C string. I'm fetching XML data from a web server. The XML is correctly encoded as UTF-8. After fetching the data, I convert it to an NSString as follows:
NSString *XMLdata = [[[NSString alloc] initWithData: receivedData encoding: NSUTF8StringEncoding] autorelease];
...
I am reading an XML document (UTF-8) and ultimately displaying the content on a Web page using ISO-8859-1. As expected, a few characters are not displayed correctly, such as “, – and ’ (they display as ?).
Is it possible to convert these characters from UTF-8 to ISO-8859-1?
Here is a snippet of code I have written to attempt...
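One thing worth noting here: “, – and ’ have no code points in ISO-8859-1 at all, so a direct conversion cannot keep them. If the output is HTML, one option is to emit numeric character references for anything the target charset lacks. A minimal sketch in Python (the question's own language is not visible in the excerpt, and `to_latin1_html` is a made-up name):

```python
def to_latin1_html(utf8_bytes: bytes) -> bytes:
    """Re-encode UTF-8 input as ISO-8859-1 for an HTML page.

    Characters missing from ISO-8859-1 become numeric character
    references (&#NNNN;) instead of being replaced by '?'.
    """
    text = utf8_bytes.decode("utf-8")
    return text.encode("iso-8859-1", errors="xmlcharrefreplace")


sample = "\u201cquoted\u201d \u2013 caf\u00e9".encode("utf-8")
print(to_latin1_html(sample))
```

The é survives as a single ISO-8859-1 byte, while the curly quotes and the en dash come out as references that a browser renders correctly regardless of the page charset.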
When I read a text file to a wide character string (std::wstring) using a wifstream, does the stream implementation support different encodings - i.e. can it be used to read e.g. ASCII, UTF-8, and UTF-16 files?
If not, what would I have to do?
(I need to read the entire file, if that makes a difference)
...
In the real world, are there any issues with using the HTML entities in Grade A browsers for → right and left arrows per this link? Can everyone reading this page see these arrows?
Apparently if the character is not in your font set, they won't appear, but so far my work has been for English-speaking audiences...don't the arrows come s...
Or is there such a tag?
...
In the iPhone Push Notification documentation, they have a code snippet in which they override the UIApplication method that receives a device token:
- (void)application:(UIApplication *)app didRegisterForRemoteNotificationsWithDeviceToken:(NSData *)devToken {
const void *devTokenBytes = [devToken bytes];
self.registered = YES;
[self ...
I have a text file and it can be ANSI (with ISO-8859-2 charset), UTF-8, UCS-2 Big or Little Endian.
Is there any way to detect the encoding of the file to read it properly?
Or is it possible to read a file without giving the encoding? (and it reads the file as it is)
(There are several programs that can detect and convert encoding/form...
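For the encodings listed in this question, a common heuristic is to look for a byte order mark first and only then fall back to guessing. The following is a rough sketch in Python, not a reliable detector: `guess_encoding` is a made-up name, the ISO-8859-2 fallback is taken from the question, and UCS-2 files without a BOM are not recognised.

```python
import codecs


def guess_encoding(data: bytes, fallback: str = "iso-8859-2") -> str:
    """Guess a codec name from the first bytes of a file (heuristic only)."""
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"  # this codec strips the BOM while decoding
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "utf-16"  # this codec reads the BOM to pick the byte order
    try:
        data.decode("utf-8")  # strict decoding rejects most 8-bit text
        return "utf-8"
    except UnicodeDecodeError:
        return fallback  # assumption: anything else is the ANSI charset


raw = "za\u017c\u00f3\u0142\u0107".encode("iso-8859-2")
print(guess_encoding(raw), raw.decode(guess_encoding(raw)))
```

Without a BOM there is no way to be certain, since any byte sequence is valid ISO-8859-2; the strict UTF-8 check just happens to reject most text written in an 8-bit charset.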
I'm trying to do a bunch of translating of HTML-encoded text into UTF-8 to put it into my database.
There are a ton of characters that get missed by both html_entity_decode and iconv with translit.
I've written up a long list of characters to strip out, but now I see that &Yuml is not translated, but ÿ is.
I'm sure there are othe...
I'm writing a PHP server and the client is sending data in a specific character encoding. Now, I want to tell the server to read and write data in this same character encoding set.
How should I specify the character encoding set for PHP's socket_read and socket_write methods?
...
I've been scouring Google and can't seem to find an answer. I'm running Oracle 10g Enterprise with the following character-set: AR8MSWIN1256
The database holds English and Arabic values, and I need to differentiate between the two from time to time. I wrote this script, which matches on English words but not on Arabic words:
create or ...