character-encoding

Encoded character is used instead the correct one

I have a little problem and I'm hopping that you can help me solve this annoying issue. I need to use an iFrame in an administration panel to let users use the selection service, and in the HTML I have: <iframe scrolling="yes" runat="server" title="Par Selection" id="iFrame" frameborder="0" enableviewstate="true" width="100%" heigh...

What does it mean for a char to be signed?

Given that signed and unsigned ints use the same registers, etc., and just interpret bit patterns differently, and C chars are basically just 8-bit ints, what's the difference between signed and unsigned chars in C? I understand that the signedness of char is implementation defined, and I simply can't understand how it could ever make a...

Which character set to choose when compiling a c++ dll

Could someone give some info regarding the different character sets within visual studio's project properties sheets. The options are: None Unicode Multi byte I would like to make an informed decision as to which to choose. Thanks. ...

Does converting a column from varchar to nvarchar change the encoding of the strings stored in the column?

If I have a database containing only varchar columns, with strings encoded in Latin-1 (and collation set accordingly), will converting the columns to nvarchar using ALTER TABLE commands change the encoding of the column contents to unicode (UTF-16), or do I have to convert them myself? ...

What is the difference between a 'character' and an 'octet'?

I see the term 'octet' popping up in literature about nonces for hashing, and it seems to be synonymous with 'character', although there is a kind of pattern to how the words are used. This leads me to believe that there is a formal distinction between the two. If anyone could enlighten me to what it is, I'd appreciate it. (and please,...

Windows Codepage Interactions with Standard C/C++ filenames?

A customer is complaining that our code used to write files with Japanese characters in the filename but no longer works in all cases. We have always just used good old char * strings to represent filenames, so it came as a bit of a shock to me that it ever worked, and we haven't done anything I am aware of that should have made it stop...

How to set the character encoding in a yaml file

We are working with the simple backend for the new Rails 2.2 i18n system, and I wanted to know the proper syntax for setting the encoding in a yaml file. On other words what is the yaml for this xml: <?xml encoding="UTF-8" ?> ...

How to use resources in VC++?

I am using VC 9 and I want to support Russian language for my application. I even created Russian resource strings. But my system has Russian Language setting. If it is not there every character displays junk (its code page is 1251). I also made DLL from Russian resource file. If I run that DLL in application from installed location, it ...

Detect file encoding in PHP

I have a script which combines a number of files into one, and it breaks when one of the files has UTF8 encoding. I figure that I should be using the utf8_decode() function when reading the files, but I don't know how to tell which need decoding. My code is basically: $output = ''; foreach ($files as $filename) { $output .= file_ge...

What charset does Microsoft Excel use when saving files?

I have a Java app which reads CSV files which have been created in Excel (e.g. 2007). Does anyone know what charset MS Excel uses to save these files in? I would have guessed either: windows-1255 (Cp1255) ISO-8859-1 UTF8 but I am unable to decode extended chars (e.g. french accentuated letters) using either of these charset types. ...

Handing Non-UTF8 content in my Rails application appropriately

I have a Rails application that allows users to import information from various sources using RSS feeds and such. My default encoding on the database is UTF8 and I've been receiving a lot of exceptions in regards to non-UTF8 data that is coming through the system and crashing once it hits the database. I'm to appropriately detect the n...

Unix vs. Windows rendering of characters

I have a text file that display differently when opening it in FreeBSD vs. Windows. On FreeBSD: An·lisis e InvestigaciÛn On Windows: Análisis e Investigación The windows representation is obviously right. Any ideas on how to get that result in bsd? ...

Strange characters returned from byte stream?

When trying to display a byte stream from HLDS (Half-Life Dedicated Server) in a textbox, it displays strange blocky question mark characters that look something like this: [?] Here's a sample line from the byte stream (with [?] in place of the strange character): CPU In Out Uptime Users FPS Players[?] 0.00 0.97 0.91 ...

Should I still use html entities? Why?

Is html entities still useful or should I simply create UTF-8 encoded html documents? Please explain why. ...

C programming: How to program for Unicode?

What prerequisites are needed to do strict Unicode programming? Does this imply that my code should not use char types anywhere and that functions need to be used that can deal with wint_t and wchar_t? And what is the role played by multibyte character sequences in this scenario? ...

Charset for foreign languages

I'm currently doing some HTML that with urdu, farsi and chinese simplified characters. I'm having problems finding good resources online on what charset to use: <meta http-equiv="Content-Type" content="text/html; charset=???" /> Any suggestions? ...

server-side includes and character encoding

Hi, I created this static website in which each page has the following structure: Common stuff like header, menu, etc. Page specific stuff in main content div Footer In the website linked above all the common stuff was duplicated in each page. In order to improve the maintainability I refactored the pages to use server-side includes...

change file encoding

Hi, I have a problem with character encoding in some HTML pages. It seems that the cause of the problem is that some of the .html files are not saved as UTF-8 encoded files. Even though I have instructed Eclipse to save these files as UTF-8, when I open them in a browser, it indicates that the files are ISO-8859-1. How can I change the...

How to make nntplib author name human readable?

python NNTPLib is giving me author name such as , "=?Utf-8?B?RGVubmlzIEJhc2hhbQ==?= < [email protected] >" (Quotes for clarity). How do i encode this text in human readable format? ...

Displaying a Downward Triangle in VB.NET ▼ (U+25BC)

Hey, I'm trying to figure out how to display the ▼ character properly in a .NET winform application. I am creating a custom control, and for the button, I want this character to appear. I am able to set the text to this character, but it appears as a blank square. Any ideas on what I need to do to make this character appear properl...