views:

29

answers:

1

Does IRC support internationalized (UTF-8) room names?

How? A pointer to documentation or a spec would be welcome.

+1  A: 

According to RFCs 1459 and 2812, channel names can be made up of any bytes other than \x00 (NUL), \x07 (BELL), \x0A (LF), \x0D (CR), \x20 (space), , and :. How those bytes are interpreted is entirely up to the client. IRC doesn't impose any specific restrictions on the encoding.

No specific character set is specified. The protocol is based on a set of codes which are composed of eight (8) bits, making up an octet. Each message may be composed of any number of these octets; however, some octet values are used for control codes, which act as message delimiters.

jamessan
This is true, but conventions apply on many servers (for example, on Japanese servers the convention is often to interpret channel names and topics in JIS).
caf