ansaurus

Question

How many non-printing characters are in common use?

Answer 1

+1 A:

In development world there's at least one more (most often used in web development)

&nbsp;  // non-breaking space

But the more you get to design world the more you see various space/invisible characters. Publishing software normally has

space - the regular SPACE
en space
em space
thin space
hair space
non-breaking space
non-breaking fixed width space
sixth space
quarter space
third space
punctuation space
flush space
figure space
...

Robert Koritnik 2009-10-26 21:41:49

Yes, 0xA0 ; see http://en.wikipedia.org/wiki/Non-breaking_space

peter.murray.rust 2009-10-26 21:46:12

@Robert could you please list the numbers?

peter.murray.rust 2009-10-26 21:49:06

#fail. I've just written the ones I see in my InDesign. I'm not sure if all of them are actual UNICODE standard ones. Sorry. Some are rather design oriented (like flush space) and maybe exist only in software.

Robert Koritnik 2009-10-26 22:43:50

Answer 2

+1 A:

Unicode will be with us, in increasing quantity, for a long time. If an HTML or XML document is written in UTF-8 encoded Unicode, then you should expect any and all of these to appear.

In Unicode (Unicode Character Database) the following codepoints are defined as whitespace:

U+0009–U+000D (control characters, containing Tab, CR and LF)
U+0020 SPACE
U+0085 NEL (control character next line)
U+00A0 NBSP (NO-BREAK SPACE)
U+1680 OGHAM SPACE MARK
U+180E MONGOLIAN VOWEL SEPARATOR
U+2000–U+200A (different sorts of spaces)
U+2028 LS (LINE SEPARATOR)
U+2029 PS (PARAGRAPH SEPARATOR)
U+202F NNBSP (NARROW NO-BREAK SPACE)
U+205F MMSP (MEDIUM MATHEMATICAL SPACE)
U+3000 IDEOGRAPHIC SPACE

Michael Dillon 2009-10-26 21:42:51

@Michael thanks - useful. Doesn't overlap with the ones I listed.

peter.murray.rust 2009-10-26 21:48:02

ansaurus

tags:

views:

answers:

How many non-printing characters are in common use?

related questions