ansaurus

Question

Answer 1

+1 A:

This will match a single non-ASCII char:

[^\x00-\x7F]

This is valid PCRE (Perl-Compatible Regular Expression).
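For anyone who wants to try the character class outside of grep, here is a minimal sketch in Python. It assumes Python's `re` module, which is not PCRE but accepts this particular class with the same meaning. The names `NON_ASCII`, `lines_with_non_ascii`, and `sample` are made up for illustration.

```python
import re

# Same character class as above: any character outside the range 0x00-0x7F.
# Note: this uses Python's `re` engine, not PCRE (assumption for this sketch).
NON_ASCII = re.compile(r"[^\x00-\x7F]")


def lines_with_non_ascii(lines):
    """Return (line_number, line) pairs for lines containing a non-ASCII character."""
    return [
        (number, line)
        for number, line in enumerate(lines, start=1)
        if NON_ASCII.search(line)
    ]


# Hypothetical sample input; the tab in the third line is still ASCII (0x09).
sample = ["plain ascii", "caf\u00e9 au lait", "tab\there"]
print(lines_with_non_ascii(sample))  # → [(2, 'café au lait')]
```

This behaves roughly like a grep that prints matching lines with their line numbers; only the second line is reported because it contains U+00E9.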

EDIT: [^[:print:]] will probably suffice for you.

Alix Axel 2010-01-23 18:16:50

Don't you mean [~\x20-\x7f]?

adrianm 2010-01-23 19:34:57

@adrianm: No, `^` is valid in PCRE.

Alix Axel 2010-01-23 20:05:39

That's exactly right. However, you have to use pcregrep, not standard grep. [^[:print:]] won't work if your terminal is set up in UTF-8.

Rory 2010-01-24 12:24:40

Answer 2

+1 A:

You could also check this page: Unicode Regular Expressions, as it contains some useful Unicode character classes, like:

\p{Control}: an ASCII 0x00..0x1F or Latin-1 0x80..0x9F control character.
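As a rough illustration of what that class covers, here is a sketch in Python. Python's built-in `re` module does not support `\p{...}`, so this swaps in `unicodedata.category` and checks for the Unicode general category `Cc` (control) instead. The function name `control_chars` is hypothetical. One difference worth noting: category `Cc` also includes U+007F (DEL) in addition to the ranges listed above.

```python
import unicodedata


def control_chars(text):
    """Return (index, hex code point) for each character in Unicode category Cc."""
    return [
        (index, hex(ord(ch)))
        for index, ch in enumerate(text)
        if unicodedata.category(ch) == "Cc"
    ]


# \x07 (BEL) is an ASCII control character; \x85 (NEL) is a Latin-1 one.
print(control_chars("a\x07b\x85c"))  # → [(1, '0x7'), (3, '0x85')]
```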

Rubens Farias 2010-01-23 18:58:25

(grep) Regex to match non-ascii characters?