ansaurus

Question

Java: how to check if character belongs to a specific unicode block?

Answer 1

+4 A:

Yes, you can simply use Character.UnicodeBlock.of(char).
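As a minimal sketch of how that call might be used (the class name UnicodeBlockCheck and the helper isInBlock are hypothetical names chosen for illustration, not part of the JDK), the block returned by Character.UnicodeBlock.of can be compared against one of the predefined block constants:

```java
final class UnicodeBlockCheck {
    // Hypothetical helper: true if the code point lies in the given block.
    // Character.UnicodeBlock.of returns null for code points that are not
    // a member of any defined block, in which case equals() yields false.
    static boolean isInBlock(int codePoint, Character.UnicodeBlock block) {
        return block.equals(Character.UnicodeBlock.of(codePoint));
    }
}
```

For example, UnicodeBlockCheck.isInBlock('a', Character.UnicodeBlock.BASIC_LATIN) would be true. Taking an int code point rather than a char lets the same helper handle supplementary characters; a char argument is widened automatically.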

Dennis Cheung 2009-01-01 08:49:02

Answer 2

A:

You have the opposite problem to this one, but ironically, what doesn't work for him should work well for you: just look for English words (ASCII-compatible characters only) with the regular expression "\w".
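A rough sketch of that idea, assuming the default regex flags (the class AsciiWords and the method isAsciiWord are hypothetical names). Note that by default \w also accepts digits and the underscore, not only letters:

```java
import java.util.regex.Pattern;

final class AsciiWords {
    // Without the UNICODE_CHARACTER_CLASS flag, \w matches only [a-zA-Z_0-9].
    static final Pattern ASCII_WORD = Pattern.compile("\\w+");

    // Hypothetical helper: true if the whole string consists of ASCII word characters.
    static boolean isAsciiWord(String s) {
        return ASCII_WORD.matcher(s).matches();
    }
}
```

With this, AsciiWords.isAsciiWord("hello") is true, while a string containing an accented letter is rejected.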

Fernando Miguélez 2009-01-03 13:08:46

Answer 3

+1 A:

If [A-Za-z]+ meets your requirement, you aren't going to find anything faster or prettier. However, if you want to match all letters in the Latin1 block (including accented letters and ligatures), you can use this:

Pattern p = Pattern.compile("[\\pL&&\\p{L1}]+");

That's the intersection of the set of all Unicode letters and the set of all Latin1 characters.
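To illustrate, here is a small sketch that wraps the same pattern in a helper (the class Latin1Letters and the method isLatin1Word are hypothetical names added for this example):

```java
import java.util.regex.Pattern;

final class Latin1Letters {
    // Same pattern as above: Unicode letters intersected with the Latin-1 range.
    static final Pattern LATIN1_LETTERS = Pattern.compile("[\\pL&&\\p{L1}]+");

    // Hypothetical helper: true if every character is a letter within Latin-1.
    static boolean isLatin1Word(String s) {
        return LATIN1_LETTERS.matcher(s).matches();
    }
}
```

A word such as "caf\u00e9" (with an accented e) should be accepted, while digits or Cyrillic letters should be rejected because they fall outside one of the two sets.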

Alan Moore 2009-01-04 11:31:37
