Does anyone know of a great small open source Unicode handling library for C or C++? I've looked at ICU, but it seems way too big.
I need the library to support:
- all the normal encodings
- normalization
- finding character types - finding if a character should be allowed in identifiers and comments
- validation - recognizing nonsense