views:

67

answers:

1

Not the base ones (like wc)... I need tools to build tag clouds, to compute semantic distance between files, to extract word-dictionary from a file, etc.

+2  A: 

Its a broad question... Have you looked into NLTK. This is an open source library, written in Python, which provides very many Natural Language processing primitives. This would not work 'out of the box' ("some assembly required ;-) ) but would allow any customization that you may require.

mjv