how can I tell nltk to treat the text in a particular language?
BKG: once in a while i write a specialized NLP routine to do POS tagging, tokenizing etc. on a non-english (but still hindo-european) text domain.
this question seem to address only different corpora, not the change in code / settings: http://stackoverflow.com/questions/1639855/nltk-tagging-in-german
alternatively, are there any specialized hebrew / spanish / polish NLP module for python?
thanks