For use in analyzing documents on the Internet!

+2  A: 
John Paulett
It does use NLTK throughout and is a great book if you want to do NLP with Python, just as the name says! :)
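
To give a rough feel for what working with NLTK looks like, here is a minimal sketch (not taken from the book) that counts the most frequent words in a string. It assumes NLTK is installed; `top_words` and the sample sentence are made up for illustration. It uses `wordpunct_tokenize`, which is regex-based and so should not need any extra NLTK data downloads.

```python
from nltk import FreqDist
from nltk.tokenize import wordpunct_tokenize


def top_words(text, n=3):
    """Return the n most frequent alphabetic tokens in text, lowercased.

    Hypothetical helper for illustration only.
    """
    # Split into runs of word characters and runs of punctuation,
    # then keep only purely alphabetic tokens.
    tokens = [t.lower() for t in wordpunct_tokenize(text) if t.isalpha()]
    # FreqDist is a Counter subclass, so most_common() is available.
    return FreqDist(tokens).most_common(n)


sample = "The cat sat on the mat. The mat was flat."
print(top_words(sample, 2))
```

For real documents you would probably want a proper sentence and word tokenizer plus stopword removal, which NLTK also provides but which need its data packages.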
ealdent
+3  A: 

Alternatively, R has many tools available for text mining, and it's easy to integrate with Python using RPy2.

Have a look at the Natural Language Processing view on CRAN. In particular, look at the tm package.

In addition, R provides many tools for parsing HTML or XML. Have a look at this question for an example using the RCurl and XML packages.
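
If you would rather stay in Python for the HTML step, something similar can be sketched with the standard library alone. This is only an illustration of the "strip the markup, then count terms" idea, not the RCurl/XML approach; `TextExtractor`, `html_to_text`, `term_counts` and the sample page are hypothetical names and data.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside <script> or <style>.
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(html):
    """Return the text content of an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def term_counts(html):
    """Count lowercase alphabetic terms in the page text."""
    words = re.findall(r"[a-z]+", html_to_text(html).lower())
    return Counter(words)


# Made-up sample page; in practice you would fetch it first,
# for example with urllib.request.
page = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><p>Text mining mines text.</p></body></html>"
)
print(term_counts(page).most_common(1))
```

The resulting counts are a crude term-frequency table; a package like tm (or NLTK on the Python side) adds stemming, stopword handling and document-term matrices on top of this.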

Shane