views:

95

answers:

0

I need to extract the vector space representation of several documents and then to compute the cosine distance among them.

I'd like to use that distance to classify some new documents using a k-Nearest-Neighbor approach.

Do you have some suggestion on the libraries I could use?

So far I saw that both Weka and Apache Lucene should support Vector Space Model, which one do you think is the one that best fits my needs?