latent-semantic-indexing

Latent Semantic Indexing in Java

Is there an open-source implementation of LSI in Java? I want to use such a library for my project. I have seen jLSI, but it implements a different model of LSI. I want the standard model. ...

Latent Semantic Indexing

It is said that the matrices LSI produces (U, A and V) bring together documents that contain synonyms. For example, if we search for "car", we also get documents that contain "automobile". But LSI is nothing but matrix manipulation. It only takes frequency into account, not semantics. So what's the thing behind thi...

Need help in latent semantic indexing

Hi, I am sorry if my question sounds stupid :) Can you please recommend any pseudocode or a good algorithm for implementing LSI in Java? I am not a math expert. I tried to read some articles on Wikipedia and other websites about LSI (latent semantic indexing), but they were full of math. I know LSI is full of math. But if I see some source co...

Problem with LSI

I am using latent semantic analysis for text similarity. I have two questions. How do I select the K value for dimension reduction? I have read in many places that LSI works for words with similar meanings, for example "car" and "automobile". How is that possible? What is the magic step I am missing here? ...

categorize websites - open source LSI?

I'm looking to categorize lots of websites (millions). I can use Nutch to crawl them and get the content of the sites, but I am looking for the best (and cheapest or free) tool to categorize them. One option is to create regular expressions that look for certain keywords and categorize the sites, but there are also high-end LSI-type too...

Latent Semantic Indexing

I'm trying to find out how to carry out the multiplication of the matrices produced after SVD implementation in LSI. I need this for my research. I want to carry out document clustering. ...
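
For the multiplication step asked about above, here is a minimal sketch in plain Java. It assumes the SVD factors are already available as arrays (U as m x r, the singular values as a vector sorted in descending order, and V^T as r x n), and that k is no larger than r. The class and method names (`LsiReconstruct`, `reconstruct`, `documentVectors`) are hypothetical and not taken from any library. The first method forms the rank-k approximation A_k = U_k Σ_k V_k^T. The second returns each document's coordinates in the k-dimensional latent space (the columns of Σ_k V_k^T), which is one common choice of input for clustering.

```java
public class LsiReconstruct {

    // Rank-k approximation A_k = U_k * diag(sigma_k) * V_k^T.
    // u is m x r, sigma holds r singular values, vt is r x n, and k <= r (assumed).
    static double[][] reconstruct(double[][] u, double[] sigma, double[][] vt, int k) {
        int m = u.length;
        int n = vt[0].length;
        double[][] a = new double[m][n];
        for (int i = 0; i < m; i++) {
            for (int j = 0; j < n; j++) {
                double sum = 0.0;
                // Only the first k singular triplets contribute.
                for (int c = 0; c < k; c++) {
                    sum += u[i][c] * sigma[c] * vt[c][j];
                }
                a[i][j] = sum;
            }
        }
        return a;
    }

    // Document coordinates in the latent space: row j of the result is
    // column j of diag(sigma_k) * V_k^T, i.e. one k-dimensional vector per document.
    static double[][] documentVectors(double[] sigma, double[][] vt, int k) {
        int n = vt[0].length;
        double[][] docs = new double[n][k];
        for (int j = 0; j < n; j++) {
            for (int c = 0; c < k; c++) {
                docs[j][c] = sigma[c] * vt[c][j];
            }
        }
        return docs;
    }

    public static void main(String[] args) {
        // Toy factors chosen by hand for illustration, not the output of a real SVD routine.
        double[][] u = {{1, 0}, {0, 1}};
        double[] sigma = {3, 2};
        double[][] vt = {{1, 0}, {0, 1}};

        double[][] a1 = reconstruct(u, sigma, vt, 1);
        System.out.println(java.util.Arrays.deepToString(a1));
    }
}
```

The resulting document vectors can then be compared with cosine similarity or passed to a clustering algorithm such as k-means. Computing the SVD itself is left to a linear algebra library.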