Hi All, Thank you all great guys here for helping people like me :) I just need small hint ....
I calculated tf/idf values of two documents. Following is the tf/idf values 1.txt 0.0 0.5 2.txt 0.0 0.5
The documents are like 1.txt = > dog cat 2.txt = > cat elephant
As now I have tf/idf values. Can any body tell me how to use these values to calculate cosine similarity??
I already read wikipedia and all other tutorial that i should calculate dot product then find distance then divide dot product by distance. I am not good in math. That's why I couldn't understand what they are doing with X,Y :)
If u can just tell me how to calculate using my values. I will understand and implement it.
One more question. In is important both documents should have same number of words?
Thanks !