Hi!
I have 2 texts (max 4000 characters) of different length. And I need to get a similarity rate based on (partial-)paraphrasing. Please note that same portion of texts can be in different position in each text (So Levenshtein is not the solution).
The comparison process should also:
- not increase expo. with text size
- be performance friendly. :)
It seems that the "adaptive local alignment of keywords" is a possible solution.
Do you have any implementation example? Preferred language is PHP but I can translate. :)
Do you have any other solution/idea/experience on that topic?
Thanks for your great help.