Hi all,
I am running some string matching tests using the Smith-Waterman algorithm. I am currently using SimMetrics (the Java open source project) to run the tests.
Can anyone explain why when I compare 'Bloggs J.' to 'Bloggs' I get a similarity value of 1.0?
There obviously is a gap (e.g. 'o' and '.'), but it does not appear to be penalized.
Thank you in advance.