For the two string searching algorithms: KMP and suffix tree, which is preferred in which cases? Give some practical examples.
+3
A:
A suffix tree is better if you will have to answer a lot of queries such as "is the needle present in the haystack?". KMP is better if you only have to search for one string in another single string, and not have to do it a lot of times.
A suffix tree is a much more general data structure, so you can do a lot more with it. See what you can do with it here. KMP is useful for finding if a string is a substring in another string.
You might also want to check out other algorithms, such as Boyer-Moore, Rabin-Karp and even the naive algorithm, as there are situations (inputs) in which one is better than the others.
Bottom line is:
- If you have a lot of queries like the one I mentioned above, it's worth to build a suffix tree and then answer each query faster.
- If you need to do more than those kinds of queries, a suffix tree is also worth building.
- If you only care about occasionally finding if a string is a substring of another string, then use KMP.
IVlad
2010-04-10 11:40:48