ansaurus

Question

find the position of a string in another string

Answer 1

+5 A:

You really must to scan A from the start.

There are good algorithms of fast substring search, e.g. http://en.wikipedia.org/wiki/Knuth%E2%80%93Morris%E2%80%93Pratt_algorithm

There is also a standard function strstr:

strstr(A,B)

http://www.cplusplus.com/reference/clibrary/cstring/strstr/

osgx 2010-03-03 08:42:41

Or `std::string::find` in the case of C++.

Hans W 2010-03-03 08:51:44

Answer 2

A:

The optimal way to solve this is by using the KMP algorithm: http://en.wikipedia.org/wiki/Knuth%E2%80%93Morris%E2%80%93Pratt_algorithm

IVlad 2010-03-03 08:42:46

Is KMP trick really fastest for all cases? It is a bit outdated (1977). I think, there must be some faster tricks.

osgx 2010-03-03 08:54:04

KMP gives the best worst-case performance as far as I know.

IVlad 2010-03-03 08:55:44

Answer 3

+1 A:

"Trick" is another word for algorithm, I guess. The most famous one is Knuth-Morris-Pratt.

unwind 2010-03-03 08:43:01

Answer 4

+2 A:

See http://en.wikipedia.org/wiki/Boyer%E2%80%93Moore_string_search_algorithm

Patrick 2010-03-03 08:43:54

Answer 5

A:

In C programming you have function strstr to find the sub string position in the source String.

pavun_cool 2010-03-03 08:52:07

Answer 6

A:

If you have a big string and you're going to search for many substrings,
it's good to have in mind the suffix array structure.
Basically, you create an array of pointers and you sort it.
Then you can locate any substring with a binary search.

Nick D 2010-03-03 08:54:08

Answer 7

A:

The problem that you are solving is the "exact string matching" problem. The naive solution runs in O(n^2) time, but you can do much better than that. Some linear-time algorithms to solve this problem are Knuth-Morris-Pratt (KMP), Boyer-Moore, and Apostolico-Giancarlo. Another way to solve it is by constructing a finite state automaton that enters an accepting state when it sees the pattern string. The best possible solution is O(n), and all those have that worst-case running time. You do have to scan the string from one end to the other; however, it is possible to skip a fraction of the characters (which Boyer-Moore and Apostolico-Giancarlo will do), since some mismatches can imply other mismatches.

If you need to code this yourself, I recommend you go with the Knuth-Morris-Pratt algorithm, since it is a little bit more intuitive and easier to implement than the other solutions I've mentioned. Most programming languges, though, have an "indexOf" or "find" function that will solve this for you.

Michael Aaron Safyan 2010-03-03 08:57:18

ansaurus

tags:

views:

answers:

find the position of a string in another string

related questions