ansaurus

Question

Answer 1

+1 A:

The full text search only matches words and word prefixes. Because your data in the database does not contain word boundaries (spaces) the words are not indexed, so they are not found.

Some possible choices you could make are:

Fix your data so that it contains spaces between words.
Use LIKE '%text%' instead of a full text search.
Use an external full-text search engine.

I will expand on each of these in turn.

Fix your data so that it contains spaces between words.

Your data seems to have been corrupted somehow. It looks like words or sentences but with all the spaces removed. Do you know how that happened? Was it intentional? Perhaps there is a bug elsewhere in the system. Try to fix that. Find out where the data came from and see if it can be reimported correctly.

If the original source doesn't contain spaces, perhaps you could use some natural language toolkit to guess where the spaces should be and insert them. There most likely already exist libraries that can do this, although I don't happen to know any. A Google search might find something.

Use LIKE '%text%' instead of a full text search.

A workaround is to use LIKE '%text%' instead but note that this will be much slower as it will not be able to use the index. However it will give the correct result.

Use an external full-text search engine.

You could also look at Lucene or Sphinx. For example I know that Sphinx supports finding text using *text*. Here is an extract from the documentation which explains how to enable infix searching, which is what you need.

9.2.16. min_infix_len

Minimum infix prefix length to index. Optional, default is 0 (do not index infixes). Infix indexing allows to implement wildcard searching by 'start*', '*end', and 'middle' wildcards (refer to enable_star option for details on wildcard syntax). When mininum infix length is set to a positive number, indexer will index all the possible keyword infixes (ie. substrings) in addition to the keywords themselves. Too short infixes (below the minimum allowed length) will not be indexed.

For instance, indexing a keyword "test" with min_infix_len=2 will result in indexing "te", "es", "st", "tes", "est" infixes along with the word itself. Searches against such index for "es" will match documents that contain "test" word, even if they do not contain "es" on itself. However, indexing infixes will make the index grow significantly (because of many more indexed keywords), and will degrade both indexing and searching times.

Mark Byers 2010-05-15 08:26:53

so what is the best approach for this???? Please suggest....

Avinash 2010-05-15 08:29:44

i have edited my question, please check, new query added which also not working...

Avinash 2010-05-15 08:33:28

how can i store it with spaces, can u please explain, i am open to do it, if its not taking more time.....

Avinash 2010-05-15 08:34:38

but using like will not perform full text searching......

Avinash 2010-05-15 10:45:09

@Avinash - I have deleted all my comments and integrated them into my answer. I hope that helps you.

Mark Byers 2010-05-15 17:52:16

ansaurus

tags:

views:

answers:

Mysql query problem....

related questions