full-text-search

How do you do phrase-based full text search in postgres that takes advantage of the full-text index?

Let's say you have a postgres 8.3 table as follows: CREATE TABLE t1 (body text, body_vector tsvector); I want to be able to search it for phrases using the full text index (GiST, GIN or both on the tsvector column). The best workaround I've been able to find is to first do the full text search on both words (boolean AND) and then do a...

Why should (or shouldn't) a Search Query return back only document IDs?

So for a new project, I'm building a system for an ecommerce site. The idea is to import products from suppliers and instead of inserting them directly into our catalog, we would store all the information in a staging area. Each supplier has their own stage (i.e. table in the database), and then I will flatten the multiple staging areas ...

full-text search using Google App Engine and JDO?

I'm using Google App Engine (Java) with JDO. How can I do the JDO equivalent of select * from table where field like '%foo%'? The only recommendation I have seen so far is to use Lucene. I'm kind of surprised that something this basic is not possible on out-of-the-box GAE. ...

Can I configure stop words programmatically with PostgreSQL full-text search?

I'm using PostgreSQL full text search for a project where traditional stop words ('a', 'the', 'if' etc.) should be indexed and searchable, which is not the default behaviour. For example, I might want my users to find results for the query 'to be or not to be'. The documentation indicates that I could achieve this by creating an empty ...

Auto-completion for names against a SQL database

I have a text field in my web app where I want to do auto-completion (e.g. the user types "St" and I can suggest "Steve"). The names I'm matching against are in a SQL database table of users. My question is, how can I make this happen in a way that will scale to massive amounts of users? There's DB full text search or something like ...
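One way to picture the database side of the auto-completion question above is a prefix query against the names column. The following is only a minimal sketch, assuming an in-memory SQLite database; the `users` table, its `name` column, and the `suggest_names` helper are hypothetical names chosen for illustration, and whether an index is actually used for the prefix match depends on the database engine and its collation settings.

```python
import sqlite3


def suggest_names(conn, prefix, limit=10):
    """Return up to `limit` names that start with `prefix` (hypothetical helper)."""
    # Escape LIKE wildcards so that user input is matched literally.
    escaped = (
        prefix.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    )
    rows = conn.execute(
        "SELECT name FROM users WHERE name LIKE ? ESCAPE '\\' "
        "ORDER BY name LIMIT ?",
        (escaped + "%", limit),
    )
    return [name for (name,) in rows]


# Hypothetical sample data standing in for the real users table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")
conn.execute("CREATE INDEX users_name_idx ON users (name)")
conn.executemany(
    "INSERT INTO users (name) VALUES (?)",
    [("Steve",), ("Stella",), ("Sam",), ("Anna",)],
)

print(suggest_names(conn, "St"))
```

The LIMIT clause keeps each response small regardless of table size; a dedicated search engine or full-text index may still be the better fit at very large scale.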

The best online resources for full-text searching in Microsoft SQL?

So I've learned the difference between FREETEXT, FREETEXTTABLE, CONTAINS, and CONTAINSTABLE. And I've created a pretty cool search engine that combines a full-text enabled search with a tagging system (with a little help from you guys). But where have you gone to really learn about and master full-text searching and get the most out of...

lucene, or sql fulltext?

I want to create a search website to search docs (all kinds of formats including pdf), images, videos, and audio. I also want to be able to filter my search results based on some criteria like author name, date, etc. I'm doing this in .NET, so what's the easiest way to get up and running? SQL fulltext searching seems tempting becaus...

Questions about Sql Server 2008 FullText Search

I'm using SQL Server 2008 Full-Text Search for a project. I need to be able to search PDF files, and I had some questions relating to that: How do I enable PDF searching? I've heard of the Adobe filter, but couldn't find a clear guide on how to get started. Are the PDF files stored in the DB itself, or in the file system? I was mainly...

Bypassing word break in SQL 2005 Full Text

I have SQL Server 2005 with a Full Text index on a product table. I am running into issues with the word breaker parsing on the - (dash) character. If I have the following values in my [Keyword Search] field I see the following results: ABC-Z-XY more text... ABC-A-XY more text... ABC-ABC-XY more text... ABC-ABC-CE more text... -- brin...

full-text searching for non-English text in sqlite

How can I do full-text searching on non-English text with SQLite? I use PHP 5.3 with pdo_sqlite. CREATE VIRTUAL TABLE example USING FTS3(title TEXT, TOKENIZE icu hu_HU) This statement doesn't throw an exception, but only a 0-byte .sqlite file is created, and I don't see any virtual table. Below this link, I read that pdo_sqli...

sql server - full-text search

So let's say I have two databases, one for production purposes and another one for development purposes. When we copied the development database, the full-text catalog did not get copied properly, so we decided to create the catalog ourselves. We matched all the tables and indexes and created the database and the search feature seems t...

Lucene - Searching several terms in different fields

I have a Lucene index which populates from a database. I store/index some fields and then add a FullText field in which I index the contents of all the other fields, so I can do a general search. Now let's say I have a document with the following two fields: fld1 - "Samsung releases a new 22'' LCD screen" fld2 - "Sony Ericsson phone's b...

how to extract text from djvu and other ebooks format (possibly in Python)

I have a collection of ebooks in DjVu, PDF, and CHM formats, and I am looking for a way to search their content for keywords. I have been researching and found a couple of suggestions for parsing PDF content, but there seems to be no way to convert DjVu content into text. By any chance, does anyone know a way to decode DjVu content into tex...

Why are PostgreSQL Text-Search GiST indexes so much slower than GIN indexes?

I'm testing out the PostgreSQL Text-Search features, using the September data dump from StackOverflow as sample data. :-) The naive approach of using LIKE predicates or POSIX regular expression matching to search 1.2 million rows takes about 90-105 seconds (on my Macbook) to do a full table-scan searching for a keyword. SELECT * FROM P...

fulltext search on xml in SQL Server 2008

This is a simple question: how do I set up full-text search on an XML column in SQL Server 2008? ...

Searching SQL Server

I've been asked to put together a search for one of our databases. The requirement is that the user types into a search box; SQL then needs to split up all the words in the search and search for each of them across multiple fields (probably 2 or 3). It then needs to weight the results; for example, the result where all the words appear will be th...
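The split-and-weight idea described in this question can be sketched outside the database as follows. This is only an illustrative sketch under stated assumptions: the `rank_rows` function, the dictionary-shaped rows, and the field names are all hypothetical, and matching is done by simple case-insensitive substring tests rather than by any real full-text engine.

```python
def rank_rows(rows, query, fields):
    """Order rows by how many distinct search words they contain (hypothetical helper)."""
    # Split the search box input into lowercase words.
    terms = [term.lower() for term in query.split()]
    scored = []
    for row in rows:
        # Combine the searched fields into one lowercase string.
        text = " ".join(str(row.get(field, "")) for field in fields).lower()
        # Count how many of the search words occur as substrings.
        hits = sum(1 for term in terms if term in text)
        if hits:
            scored.append((hits, row))
    # Rows matching more words come first; ties keep their input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [row for _, row in scored]


# Hypothetical sample rows with two searchable fields.
sample_rows = [
    {"title": "red apple", "notes": "fresh"},
    {"title": "green apple pie", "notes": "red label"},
    {"title": "banana", "notes": ""},
]

for row in rank_rows(sample_rows, "red apple pie", ["title", "notes"]):
    print(row["title"])
```

Rows that match none of the words are dropped, and a row containing every word naturally ranks at the top.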

SQL Server 2008 FILESTREAM, Indexing Services, and FTS

I need to implement a service to search PDFs. Initially I started using SQL Server 2008 FTS, but soon realized that my PDFs would have to be stored in the DB itself. I was then pointed to Indexing Services as well as to the SQL 2008 FILESTREAM data type so that I can store PDFs in the file system. So how do these three (Indexing Servi...

MySQL full text fails on a particular table

I've been tearing my hair out over why this fails. I have the following code: $query = " SELECT DISTINCT title, caption, message, url, MATCH(title, caption, message, url) AGAINST ('$searchstring' ) AS score FROM news WHERE (valid = 1) AND MATCH(title, caption, message, url) AGAINST ('$searchstring' ) UNION ALL SELECT DISTINCT title, captio...

FullText Search using multiple tables in SQL

Hi there. I have 3 tables: tblBook(BookID, ISBN, Title, Summary), tblAuthor(AuthorID, FullName), and tblBookAuthor(BookAuthorID, BookID, AuthorID). tblBookAuthor allows a single book to have multiple authors, and an author may have written any number of books. I am using full text search to rank results based on a word: SET @Word ...

Matching of user search keywords to tags

Can somebody suggest how to match user input (a few words) to the appropriate tags in a system (each tag also being 1 to N words)? Here is a sample to demonstrate the problem: I have tags assigned to objects. For example (tags are separated by commas here, but in real life I have a relation to a table) Object Tags Earth ...