Implementing proximity operators in full-text search | ansaurus

tags:

views:

22

answers:

0

Q:

Implementing proximity operators in full-text search

Some databases (e.g. Scopus and Web of Science) implement proximity operators such as SAME or NEAR. With these the user can define that he wants his search words to be within a set number of words from each other or in the same sentence or paragraph.

I just started wondering how this is implemented. Full-text search as is is not that complex but how is the indexing done to make proximity searches feasible. The databases I have tried have hundreds of thousands of articles so obviously they have been indexed, but if you try to use the same tactic as with normal full-text search (as I understand they have been implemented) the size of the index would be impractically large.

So, how is this done?

related questions

Best way to search data stored as XML in Sql Server?

What are the alternative's to using the iThenticate service for content comparison?

Search by hash?

Free text search integrated with code coverage

How-to: Ranking Search Results

Find item in WPF ComboBox

Find in Files: Search all code in Team Foundation Server

Searching for phone numbers in mysql

How do I implement Search Functionality in a website?

Can you perform an AND search of keywords using FREETEXT() on SQL Server 2005?

How do I search content, within audio files/streams?

Search Plugin for Safari

Search strategies in ORMs

Using Lucene to search for email addresses

SQL Server Full Text Searching

How do you do a case insensitive search using a pattern modifier using less ?

WildcardQuery error in Solr

PowerShell FINDSTR eqivalent?

Parsing search queries in Java

Need Pattern for dynamic search of multiple sql tables

grep a file, but show several surrounding lines?

Eclipse : Class file name must end with .class exception in Java Search

MOSS SSP problem - Failed database logons from deleted SSP

Incomplete results with Turkish characters in Indexing Service

Lucene Score results