full-text-search

Text search tool for a large source code set with up-to-date pre-indexing?

I maintain a few branches of a mid-size C++ project (~15k files per branch). Very often I have to search all project files for a given string or regex. Currently I'm using Total Commander, which has all the features I want (case sensitivity, regexes, filename masks), but it scans all files every time, so it takes a bit t...

How to do fuzzy string search without a heavy database?

I have a mapping of catalog numbers to product names (35 cozy comforter; 35 warm blanket; 67 pillow) and need a search that would find misspelled, mixed names like "warm cmfrter". We have code using edit distance (difflib), but it probably won't scale to the 18000 names. I achieved something similar with Lucene, but as PyLucene only...
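To illustrate the kind of lightweight approach the excerpt is asking about, here is a minimal sketch, not the asker's code. It assumes a character-trigram index is used to shortlist candidates so that difflib only has to score a few names instead of all of them. The names `trigrams` and `FuzzyIndex` are hypothetical, and the catalog entries are the three from the excerpt.

```python
import difflib
from collections import defaultdict


def trigrams(text):
    """Return the set of character trigrams of a padded, lowercased string."""
    padded = f"  {text.lower()} "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


class FuzzyIndex:
    """Hypothetical in-memory index: trigram -> positions of names containing it."""

    def __init__(self, entries):
        self.entries = list(entries)          # (catalog_number, name) pairs
        self.postings = defaultdict(set)
        for pos, (_, name) in enumerate(self.entries):
            for gram in trigrams(name):
                self.postings[gram].add(pos)

    def search(self, query, limit=5):
        # Shortlist: only names sharing at least one trigram with the query.
        candidates = set()
        for gram in trigrams(query):
            candidates |= self.postings.get(gram, set())
        # Rank the shortlist with difflib's similarity ratio (0.0 to 1.0).
        scored = []
        for pos in candidates:
            number, name = self.entries[pos]
            matcher = difflib.SequenceMatcher(None, query.lower(), name.lower())
            scored.append((matcher.ratio(), number, name))
        scored.sort(reverse=True)
        return scored[:limit]


index = FuzzyIndex([(35, "cozy comforter"), (35, "warm blanket"), (67, "pillow")])
for score, number, name in index.search("warm cmfrter"):
    print(number, name, round(score, 2))
```

With this data, "pillow" shares no trigram with the query and is never scored. Whether trigram shortlisting is selective enough for 18000 real product names is an assumption that would need measuring.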

How do I disable full-text search on a column in SQL Server?

I'm trying to convert a column from ntext to nvarchar(MAX), but it seems full-text search is enabled on it. First: alter table <table> alter column <column> nvarchar. Then I'm going to force the text into rows: update <table> set <column> = <column> +'' where <column> is not null. Finally, I'll need to enable the full-text search...

How to do a full text search in Cocoa?

I need something like Lucene to do an optimized full-text search in Cocoa. I am working on an iPhone app that searches through a database. Has anybody had any luck with other databases? Any help is appreciated. So far, I can only find this: http://github.com/tcurdt/lucenekit/tree/master ...

How to add search functionality to my application

I am writing a Windows application (with Borland C++ Builder) that stores a large number of text files. I want users to be able to search these files very fast, so I need an indexing and search library. I do not use a database, but my own file format for storing the documents (all are in a single file). Are there such libraries for Windows?...

MS SQL Server: 'contains' vs 'charindex'

I have a problem with a query in MS SQL Server. I have a full-text index on a column called "col1". The data in this column can get quite large (20-30 KB or more). I now want to search this column for an exact phrase. I have been told that the "contains" function is the fastest function for this, but I am aware of at least 2 other ways of...

SQL to search objects, including stored procedures, in Oracle.

I need to write some SQL that will allow me to query all objects in our Oracle database. Unfortunately, the tools we are allowed to use don't have this built in. Basically, I need to search all tables, procedures, triggers, views, everything. I know how to search for object names, but I need to search the contents of the objects. i....

Add files to fulltext index without storing them as blob

Is there a way to add files to a SQL Server index without adding them as blobs to the database? Currently we extract the text content of all our PDF files using pdf2text and store it as a blob. This way, SQL Server can add the text to its index. But this is a big waste of space. We are using SQL Server 2005. Can the new filestream ...

How to search the Xcode doc set

Here's another really stupid question. Xcode has a Developer Documentation window. There are sections dedicated to sorting the results of a language reference search. There's no search bar. No button that says "Search for:". No magnifying glass. What am I looking for??? Google doesn't understand my question, and I thought I'd ask you ...

How to insert a text file into a field in PostgreSQL?

I'd like to insert a row with fields from a local or remote text file. I'd expect a function like gettext() or geturl() so that I could do the following: % INSERT INTO collection(id, path, content) VALUES(1, '/etc/motd', gettext('/etc/motd')); -S. ...

Corporate site search

I have a small, SEO-friendly corporate site behind a basic password protection scheme, and I'd like to add a basic search facility using one of the off-the-shelf solutions. (I don't mind Google or Yahoo having the site's content, as long as it doesn't leak to the web.) Google doesn't seem to support this, so I'd like to ask the SO commu...

What is the best approach for interpreting a text input for geocoding purposes?

Consider the following site: http://maps.google.com. It has a main text input where the user can type businesses, countries, provinces, cities, addresses and zip codes. I wonder what the best way to implement a search like this is. I realize that Google Maps probably uses a full-text search with all kinds of data in the same table, and ...

How to search

I have a record file, and I need to write a program that helps find any word in that record by typing a word into a search field. The record file contains a list of words, so I need to search that list for any word that could be similar to my word. I am using Delphi 2007 ...

Stack Overflow "Related questions" algorithm

The related questions that appear after entering the title, and those in the right sidebar when viewing a question, seem to suggest very apt questions. According to Spolsky in a talk, Stack Overflow only does a SQL search for this and uses no special algorithms. What algorithms exist to give good answers in such a case? How do you do datab...

PostgreSQL: how to make full-text search ignore certain tokens?

Is there a magic function or operator to ignore some tokens? select to_tsvector('the quick. brown fox') @@ 'brown' -- returns true select to_tsvector('the quick,brown fox') @@ 'brown' -- returns true select to_tsvector('the quick.brown fox') @@ 'brown' -- returns false, should return true select to_tsvector('the quick/brown fox') @@ ...

Searching sub-models with Ferret

I have a Rails app in which I am trying to do some full-text searching. Ferret seems to be the most popular choice. However, I have an issue. I have a 'thing' which contains an id that determines whether a user can see it (and therefore search it), but I want to search sub-models of this 'thing' which are related to my 'thing' but don'...

Which field had my search text in Lucene when using a MultiFieldQueryParser?

I'm using Lucene.Net's MultiFieldQueryParser to search multiple fields in my documents. I want to find out which field the text was found in. For example, my search might look like this: var parser = new MultiFieldQueryParser(new string[] {"question","answer"}, analyzer); var query = parser.Parse(searchphrase); for(int idx=0; idx<hits.Len...

SQL Server 2005 and full-text search performance problem

Dear all, I have a large table on SQL Server 2005 with full-text search enabled on it. I want to get the top N newest rows, by id or insert date, that match a specific full-text query. Some queries using "Containstable" return over 500,000 records and take over 1 minute, which is very bad. Top N by rank is also no good for me, because it may return old data ...

Algorithm faster than BMH (Boyer–Moore–Horspool) Search

Which algorithm would you use to search for short substrings in short texts? By short I mean 5-10 characters for the substring and 255 for the string. I'm thinking about choosing the algorithm depending on the input data length. Which algorithm is better for longer inputs? ...
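For reference, the baseline named in the title can be sketched in a few lines. This is a minimal, illustrative Boyer-Moore-Horspool search, not a tuned implementation and not taken from the question. The function name `horspool_find` is hypothetical, and the empty-pattern behaviour is chosen to mirror Python's `str.find`.

```python
def horspool_find(text, pattern):
    """Return the index of the first occurrence of pattern in text, or -1."""
    m, n = len(pattern), len(text)
    if m == 0:
        return 0
    if m > n:
        return -1
    # Bad-character table: for every character of the pattern except the last,
    # the distance from its rightmost occurrence to the end of the pattern.
    shift = {ch: m - 1 - i for i, ch in enumerate(pattern[:-1])}
    pos = 0
    while pos <= n - m:
        if text[pos:pos + m] == pattern:
            return pos
        # Slide the window based on the text character aligned with the
        # pattern's last position; characters not in the table skip m places.
        pos += shift.get(text[pos + m - 1], m)
    return -1


print(horspool_find("the quick brown fox", "brown"))
```

For patterns of 5-10 characters the shift table is tiny, so whether its setup cost pays off against a plain scan over 255 characters is something to measure rather than assume.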

How to stop regex matching after 1 match without using non-greedy character

Is there any way to get a regex pattern to automatically stop searching after one match? I'd like to add regex searching to a web service I'm trying to create, but I don't want someone to be able to run a regex that would take a long time; just one match is good enough. Is this possible? ...