search

Can I use arbitrary metrics to search KD-Trees?

I just finished implementing a kd-tree for doing fast nearest neighbor searches. I'm interested in playing around with distance metrics other than the Euclidean distance. My understanding of the kd-tree is that the speedy kd-tree search is not guaranteed to give exact results if the metric is non-Euclidean, which means that I ...

Search in SVN repository for a file name

Issue: search an SVN repository by file name. The problem: we have a large code repository containing thousands of folders and subfolders, and I want to search this repository by file name or by some word. Example: Root folder a\ b\ c\ d\ e\ f\ab\ f\ab\cd.txt I want to search for cd.txt but don...

Search multiple indices at once using Lucene Search

I am using Zend_Search_Lucene to implement site search. I created separate indices for different data types (e.g. one for users, one for posts, etc.). The results are similarly divided by data type; however, there is an 'all' option which should show a combination of the different result types. Is it possible to search across the different i...

How to find a similar code fragment?

Does anyone have a tool or a recommended practice for finding a piece of code that is similar to some other code? Often I write a function or a code fragment and remember that I have already written something like it before, and I would like to reuse the previous implementation; however, a plain text search does not reveal anything,...

How can you use parameterized statements with DB2 Text Search?

I've tried this: select * from ourschema.mytable where contains(mysearchablefield, @searchTerms) = 1; Where @searchTerms was set to "search terms" Unfortunately, it only produced an error: ERROR [42610] [IBM][DB2/NT] SQL0418N A statement contains a use of a parameter marker that is not valid. SQLSTATE=42610 Is there a way to us...

searching social networking

How can I organize a search in a social network web application? Searching is done by full name. I want to use stored procedures. Is that the best solution? What algorithm can be used? While registering, the user specifies his/her full name, for example: Alice Johnson Martin. I want to search for a user using his/her full name. In case someone ...

Search for string and get count in VI editor

I want to search for a string and find the number of occurrences in a file opened in the vi editor. ...

What's the best way to search GitHub?

The search feature on the site seems pretty awful. Are there any external sites that do a better job of categorizing projects with tags, etc? Or maybe I'm just not using GitHub correctly? ...

Client Side Pagination

What are the best libraries available right now for client side pagination? ...

Efficient in-line search and replace for large file

Hi. There are some standard tools to do this, but I need a simple GUI to assist some users (on Windows). They will get an open file dialog and pick the file to process. The file will be an XML file. The file will contain (within the first few lines) a text string that needs to be deleted or replaced with whitespace (doesn't matter which...

How do I query for data indexed as IndexedEmbedded (full text in NHibernate.Search)

How do I query for data added as IndexedEmbedded? I have an entity class [Indexed] public class Something { [Field(Index.Tokenized, Store = Store.Yes)] public virtual string Description { get; set; } [IndexedEmbedded] public virtual Category Category { get; set; } [IndexedEmbedded] public virtual Location L...

Parsing search queries for SQL 2008 FTS.

We want to use SQL Server 2008 Full Text Search and seem to run into a lot of problems handling the search query. If the user types in "blue dog" it just crashes SQL unless we parse the search terms to include the "" around the words, but that makes it a phrase instead of keywords. I want results where blue or dog are included but that ...
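A minimal sketch of one way to pre-process such input, assuming the goal is keyword matching rather than phrase matching: quote each word separately and join the words with OR, so that "blue dog" becomes "blue" OR "dog". The function name `to_contains_condition` is hypothetical, and the excerpt is truncated, so this may not cover every requirement the asker had.

```python
from typing import Optional


def to_contains_condition(user_input: str, operator: str = "OR") -> Optional[str]:
    """Turn free-form user input into a keyword search condition.

    Each whitespace-separated word is wrapped in double quotes on its own,
    so it is treated as a single term instead of the whole input becoming
    one phrase. Double quotes typed by the user are dropped so they cannot
    unbalance the condition.
    """
    terms = [word.replace('"', "") for word in user_input.split()]
    terms = [term for term in terms if term]  # drop words that were only quotes
    if not terms:
        return None  # nothing searchable; the caller should skip the query
    return f" {operator} ".join(f'"{term}"' for term in terms)


if __name__ == "__main__":
    print(to_contains_condition("blue dog"))         # "blue" OR "dog"
    print(to_contains_condition("blue dog", "AND"))  # "blue" AND "dog"
```

The resulting string would then be bound as an ordinary parameter, for example CONTAINS(mycolumn, @condition), where mycolumn and @condition are placeholder names, rather than concatenated into the SQL text.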

Text indexer search tool which can filter by punctuation?

This is not a programming question per se but a question about searching source code files, which helps me in programming. I use a search tool, X1, which quickly tells me which source code files contain some keywords I am looking for. However, it doesn't work well for keywords which have punctuation attached to them. For example, if I sea...

Efficiently searching a 3-level hierarchy

Let's say I have a FirstName > MiddleName > LastName hierarchy (~10k rows, for the sake of the question). This means you could have a "John > Mary-Anne > Eddy" or an "Eddy > John > Jacob" row. The point being that the hierarchy makes little sense and is very foreign to the user (unlike, say, a Country > State > City structure). Because it's so u...

Best algorithm to index sentences.

Hello there. Imagine I have a situation where I need to index sentences. Let me explain it in a little more depth. For example, I have these sentences: The beautiful sky. Beautiful sky dream. Beautiful dream. As far as I can imagine, the index should look something like this: But I would also like to search by any of these words...
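One common structure for this kind of lookup is an inverted index, which maps each word to the set of sentences that contain it. The sketch below is only an illustration under that assumption (the index layout the asker had in mind is not shown in the excerpt); `build_index` and `search` are hypothetical names, and the three sentences are the ones from the question.

```python
import re
from collections import defaultdict


def build_index(sentences):
    """Map each lowercased word to the set of sentence ids containing it."""
    index = defaultdict(set)
    for sentence_id, sentence in enumerate(sentences):
        for word in re.findall(r"\w+", sentence.lower()):
            index[word].add(sentence_id)
    return index


def search(index, query):
    """Return the ids of sentences that contain every word in the query."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())  # keep only sentences with all words
    return result


sentences = ["The beautiful sky.", "Beautiful sky dream.", "Beautiful dream."]
index = build_index(sentences)

if __name__ == "__main__":
    print(search(index, "sky"))        # {0, 1}
    print(search(index, "sky dream"))  # {1}
```

Intersecting the sets gives "all words" semantics; replacing the intersection with a union would return sentences that contain any of the words instead.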

database design

Hi, I'm developing a web search tool for college search. The tool should support searching for a college 1. by course 2. by fees 3. by name. Is there any database algorithm I have to follow to make my search efficient? Must I use a data mining technique when the college data grows so ...

Delphi 2009: Search skipping diacritics in unicode utf-8

I have a UTF-8 encoded file containing Arabic text and I have to search it. My problem is diacritics: how do I search while skipping them? For example, if you load that text in Internet Explorer (converting the text to HTML, of course), IE skips those diacritics. Any help? Edit1: Search is simply performed by the following code: var m1 : TMemo; /...
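The question concerns Delphi 2009, so the following is not a drop-in answer; it is a Python sketch of the general idea, assuming the diacritics are stored as Unicode combining marks (as the Arabic harakat are). Both the text and the search term are stripped of combining marks before comparing. The names `strip_diacritics` and `contains_ignoring_diacritics` are hypothetical.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Remove combining marks (e.g. Arabic fatha, damma, shadda) from text.

    No normalization is applied first, so precomposed letters such as
    alef with madda are left unchanged.
    """
    return "".join(ch for ch in text if not unicodedata.combining(ch))


def contains_ignoring_diacritics(haystack: str, needle: str) -> bool:
    """Substring test that ignores combining marks on both sides."""
    return strip_diacritics(needle) in strip_diacritics(haystack)


if __name__ == "__main__":
    # A vocalized Arabic word, written with escapes to keep the marks visible.
    vocalized = "\u0645\u064F\u062D\u064E\u0645\u0651\u064E\u062F"
    plain = "\u0645\u062D\u0645\u062F"
    print(contains_ignoring_diacritics(vocalized, plain))  # True
```

Because the stripped text is shorter than the original, match positions found this way refer to the stripped string; mapping them back to the original text would need an extra offset table.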

Return surrounding text for phrase found in full-text search, SQL 2005

I'm using a contains predicate to find phrases in a SQL Server indexed text field. Is there a way to return the portion of the text field that contains the searched phrase, or some area around it? For example, if I'm searching for "all men are created equal" in the Gettysburg Address (excerpted below), I'd like to return "dedicated to ...

How to Prevent the Google Duplicate Content Problem | Multi Site

I'm about to launch multi-domain affiliate sites that have one thing in common: content. Having read about the problem with duplicate content and Google, I'm a little worried that the parent domain or sub sites could get banned from the search engine for duplicated content. If I have 100 sites with similar look and feel and basica...

Where can I find a boatload of source files for stress testing source scanning systems?

I thought the Linux kernel would have a massive amount of SLOC, but it isn't even making OpenGrok blink. I know the question of the largest project has been asked, but what collection of code is arguably the largest? I would prefer either tar, gz, or a hub-style place where I can script and point my Subversion or Git client. ...