indexing

Eclipse indexer implementation

I'd like to implement an Eclipse plugin that parses .properties files and remembers the keys from these files so they can be searched quickly. It should work similarly to Open Type or Open Resource, but with property keys. I don't know what the best practice is for implementing such a plugin. Should it work as a builder? Should it register itself for work...

What are the fastest/most popular search technologies?

What are the fastest search technologies apart from relational DB searches? I have a collection of text files from varied sources (banks/ledgers/stock markets). Each line in these text files is a record. Each line can be further parsed into DB columns (stock name/date of purchase/owner/...). It is not necessary that each line has a...

sqlite Indexing Performance Advice

I have an SQLite database in my iPhone app that I access via the Core Data framework. I'm using NSPredicates to query the database. I am building a search function that needs to search six different varchar fields that contain text. At the moment, it's very slow and I need to improve performance, probably in the SQLite database. Would ...

index with multiple columns - ok when doing query on only one column?

If I have a table create table sv ( id integer, data text ) and an index: create index myindex_idx on sv (id, data) would this still be useful if I did a query select * from sv where id = 10? My reason for asking is that I'm looking through a set of tables without any indexes, and seeing different combinations of select queries...
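One way to check this is to ask the database for its query plan. The sketch below is a minimal, hedged example using Python's built-in sqlite3 module; it assumes the index was meant to cover (id, data), since the table has no column named text, and the helper name plan_for_leading_column_query is made up for illustration. Other database engines may plan this differently.

```python
import sqlite3


def plan_for_leading_column_query():
    """Return the EXPLAIN QUERY PLAN detail text for a lookup on `id` only."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table sv (id integer, data text)")
    # Composite index; `id` is the leading (leftmost) column.
    # Assumption: the second column is `data`, matching the table definition.
    conn.execute("create index myindex_idx on sv (id, data)")
    conn.executemany(
        "insert into sv values (?, ?)",
        [(i, f"row {i}") for i in range(1000)],
    )
    rows = conn.execute(
        "explain query plan select * from sv where id = 10"
    ).fetchall()
    conn.close()
    # The last field of each plan row is the human-readable description.
    return [row[-1] for row in rows]


if __name__ == "__main__":
    for detail in plan_for_leading_column_query():
        print(detail)
```

If the plan text mentions myindex_idx, SQLite is using the composite index for the single-column lookup, which is what you would expect when the filtered column is the leading column of the index.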

Has anyone built an nhibernate provider for Microsoft Index Server?

This may be an odd question! But I was wondering if anyone has ever attempted this. I guess the first question most people would ask is why!! With Index Server you can add additional metadata to standard file information (name, size etc). I want to be able to expose this information to a BL component via NHibernate. I need to ...

get similarity score between two document termfreqvectors

Hi, I would like to extract a similarity score between two document termfreqvectors. I found that if I submit the first one as a query and look for the second in the result set, I cannot get the precise score that Lucene gives for these two vectors. Any help? ...
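For reference, one common way to compare two term-frequency vectors directly is plain cosine similarity. The sketch below is only an illustration in Python: it is not Lucene's scoring formula and does not use the Lucene API, and the function name cosine_similarity and the sample documents are assumptions made up for the example.

```python
import math
from collections import Counter


def cosine_similarity(tf_a, tf_b):
    """Cosine similarity between two term -> frequency mappings."""
    # Dot product over the terms of the first vector; terms missing
    # from the second vector contribute zero.
    dot = sum(count * tf_b.get(term, 0) for term, count in tf_a.items())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Hypothetical documents, tokenized by whitespace.
    doc1 = Counter("the quick brown fox".split())
    doc2 = Counter("the lazy brown dog".split())
    print(cosine_similarity(doc1, doc2))
```

A score computed this way will generally differ from the score a search engine reports for a query, because engines typically add their own weighting and normalization on top of raw term frequencies.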

Index over a column with only 5 distinct values - Worth it?

I have a table with a potential of up to 5.000.000 rows. One of the columns in this table is used alone in queries, but there are only 5 possible values for this column. Currently I have 10.000 rows, and according to the explain plan it makes no sense to use my index on that column. Will it ever, or shouldn't I bother with an index? Edi...

Index usage with OUTER JOIN that contains IN statement

SELECT a.*, b.* FROM a LEFT OUTER JOIN b ON b.user IN (:userlist) AND b.key = a.fk_to_b WHERE a.user IN (:userlist) OR b.user IN (:userlist) Table b has an index of: (user, key) The database only uses the index when the :userlist parameter contains a single value. When :userlist contains multiple values (which ...

How do website pages get indexed by search engines?

How do website pages get indexed by search engines? Does this depend on the number of times the page is visited? I have a site whose default page got indexed, but not the other pages, which are at the same level as the default page! ...

Efficient query to lookup stuff in a word index

I have two tables defined like this: Page(id), Index(page_id, word) page_id in Index is a foreign key to Page so that each Page is connected to a group of Index entries. The Index table is an index for the Page table so that you can do fast text searching. E.g.: SELECT page_id FROM Index where word = 'hello' Would select all page_id'...
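The schema described above can be tried out with Python's built-in sqlite3 module. This is a rough sketch under a few assumptions that are not in the question: the helper names, the sample pages, the extra index on (word, page_id), and the multi-word lookup (the excerpt is cut off, so the GROUP BY/HAVING query is only one guess at a natural follow-up). Index is a reserved word in SQLite, so the table name is quoted.

```python
import sqlite3


def build_db(pages):
    """Create Page and "Index" tables from a {page_id: text} mapping."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE Page (id INTEGER PRIMARY KEY)")
    conn.execute(
        'CREATE TABLE "Index" (page_id INTEGER REFERENCES Page(id), word TEXT)'
    )
    # Assumed index (not in the question) so lookups by word avoid a full scan.
    conn.execute('CREATE INDEX idx_word_page ON "Index" (word, page_id)')
    for page_id, text in pages.items():
        conn.execute("INSERT INTO Page (id) VALUES (?)", (page_id,))
        conn.executemany(
            'INSERT INTO "Index" (page_id, word) VALUES (?, ?)',
            [(page_id, w) for w in set(text.lower().split())],
        )
    return conn


def pages_with_word(conn, word):
    """The single-word lookup from the question."""
    rows = conn.execute(
        'SELECT page_id FROM "Index" WHERE word = ? ORDER BY page_id', (word,)
    )
    return [r[0] for r in rows]


def pages_with_all_words(conn, words):
    """Hypothetical extension: pages that contain every one of the words."""
    words = sorted(set(words))
    if not words:
        return []
    placeholders = ", ".join("?" for _ in words)
    sql = (
        f'SELECT page_id FROM "Index" WHERE word IN ({placeholders}) '
        "GROUP BY page_id HAVING COUNT(DISTINCT word) = ? ORDER BY page_id"
    )
    return [r[0] for r in conn.execute(sql, (*words, len(words)))]


if __name__ == "__main__":
    db = build_db({1: "hello world", 2: "hello there", 3: "goodbye world"})
    print(pages_with_word(db, "hello"))
    print(pages_with_all_words(db, ["hello", "world"]))
```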

Thinking Sphinx - Foreign key with different type - Association problem

Hello, I have two tables in MySQL: users and management. The users table has a numeric id, and the management table has a varchar foreign key which is the primary key of the other table. The types are not the same, and this seems to be the main problem when I build an index from the User model, and try to include one column from the ma...

How do I assign an empty matrix to elements of a cell array in MATLAB?

I want to manipulate a cell array and make certain indices of the cell array contain the empty matrix []. I can't seem to figure out how to do this: >> yy=num2cell(1:10) yy = [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] >> yy{1:2:end}=[] ??? The right hand side of this assignment has too few values to satisf...

Text searching and filtering of documents stored on the file system in an ASP.NET application

Hello Experts, We are building a job site application in which we will store the resumes of all the candidates; we plan to store them on the file system. Now we need to search inside those files and provide the results to the user, and we need to know what the best solution is for implementing text searching. I have just tried to identify it ...

Oracle's Bitmap Index in SQL Server

I've been successfully using bitmap indexes for years in an Oracle environment. Now I'm working on a data warehouse project which will be hosted on SQL Server 2005. So far, I have been unable to find an equivalent of bitmap indexes in SQL Server. Any ideas? ...

MySQL query performance on two indexes + sorting

Hi, I'm implementing the following access policy: a User can access a Resource if he created it, belongs to the Resource's group members, or if the resource is publicly available. Here is my DB structure (tables are MyISAM): User (1K-10K Users) id nickname … index user_name(id, nickname) Group (1K) id … Resource (10K-100K) ...

Does creating a nonclustered index on a SQL Server 2005 table prevent selects?

I'd like to create an index on a view I have, but I need to make sure the data can still be read while the index is being created. I was reading an article that suggested that when creating a nonclustered index, the data is still readable if you specify the ONLINE=ON option (example below): CREATE UNIQUE CLUSTERED INDEX CLUST_IDX_SQ...

SQL Indexing - Computed Column vs Field Used by Computed Column

Quick question for the DBAs out there: Say I have 2 columns on my table, IsDeleted (bit) and DeletedDate (datetime). The table contains approx 10,000,000 rows. IsDeleted is a computed column that checks whether DeletedDate is NULL; it returns 1 if it is not, and 0 if it is. Querying this table will mainly be done on the IsDele...

How to calculate Column Selectivity

I want to create an index on a table and I'm trying to decide the order of the columns in the index. Normally, I go by gut feeling and my knowledge of the data and of the queries the application will typically generate, but in this case, looking at the data and the queries, I think I need to create an index based on column selectivity. So...
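A common working definition of selectivity is the number of distinct values in a column divided by the total number of rows, so a value near 1.0 means almost every row is unique. The sketch below computes that ratio with Python's built-in sqlite3 module; the function name column_selectivity and the orders table are made up for illustration, and the same COUNT(DISTINCT ...) / COUNT(*) query should carry over to most SQL databases.

```python
import sqlite3


def column_selectivity(conn, table, column):
    """Return distinct values / total rows for one column (0.0 if empty).

    `table` and `column` are interpolated into the SQL text, so they must
    come from trusted input such as your own schema, never from users.
    """
    distinct_count, total = conn.execute(
        f'SELECT COUNT(DISTINCT "{column}"), COUNT(*) FROM "{table}"'
    ).fetchone()
    return distinct_count / total if total else 0.0


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    # Hypothetical table: `id` is unique, `status` has only two values.
    conn.execute("CREATE TABLE orders (id INTEGER, status TEXT)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [(1, "open"), (2, "open"), (3, "closed"), (4, "open")],
    )
    for col in ("id", "status"):
        print(col, column_selectivity(conn, "orders", col))
```

Comparing the ratios for the candidate columns gives a rough, data-driven starting point for ordering them, although the queries the application actually runs still matter.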

How can I index HTML documents?

I am using Lucene.NET to do full-text searching. Until now I have been indexing PDF docs, but now I have a few webpages that I need to index. What's the best/easiest way to index HTML documents to add to my Lucene index? I am using .NET/C# ...

How do table indexes come into play when using a view?

I have a view that collects data from several tables. While there are no indexes on the view itself, anything that uses the view seems to benefit from the underlying tables having indexes. Are these being used automatically? If they are, then what is the point of creating indexes on your views? Any recommended articles on this subject ...