indexing

What are the practical uses of the metaphone() and soundex() functions in PHP?

Metaphone and Soundex are phonetic algorithms for indexing strings by their English pronunciation. Have you ever used the metaphone() or soundex() functions from the standard PHP library? What for? What are real-life uses of these functions? ...
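To make "indexing by pronunciation" concrete, here is a minimal Python sketch of the classic American Soundex rules (keep the first letter, map consonants to digits, collapse adjacent duplicates, pad to four characters). It is an illustration only and is not claimed to match PHP's soundex() on every input; the function name and the mapping table are written for this example.

```python
# Consonant groups of classic American Soundex; vowels, Y, H and W have no code.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word):
    """Return a 4-character Soundex key for an English word (illustrative sketch)."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    key = letters[0]
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            # H and W are skipped and do not separate letters with the same code.
            continue
        code = CODES.get(ch, "")
        if code and code != prev:
            key += code
        # A vowel resets prev to "", so a repeated code after a vowel is kept.
        prev = code
    return (key + "000")[:4]


# Names that sound alike share a key, so the key can serve as an index column.
print(soundex("Robert"), soundex("Rupert"))
```

Storing such a key next to each name lets a lookup match on the key instead of the exact spelling, which is the typical use in fuzzy name search.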

Any good way to handle repeats when using Lucene indexing?

Hi all, I am using Lucene to index my documents. In my case, each document is rather small, but there is a large quantity of them (~2GB). Each document contains many repeating words or terms. I am wondering whether indexing with Lucene is the right approach, or what preprocessing I should do on the documents before indexing. Th...

SQL Server index tips

Hi, I have been reading for the last two days about indexes and how to optimize a database. Although I have a clear idea of what indexes are, I don't yet know how to optimize my database in practice. Could you suggest a tutorial or technique for optimizing a database? Please be specific, as I have been reading lots of theory but no specific instr...

Creating indexes for optimizing the execution of Stored Procedures

Hi All, The WHERE clause of one of my queries looks like this: and tbl0.Type = 'Alert' AND (tbl0.AccessRights like '%'+TblCUG0.userGroup+'%' or tbl0.AccessRights like 'All' ) AND (tbl0.ExpiryDate > CONVERT(varchar(8), GETDATE(), 1) or tbl0.ExpiryDate is null) order by tbl0.Priority,tbl0.PublishedDate desc, tbl0.Title asc ...

Django - Is it OK to erase Django-generated database indexes?

AFAIK Django creates indexes for you in the database (I use MySQL). As these indexes are preventing me from altering column names as part of a South migration I'm performing, I would like to remove them. Is it legitimate to manually erase Django's indexes? Will Django recreate them? ...

How to optimize delete with date range where clause?

I created an Android app with a function that periodically deletes old records: delete from tablea where col1 = 'value1' and col2 = 'value2' and postdate < '2010-06-14' It has a performance problem when the total number of rows in tablea is more than 50,000. It takes around 45 seconds to delete 500 records. I already have index f...

MD5 hash for URLs in a unique index

I asked this before in a slightly different form, but did not get the answer I was looking for. My question is: do I need to store md5($url) in a unique index in MySQL? I have seen this in some code, but I don't remember exactly. This is a large database with more than 5 million URLs, and the indexing is done by calling url...

Constraints to follow for Indexing in MySQL

Hello, I will be indexing my database tables. The constraint I currently follow is to index the columns that are used most often in the "where" clauses of my queries. Is that the right constraint, or are there other constraints or checks to follow? And how would indexing affect the database as a whole? ...

How can I quickly search my code using unix?

I frequently search for a token, perhaps a function name, throughout my codebase. My traditional method would be to grep for the term itself. However, the codebase is so large that I can't do this efficiently (it takes minutes). Is there a way to do this efficiently? ack (which ignores irrelevant files such as revision control files) i...

Can Google see the contents of an iframe when spidering?

I've looked this up and have not found consistent answers. I want to embed a Google Doc in my page (when you publish your Google Doc, it gives you an iframe). Will search engines like Google be able to read the contents of the document (just text, but it may have important keywords)? Or will they act as if the page was empty? If it cannot ind...

Numpy interconversion between multidimensional and linear indexing

I'm looking for a fast way to interconvert between linear and multidimensional indexing in Numpy. To make my usage concrete, I have a large collection of N particles, each assigned 5 float values (dimensions) giving an Nx5 array. I then bin each dimension using numpy.digitize with an appropriate choice of bin boundaries to assign each...
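As a hedged illustration of the conversion being asked about, the sketch below does the row-major (C-order) arithmetic in plain Python. NumPy provides `np.ravel_multi_index` and `np.unravel_index` for the same conversion on whole arrays; the helper names and the bin counts per dimension used here are assumptions made up for the example, not values from the question.

```python
from math import prod


def ravel_index(multi_index, shape):
    """Convert a multidimensional bin index to a single row-major linear index."""
    linear = 0
    for idx, size in zip(multi_index, shape):
        if not 0 <= idx < size:
            raise IndexError(f"index {idx} out of range for dimension of size {size}")
        # Horner-style accumulation: shift by the dimension size, then add the index.
        linear = linear * size + idx
    return linear


def unravel_index(linear, shape):
    """Convert a row-major linear index back to a tuple of per-dimension indices."""
    if not 0 <= linear < prod(shape):
        raise IndexError(f"linear index {linear} out of range for shape {shape}")
    coords = []
    # Peel off the fastest-varying (last) dimension first.
    for size in reversed(shape):
        linear, idx = divmod(linear, size)
        coords.append(idx)
    return tuple(reversed(coords))


# Hypothetical bin counts for the 5 dimensions of one particle.
shape = (4, 3, 2, 5, 6)
bins = (1, 2, 0, 3, 4)
flat = ravel_index(bins, shape)
print(flat, unravel_index(flat, shape))
```

With one linear index per particle, particles that fall into the same 5-dimensional bin share the same integer, which makes grouping or counting them a one-dimensional problem.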

Solr indexing problem

Hello, I am new to Solr. When I index the files, every variable gets indexed, but some are not searchable. How can I stop Solr from displaying any results in that case? ...

MySQL indexes to set for a query

What indexes do I have to set for a query like this? Thanks. SELECT distinct event_dates.* FROM `event_dates` INNER JOIN `events` ON `events`.id = `event_dates`.event_id INNER JOIN `cores` ON `cores`.resource_id = `events`.id AND cores.resource_type = 'Event' INNER JOIN `cores_kinds` ON `cores_kinds`.core_id = `cores`.id INNER JOIN `kin...

Pass index to temporary table from regular table?

I am creating a temp table with a query like this: CREATE TEMPORARY TABLE temp_table SELECT * FROM regular_table WHERE 1 But regular_table has a FULLTEXT index on some of the fields. When I try to do a FULLTEXT search on the new temporary table, I get an error telling me "Can't find FULLTEXT index matching the column list". So obviously th...

Future proofing FULLTEXT search..

I have a FULLTEXT index called cart_product_fti on a table. Whenever I use it, I have to list all the fields that are in the index exactly: MATCH (foo,bar,etc) My concern is that if I later decide to add or remove a field in the index, all the scripts will have to be updated or they will break. So I am wondering if there is a...

Linked Server and Cached Indexes

I have a DTS package that is running in Development, SIT, and UAT. The SQL Servers in all of the environments have the same linked servers set up, and their tech specs are similar. The DTS package takes different amounts of time to execute. The package should take about 3 hours to run. There are indexes on the tables that are access...

Basic optimisation with an index for MySQL

Hi, I have a question about basic MySQL database optimisation. I have 3 tables: Articles, Tags and Taggings (which is a join table).

Articles    Taggings      Tags
id          id            id
name        article_id    name
            tag_id

I am retrieving the articles that exactly match the ...

Unable to add an index on a field with InnoDB?

Hey all, I have an InnoDB table that I'm trying to add an index to, and I'm getting the following error: Deadlock found when trying to get lock; try restarting transaction The syntax is: ALTER TABLE mytable ADD INDEX (fieldtoindex); Any ideas as to why that would throw a deadlock error? Thanks! ...

SQL Timeout and indices

When changing and finding data in a database containing a few dozen tables, with around half a million rows in the big ones, I run into timeouts quite often. Some of these timeouts I don't understand. For example, I have this table: CREATE TABLE dbo.[VPI_APO] ( [Key] bigint IDENTITY(1,1) NOT NULL CONSTRAINT [PK_VPI_APO] PRIMARY KEY, ...

MySQL index size and duplication questions

Hello, I have a few general questions about database indexes. Answers to any of them are welcome: How do I calculate the total memory occupied by an index? Is the size of an index directly proportional to its cardinality? Do we need to index the primary key, or is it indexed on its own? And how will it affect the memory and processing spe...