full-text-indexing

How to do weighted MySQL match search

Hi guys, so we can use mysql to search for full text indexed fields via the MATCH keyword... my question is...is there a native way to do this in which certain column's index is given greater weight than the other So for instance if I search using the indexes title, keywords, and description...is there a way to make entries in title a...

Speed up Oracle Text indexing or let the indexer work only on low load times

We're using a Oracle Text CTXSYS.CONTEXT index to index about half a million rows containing metainformation. The information is spread over two tables that are combined by a procedure that the indexer calls at runtime (functional index). When I run the CREATE INDEX on my local machine (simple dualcore notebook) the index is built in ab...

Creating a Full Text Index search.

I've created a blog and I wish to search through certain tables in my MySQL databases and then return results for the user on a separate search page. I do not wish to use Google CSE. How would I go about creating this for my site. I found a post on StackOverflow.com from a friend of mine in which he wished to make his more efficient. How...

Is it possible to get a list of relevant words from a full-text index given a specific row?

I wuld like to do som automatic tagging of incoming text in our system and I was wondering if a full-text index is capable of providing a a ranked list of words given an indexed row. If not, do you have any suggestions on how to do this? We already have a system in place for auto tagging but its fairly brute-force (aka. slow) in its met...

Full-Text Indexing/Searching for Paging

Greetings Overflowers, I'm currently not using full-text indexing/searching capabilities of any relational database engine. This is because they do not satisfy my application complex query requirements: searching by Arabic root/template/stem/diacritics... etc ("/" stands for and/or) using logical operators between them (in 1) such as ...

FULLTEXT key too long

I get the following error when trying to add a FULLTEXT index to Title, Edition and Author in my MySQL database: Specified key was too long; max key length is 1000 bytes Here are the columns: `Title` varchar(255) NOT NULL, `Edition` varchar(20) default NULL, `Authors` varchar(255) default NULL, None of them are unique.. Even th...

apache cassandra query/full text search

Hi everyone, I've been playing around with apache's cassandra project. Done a fair bit of readin and i have some fairly complex examples that i've done, including inserting single and batch sets of data, retrieving a single and multiple data sets based on keys. Some of the articles i've looked at include http://www.rackspacecloud.com/...

TIFF IFilter not properly reading text in VARBINARY column.

link textI want to use the TIFF IFilter built in to Windows 2008 Server R2 with Full-Text search in SQL Server 2008... also R2. I have installed the filter through server manager and updated the "Force TIFF IFilter to perform OCR for every page in a TIFF document" Local Group Policy setting in Computer Configuration -> Administrative Te...

Network drive indexing frequency

The company I work for have millions of documents that are stored and shared on multiple network drives mapped to users' drives (e.g.] d:\ to \server1\, etc). What I'd like to implement is to crawl over network drives and let users find files fast using a full-text indexing. My current indexing strategy is Lucene.net But I am not su...

Sample Database for Full Text Searching

I am looking to do some benchmarking on Full Text Search indexes in PostgreSQL, SQLServer and Lucene. Any ideas on where to find a good big sample database to perform queries against? Thanks a lot in advance. ...

Slow MySQL full text search

Hi guys, I'm using this query to perform a full text search on a MySQL database: SELECT DISTINCT questions.id, questions.uniquecode, questions.spam, questions.questiondate, questions.userid, questions.description, users.login AS username, questions.questiontext, questions.totalvotes, MATCH(questions.questiontext, questions.uniquecode...

How can I set up Solr to tokenize on whitespace and punctuation?

I have been trying to get my Solr schema (using Solr 1.3.0) to create terms that are tokenized by whitespace and punctuation. Here are some examples on what I would like to see happen: terms given -> terms tokenized foo-bar -> foo,bar one2three4 -> one2three4 multiple words/and some-punctuation -> multiple,words,and,some,punctuation ...

Two Applications using the same index file with Hibernate Search

Hi, I want to know if it is possible to use the same index file for an entity in two applications. Let me be more specific: We have an online Application with a frondend for the users and an application for the backend tasks (= administrator interface). Both are running on the same JBOSS AS. Both Applications are using the same databas...

Cannot get right results from postgre full-text search

Hi fellas, I'm developing a simple articles website in brazilian portuguese language. The search feature is based on a full-text search, but it isn't returning expected results. I made this on postgresql. Here is the simplified table: Artigos -id -title -- article title -intro -- article introduction -content -- article body -publishd...

How much extra storage does MySQL fulltext index consume?

I am looking for a way to estimate how much extra storage will be needed if a MySQL TEXT column gets a fulltext index. Intuitively speaking the size of the fulltext index is dependent on the total length of text but what exactly is affecting it? ...

How to index a web site.

I'm asking on behalf of somebody, so I don't have too many details. What options are available for indexing site content in an ASP.NET web site? I suspect SQL Server's Full Text index may be used if the page content is stored in the database. How would I index dynamic and static content if that content isn't stored in the DB, but in ...