search-engine

asp.net search application index update help

hi im developing a simple search application( ASP.Net VB.Net) the index table is actually a hash table ll be stored in my file system. the search page ll open this in read mode n copy this to a hash table object n perform search. other update n delete functions will open this in write mode n update it. what should i ve to do to make thi...

Save a binary file in SQL Server as BLOB and text (or get the text from Full-Text index)

Currently we are saving files (PDF, DOC) into the database as BLOB fields. I would like to be able to retrieve the raw text of the file to be able to manipulate it for hit-highlighting and other functions. Does anyone know of a simple way to either parse out the files and save the raw text on save, either via SQL or .net code. I have f...

looking for nutch alternative

i am looking for a open source full featured web search engine like nutch , because nutch is complex and it take much time to penetrate its code and i didnt find a book about it . ...

AJAX - querying a search engine and returning the number of results

Right, so basically I need to query a selection of search engines in an AJAX app. As there is a number of different search engines - there's no search engine specific API I can use either. My main problem is getting the number of results returned by the search. So far, I have decided it is probably best to use a regexp from the returned...

Will dynamic content pulled from a database be indexed by search engines?

I have blog posts stored in an Access database. They are displayed dynamically when ASP pages are loaded. Therefore, there are no publicly accessible files containing the bodies of blog posts except when the user accesses the pages. When search engines index my page, how can I ensure that the blog content is indexed and up to date? ...

Why is Yahoo Indexing Bot considered as "evil"?

After reading and commenting on this question PHP Library for Keeping your site index by Google, Bing, etc, I was curious to look at StackOverFlow's sitemap. This returned a 404 error which I am guessing is just a protected page by determining if your are a Index Bot or simply doesnt exists. This then lead me to look at the robots.tx...

How can create simple PHP Search Engine Submission tool?

How can create simple PHP Search Engine Submission tool, which allows that to submit the site URL to multiple search engines such as "http://www.google.com/addurl?q=www.yoursite.com&hl=&dqq=" and also "http://www.bing.com/webmaster/SubmitSitePage.aspx" or any other other search engines? ...

Assistance with building an inverted-index

It's part of an information retrieval thing I'm doing for school. The plan is to create a hashmap of words using the the first two letters of the word as a key and any words with the two letters saved as a string value. So, hashmap["ba"] = "bad barley base" Once I'm done tokenizing a line I take that hashmap, serialize it, and append i...

what exatly google.setOnLoadCallback(initalize) function means?

while coding javascript and ajax, there is no proper documentation for this function? i searched this term using api src="http://www.google.com/jsapi " and searchControl.execute("abhilashm86"); how does this google.setOnLoadCallback(initalize) called internally? is this function just for a new search term when user clears previous sea...

What are some free or inexpensive tools out there to search/index file systems (using .Net)?

I'm looking for a way to search a file system that contains approximately 1TB of documents in either Office or PDF format. Is Lucene.Net pretty much the best way to accomplish this? I've also heard of dtSearch, and was wondering if anyone had used that tool with any success? Are there any other tools out there that would do the job? ...

Google Custom Search Engine not giving the expected search result.

Hi, I have been trying to create a new google custom search engine, but when I try some query,the search engine it is not giving me the expected search result.On some queries it is working fine, but on other queries, it says"no result". I tried adding the URL of the website that I wanted to search for, but there are certain pages and ke...

how to create a pure browser/javascript search engine for a bunch of static files ?

Hello, I was wondering if there exists some kind of indexer (in python language for exemple) that would take a bunch of html files and create an index in javascript. In the solution, these javascript files would then be used to create a pure browser search engine. Jerome ...

How to get status code of a POST with Asp.Net HttpWebRequest

I'm trying to ping Google when my web site's sitemap is updated but I need to know which status code does Google or any other service returns. My code is below: HttpWebRequest rqst = (HttpWebRequest)WebRequest.Create("http://search.yahooapis.com/ping?sitemap=http%3a%2f%2fhasangursoy.com.tr%2fsitemap.xml"); rqst.Method = "POST"; rqst.Con...

How to build an interactive search engine web interface using python

I have build a static web interface for searching data from some tables in my PostgreSQL database. The query website consists of a simple textfield for entering the search term, the result website presents the results as a simple html table. The server side code for searching the PostgreSQL database and returning the results is written i...

looking to streamline my RSS feed mashup

Hello crafty developers, I have aggregated RSS feeds from various sources with RSSowl, fetching directly from the social mention API. The RSS feeds are categorized into the following major categories: blogs, news, twitter, Q&A and social networking sites. Each major category is nested with a common group of RSS feeds that represent a pa...

Cannot see my wordpress website on google search

Hi guys I recently uploaded a site made with wordpress. The site url is oakabeachvolley.gr I have set on the privacy settings of wordpress for the site to be visible by search engines. However after almost 45 days the site is invisible on google even when I'm searching using the url name and very specific keywords. Since I have ma...

What is the best way to optimize an iframe content in webpage for search engines

I use an iframe in my site , How i can let the search engines robots read and index the content , and the results point to the page including the iframe ? thanks. ...

Search relevance from XML docs (XQuery?) vs MySQL

Hello there, I have a website where documents are saved in xml documents, all with the same structure. I need a search engine where I am able to choose documents with the highest relevance according to the key words given by a searching user. I thought it could (?) be a good idea to have one using XQuery rather than having the informa...

Searchengine bots and meta refresh for disabled Javascript

Hi! I have a website that must have javascript turned on so it can work there is a < noscript> tag that have a meta to redirect the user to a page that alerts him about the disabled javascript... I am wondering, is this a bad thing for search engine crawlers? Because I send an e-mail to myself when someone doesn't have js so I can ...

How do i optimize my video website for all search engines?

I've a site which has numerous videos under various categories and already my video pages are crawled and indexed but i want them to listed as videos instead of indexed pages. So how do i optimize or whether i should submit any special kind of sitemaps to search engines? I want to focus on all search engines but more on google obvious...