search-engine

Is there any trick to access facebook accounts or any other site database?

Can we access the database of facebook or any other social networking site like google which gives us the details for any existing facebook account. How google can access facebook database while yahoo or any other search engine are unable to do so? ...

I have created inverted index for a website but where to store that? Database for a search engine?

What can be the database for a search engine? I mean after creating inverted index for a site, where one could store it so that program can create indices for other sites and save them too. Later on indexer can query them also. Because indices can range in thousands of billions. Thanksyou ...

I want to make a "Frequent searches" engine. ¿directions?

Hi, this post is just to know if anyone here has done something similar and what do you think about the right way to do that. My first idea was start storing every word in the db ignoring words with 2 or less characters and every time a word is repeated just add one to a counter row (say importance) to appear first on the list of frequ...

Bing.com core technology?

We all know bing.com is a Microsoft site, hence it's technology is built upon Microsoft stuck (I can hardly imaging something different). Does any one have an idea about what technologies are used in Bing.com (for serving the pages, for indexing the web, for searching the database.. etc)? /* Strange question you say?! C'mon it's StackOv...

Can you use Search Server 2010 Express with Windows Sharepoint Services 3.0?

We are using Search Server 2008 Express and WSS 3.0. Can we upgrade to Search Server 2010 and still keep WSS? We can NOT upgrade to Sharepoint 2010. Anything I should watch out for if we try to upgrade? Is it even worth doing? Search Server 2010 Express http://technet.microsoft.com/en-us/enterprisesearch/ee263912.aspx#tab=1 ...

Large amounts of static data to populate a webpage - store in a database, or static?

I have a large amount (several gigabytes worth) of archival data that I want to make available to users and search engines through a web interface. Most of the data will rarely change, so I'm debating the best way to store and deliver the data. I would like to ensure that the data loads quickly and efficiently so it can easily be viewed...

Search engine against IEnumerable objects

Hi, We have a .NET application on which has grown beyond what we can develop in-house to search. We can certainly try to make the best search algorithm we can be it's not what we do and someone else has certainly done it better. We have an IEnumerable of object which has a title and a description and we'd like to search and rank it b...

error in your SQL syntax - Getting this error and cannot pinpoint where the error is

I am building a search function on my website and I am having some trouble locating an error in the SQL. There are four SQL statements used to return the counts of the same search using the AND and the OR keywords. The second set of SQL statements return the actual results of the AND and the OR searches limited according to the page numb...

Simple search with Linq To SQL

What is the problem with this query and how can I fix it? public JsonResult Find(string q) { var k = new List<string>(q.Split(' ')); return Json(_dataContext.Jobs .OrderBy(p => new List<string>(p.Keywords.Split(' ')).Where(n => k.Contains(n)).Count()) .Select(p => new { p.Title, p.IsFullTime, p.Location, p.Categ...

In robots.txt, what will Disallow: /?s block?

What will this line block when the search engine crawls the website? Disallow: /?s ...

Twitter Live Search

I was trying to reverse engineer Twitter-Live Search. Maybe we could discuss it here. I am talking about the feature where Tweets are shown even latest to "1 sec ago" etc. Trying to understand how the following might happen - There must be some layer between when the user tweets & when the index (updates) happen. Is this layer MySQL or...

Displaying sample text from the Lucene Search Results

Currently, I am using Lucene version 3.0.2 to create a search application that is similar to a dictionary. One of the objects that I want to display is a sort of "example", where Lucene would look for a word in a book and then the sentences where the words were used are displayed. I've been reading the Lucene in Action book and it menti...

Does Participating in social network increase page rank?

HI, I would like to know whether particpating in social networking sites and placing my website link there increases backlink. And also does search engine crawl those links. Please provide me suggestions? ...

Sphinx Configuring Multiple Sources

We have a modular site and would like to setup searching by module. For example, we have modules: news, customers, inventory So if the user is on the news module and they do a search, the results should only be from the news table. I've been reading Sphinx documentation but haven't seen if this is possible or not. We are using PHP an...

Navigation links in Google (or other search engines) results?

Hello all, Search for 'OpenOffice' on Google, and you'll see that the first search result's description contains navigation links and an option to "Search openoffice.org". If you search for 'WinSplit Revolution' or 'Stack Overflow' or even 'ftp client' , you'll get similar links. Is this something Google does by itself or can it be con...

making my search engine not refresh the page

Hello, I have a basic search engine and I was wondering if it can be done so I don't have to refresh my page, it also has a pagination to it. This is my form.php <?php global $search_term; global $location_term; ?> <form action="index2.php" name="form" style="float:right;width:650px;height:60px;margin-right:30px;" onSubmit="retu...

How does enterprise search display results for the user and hide unauthorized results?

I am looking to understand how enterprise search solutions tackle the issue of user-permissions. My question is on displaying the search results for users. The naive approach would display the search results to the user, and then if the user clicks a document he is not authorized to see, he will fail to open it. However, it is even forb...

is there an algorithm to find out which words in a search-string belong together?

hello guys, I was thinking about text driven search by user input. often you are searching in a database of addresses, where you can find customers and so on. has anybody any idea how to find out which of the typed words is the name, which is the street name, which is the company name? and secondly if the name is a double name like "L...

Our SOLR instance seems to be single-threading and therefore not taking advantage of its multi-proc host

We are running SOLR 1.4.1 (Lucene 2.9.3) on a 2-CPU Linux host, but it seems that only 1 CPU is ever being used. It almost seems like something is single-threading inside the SOLR application. The CPU utilization is very seldom over 0.9 even under load. We are running on virtual Linux hosts and but our other apps in the same cluster a...

Search Engine Help

I have a mediawiki installation that I've customized with some of my own extensions. Here is the basic platform, pretty standard LAMP install. Ubuntu Server Apache 2 Mediawiki 1.15 PHP 5.2.6 MySQL 5.0.67 For the actual MW search I use Lucene (EzMwLucene). I also have custom extension that displays tabular data from a separate datab...