hi I wanted to know which is the best webservice provider to choose if I want to write a multi threaded crawler. Appengine,ec2....
Appengine java apparently has some restrictions .. also a free would be great as i am just testing the app.
hi I wanted to know which is the best webservice provider to choose if I want to write a multi threaded crawler. Appengine,ec2....
Appengine java apparently has some restrictions .. also a free would be great as i am just testing the app.
The thing to be aware of is, a crawler is not a Web application. Well, other than using the Web for input. You may eventually build a Web application that uses your crawler's output, but a crawler and a Web service provider are a poor match. A "real" Web application is one that people from all over the world access with their browsers, and which returns Web pages to those browsers.
Assuming that the world needs yet another crawler, why don't you run it on a PC at your home and hook that into the 'net with a standard consumer cable connection? You'll have great bandwidth, you're probably cheaper off to buy your own CPU power rather than to rent it from elsewhere, and you have full control of your system.