Hi all.
I have implemented a multithreaded crawler in C#. Using a custom threadpool, there's a job queue, all pages to be downloaded are queued up and each thread takes one and downloads.
using 15 threads, When crawling one site only, it's smooth as silk and gets done fast. When crawling several sites on different servers at the same time, I get TONS of timeouts.
Might this have anything to do with DNS resolve? What would you think would cause this to happen?
Thanks. Roey