Some sites have URL patterns such as www.___.com/id=1 through www.___.com/id=1000. How can I crawl such a site using Nutch? Is there any way to provide seeds for fetching over a range?

A: 

I think the easiest way would be to have a script generate your initial list of URLs, one URL per id in the range, and use that list as the seed.
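A minimal sketch of such a script, in Python. It assumes the ids are consecutive integers and that the seed list is a plain text file with one URL per line; `www.example.com`, the `urls/seed.txt` path, and both function names are placeholders chosen for illustration, not anything from the question.

```python
from pathlib import Path


def generate_seed_urls(base, start, end):
    """Return one URL per id in the inclusive range [start, end]."""
    return [f"{base}{i}" for i in range(start, end + 1)]


def write_seed_file(urls, path):
    """Write the URLs to a text file, one per line, creating parent dirs."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text("\n".join(urls) + "\n", encoding="utf-8")


# Example call (hypothetical host and output path; adjust both):
# write_seed_file(
#     generate_seed_urls("http://www.example.com/id=", 1, 1000),
#     "urls/seed.txt",
# )
```

The directory holding the generated file can then be given to Nutch as the seed directory for the inject step. If the ids are not contiguous, the range loop would need to be replaced with whatever list of ids actually exists.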

Pascal Dimassimo