I am looking for a good open source bot to determine some quality, often required for google indexing.
For example
- find duplicate titles
- invalid links ( jspider do this, and I think a lot more will do this)
- exactly the same page, but different urls
- etc, where etc equals google quality reqs.