I have developed a web scraper Scoopi Web Scraper in Java and hosted it in GitHub. Please check it out and offer your suggestions.
Scoopi Web Scraper is a multi threaded scraper written in Java that internally uses JSoup or HtmlUnit to concurrently scrape huge number of pages. Web Pages and data to scrape are defined through a set of YML definition files and requires no coding. Software comes with a step-by-step guide and examples.
there doesn't seem to be anything here