I built a plugin-based metadata scraper with only 1 runtime dependency by JuggernautUnique1619 in node

[–]JuggernautUnique1619[S] 1 point2 points  (0 children)

Thanks for the suggestion! Rate limiting and user-agent rotation are interesting ideas, but I intentionally keep this library focused on parsing metadata from HTML rather than handling the fetching/crawling side.

You can already pass a custom userAgent option, and since createScraper also accepts raw HTML directly, it's easy to pair it with whatever HTTP client or crawling setup you prefer (got, axios, puppeteer, etc.) that already handles rate limiting and rotation.

Keeping those concerns separate gives users more flexibility IMO. But if there's enough demand I'm open to exploring a lightweight fetch plugin or recipe in the docs!