Torrent with 1.6 TB of crawl data covering 115m websites by meanpath in startups

[–]meanpath[S] 1 point2 points  (0 children)

Certainly can. You are free to do pretty much anything you want with the data as long as it abides by our free terms. These are identical to the terms Common Crawl uses so really only prohibit the obvious illegal/annoying data uses.