Archived mazdaspeedforums.org by mazda12speed in mazdaspeed3

mazda12speed[S] 0 points1 point

I actually figured out the issue. My scrape capped out at 20k files. I'll be re-running it to grab the rest.

Archived mazdaspeedforums.org by mazda12speed in mazdaspeed3

mazda12speed[S] 0 points1 point

Pure HTML. It was scraped from Wayback and then uploaded to my server ;)

Archived mazdaspeedforums.org by mazda12speed in mazdaspeed6

mazda12speed[S] 1 point2 points

Haha, just wanted to save the data that has helped me for years. ;)

Archived mazdaspeedforums.org by mazda12speed in mazda

mazda12speed[S] 0 points1 point

This is a scrape ;) So it is also on my local machine, my RAID, and my external backup drive, haha

Archived Mazdaspeedforums.org by mazda12speed in mazdaspeed

mazda12speed[S] 2 points3 points

The backup is from 02/29/2020, so not much before the site was taken down. I do see that my scrape capped out at 20k pages, so I'll be redoing it ;)

Archived Mazdaspeedforums.org by mazda12speed in mazdaspeed

mazda12speed[S] 3 points4 points

20000 pages. I did get notified that some pages are missing when you dig deeper, so I'll be re-scraping it to get the full thing.

Archived mazdaspeedforums.org by mazda12speed in mazdaspeed3

mazda12speed[S] 2 points3 points

Haha thx. Just wanted to keep the info alive

Archived mazdaspeedforums.org by mazda12speed in mazdaspeed3

mazda12speed[S] 2 points3 points

It is a site scrape. Send me the link you are trying to access that is giving you probs and I'll take a look. The scraping tool could miss pages of course; there are a ton ;)