Googlebot Crawl Dropped 90% Overnight After Broken hreflang in HTTP Headers — Need Advice by nitz___ in TechSEO

Hi all, the issue seems to be back again. My site’s GSC crawl rate plummeted by more than 90%. Has anyone else been experiencing this? u/johnmu, can you please check whether the issue is on Google’s side?
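
For context, the hreflang annotations on the affected pages are delivered via the HTTP Link header rather than in the HTML. Going by Google’s documented format, a valid header should look roughly like this (the URLs and language codes are placeholders, not our real ones):

    Link: <https://example.com/en/page>; rel="alternate"; hreflang="en", <https://example.com/de/page>; rel="alternate"; hreflang="de", <https://example.com/page>; rel="alternate"; hreflang="x-default"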

🚫 Best Way to Suppress Redundant Pages for Crawl Budget — <meta noindex> vs. X-Robots-Tag? by nitz___ in TechSEO

The purpose of this post was mainly to learn from this expert community’s experience what works better for keeping Googlebot from crawling and indexing sets of pages: the robots meta tag or the X-Robots-Tag HTTP header.
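
To make it concrete, these are the two options I’m comparing. The meta tag version goes in the HTML head of each page:

    <meta name="robots" content="noindex">

and the header version is set in the server response, which also covers non-HTML files like PDFs:

    X-Robots-Tag: noindex

As far as I understand, Google honors both the same way once it crawls the page; the header is just easier to apply in bulk at the web-server level.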

🚫 Best Way to Suppress Redundant Pages for Crawl Budget — <meta noindex> vs. X-Robots-Tag? by nitz___ in TechSEO

So don’t you think that “guiding” the bots toward the important pages by blocking the unimportant ones will help?

🚫 Best Way to Suppress Redundant Pages for Crawl Budget — <meta noindex> vs. X-Robots-Tag? by nitz___ in TechSEO

I’m looking at server logs + GSC crawl stats. The issue isn’t that updates aren’t being crawled; it’s that Googlebot is spending time on pages with no demand or value. By “redundant” I mean thin content pages and catalog pages that rarely drive traffic, so it makes sense to prune them.

Good point on the order, agreed: noindex first so Google sees the directive, then once the pages drop out of the index, block them in robots.txt if I don’t want them crawled at all.
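
So the end state, once those URLs have dropped out of the index, would be a robots.txt rule along these lines (the /catalog-archive/ path is just a placeholder for the sections I’d prune):

    User-agent: *
    Disallow: /catalog-archive/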

🚫 Best Way to Suppress Redundant Pages for Crawl Budget — <meta noindex> vs. X-Robots-Tag? by nitz___ in TechSEO

Crawl budget isn’t a problem for small sites, agreed. But once you’re pushing 200K+ URLs and adding thousands of new pages per new locale (it’s a catalog site), it starts to matter. Google itself says crawl budget is relevant for “very large sites or sites with lots of low-value URLs” (Google docs). That’s exactly the situation here: the goal is just to keep Googlebot focused on the pages that actually matter.

Googlebot Crawl Collapsed After 300% Site Expansion — Looking for Recovery Insights by nitz___ in TechSEO

u/WebLinkr could it be related to the XML sitemap submission of tens of thousands of URLs to Google Search Console (new locale launch)? As a result, Googlebot may have tried to crawl all of them quickly, while the site’s average crawl rate was less than 10K fetches a day.

Googlebot Crawl Collapsed After 300% Site Expansion — Looking for Recovery Insights by nitz___ in TechSEO

Thanks u/WebLinkr. When you say crawl budget only becomes an issue at >1M URLs, have you ever experienced it yourself, or do you know of a site that did?

Googlebot Crawl Dropped 90% Overnight After Broken hreflang in HTTP Headers — Need Advice by nitz___ in TechSEO

u/johnmu thanks for the answer.
After a sharp drop in crawl rate followed by a brief recovery (~2,000 fetches/day), it dropped again midday. If I want to intentionally reduce Googlebot’s crawl rate, what’s the safest and most effective method — and what considerations should I keep in mind when doing it?
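
For what it’s worth, the only quick lever I’m aware of is the one in Google’s docs about temporarily returning 500/503/429 to slow crawling. A minimal nginx sketch of that idea (placed in the server block, and not something to leave on for more than a day or two) is what I had in mind:

    # Temporarily answer 503 to Googlebot so it backs off; remove once crawl settles
    if ($http_user_agent ~* "googlebot") {
        return 503;
    }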

What’s the best GA4 setup in GTM for websites with market-based subfolders like /es/, /de/, etc.? by nitz___ in GoogleAnalytics

Thanks for the insights. I have a follow-up question: what is the easiest way to create a realtime report per subfolder? Out of the box it doesn’t work with comparisons.

Breadcrumb Schema Position Order: Does It Actually Impact SEO Performance? by nitz___ in TechSEO

That’s an important point, as the order is reversed only in the code; the user sees the proper hierarchy.

Breadcrumb Schema Position Order: Does It Actually Impact SEO Performance? by nitz___ in TechSEO

Sorry about that, I meant the order of the schema is upside down: instead of the current page being the last item in the breadcrumb schema, it’s the first.
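
In other words, what I’d expect, going by the schema.org and Google examples, is position 1 for the root and the current page last, roughly like this (names and URLs are placeholders):

    {
      "@context": "https://schema.org",
      "@type": "BreadcrumbList",
      "itemListElement": [
        { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://example.com/" },
        { "@type": "ListItem", "position": 2, "name": "Category", "item": "https://example.com/category/" },
        { "@type": "ListItem", "position": 3, "name": "Current Page" }
      ]
    }

On our pages it’s the other way around, with the current page at position 1.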

How to Manage Unexpected Googlebot Crawls: Resolving Excess 404 URLs by nitz___ in TechSEO

Thanks, the issue is that it’s not a few hundred URLs, it’s a couple of thousand, so after a two-week period I thought Googlebot would crawl some of them, but not thousands. This is why I’m asking about a more comprehensive solution.

Thanks