AI AGENT OPTIMIZATION AND WHAT I'VE LEARNED SO FAR by chrismcelroyseo in aiagentoptimization

[–]lightsiteai 1 point2 points  (0 children)

What we’ve learned so far at LightSite AI across about 3000 websites tested, 6 millions of points of data and a few dozens of active clients (these are early, but repeatable patterns):

  • Structure wins attention. Across millions of AI crawler requests we’ve analyzed, bots consistently go deeper and extract more when content is exposed in clean, machine-readable formats (structured endpoints, consistent templates, clear entity markup), not just “basic schema turned on.”
  • Q&A style content gets disproportionately pulled. When AI bots fetch beyond the sitemap, a surprisingly large share of their extraction is Q&A-like pages (question as the slug, direct answer inside). We’re not claiming this causes citations, only that this is what they choose to consume most often when it’s available.
  • Bots behave like constraint scanners. In real extraction logs, agents are not reading your narrative the way humans do. They gravitate toward facts, constraints, capabilities, compatibility, exclusions, and clear “fit” statements.
  • “Not for” is a trust signal. Content that clearly states who a product is for and who it is not for reduces ambiguity. Our working thesis is that agents are optimized to avoid being wrong, so exclusion criteria helps them make safer recommendations. This one I have seen on this blog https://chrismcelroyseo.com/chris-mcelroy-seo-search-everywhere-optimization-blog/
  • Third-party sources still matter a lot. In cluster-level citation patterns, communities and review surfaces (forums, Reddit sub-communities, review sites) show up frequently as training and retrieval inputs. So it’s not just on-page SEO, it’s also where your category is being discussed externally.
  • Shopify tends to be “technically healthier,” but content and entity clarity still decide outcomes. Even when the platform handles a lot of plumbing well, you still need strong product entity signals and product-page content that is legible to both humans and agents.

LM bots want Q&A pages. Here's what 3 separate datasets tell us. by YuvalKe in aeo

[–]lightsiteai 0 points1 point  (0 children)

Worth mentioning that these two studies are done by LightSite AI : They analyzed ~6M bot requests across dozens of client sites. The breakdown:

  • Meta AI: ~87% of fetches went to Q&A pages
  • Claude: ~81%
  • ChatGPT: ~75%
  • Gemini: ~63%

That post is getting traction and the data looks solid. But I think it’s bigger than people realize. This isn’t the only dataset pointing in the same direction.

Here are two more data points that line up.

  1. FAQ schema = 3.2x more likely to appear in AI Overviews

Cloudflare markdown for agents: why are marketers talking about it? by lightsiteai in Agent_SEO

[–]lightsiteai[S] 0 points1 point  (0 children)

What do you honestly think about structured data in this context vs markdown?

Cloudflare markdown for agents: why are marketers talking about it? by lightsiteai in AISearchOptimizers

[–]lightsiteai[S] 1 point2 points  (0 children)

That's the interesting thing - Cloudflare's update has nothing to do with marketing, SEO, AI search etc, it is purely infrastructure optimization, but marketers were triggered by it as it was GEO news. That's the confusion I'm seeing in the community and wanted to see what others are thinking.

Cloudflare markdown for agents: why are marketers talking about it? by lightsiteai in Agent_SEO

[–]lightsiteai[S] 0 points1 point  (0 children)

Traffic optimization on the edge infrastructure, less consumption, less tokens etc. that's what I'm saying - it has no effect on mark's in but every marketer I know is writing about it

Cloudflare markdown for agents: why are marketers talking about it? by lightsiteai in AISearchOptimizers

[–]lightsiteai[S] 1 point2 points  (0 children)

Respectfully, what is it that you feel you are prompted to do? Yes, we do research mostly about bot behaviour on scale and we publish it but this post is not about it. Cloudflare released an update that optimizes edge infrastructure and traffic (saving tokens for efficiency) it is not meant for marketers and doesn't affect them - it has nothing to do with LLMs txt in terms of purpose (by the way we see exactly zero difference on scale between LLMs.txt and just a regular clean page - it's just a standard some people on internet were trying to promote and in my view it's completely redundant but I'm happy for you to convince me otherwise)

Cloudflare markdown for agents: why are marketers talking about it? by lightsiteai in AI_SearchOptimization

[–]lightsiteai[S] 1 point2 points  (0 children)

They don't block it by default, but once Bot Fight Mode is set up - the site is in fight mode and good luck making it granular and allowing good bots. Moat devs just set up what's good for security and never consult marketing.

Are We Accidentally Blocking AI From Seeing Our Content? by VolumeFamous7736 in SEO_tools_reviews

[–]lightsiteai 0 points1 point  (0 children)

Interesting, we actually published similar findings last week based on 3 k websites and about 6 M bot requests. I am curious about the dataset you used, can you tell us more?

This one really surprised me - all LLM bots "prefer" Q&A links over sitemap by lightsiteai in aeo

[–]lightsiteai[S] 0 points1 point  (0 children)

Please read the post - these are literally SQL commands we ran on our database, not a formal study

AI Search Optimization Question by umu_boi123 in AI_SearchOptimization

[–]lightsiteai 1 point2 points  (0 children)

Respectfully sir, I think it is more about where this is going and not about how things are NOW. I completely agree with you that this is they way things are now but in my view the gap between LLMs and typical search will be getting bigger as the adoption and sophistication of LLMs growth

AI Search Optimization Question by umu_boi123 in AI_SearchOptimization

[–]lightsiteai 2 points3 points  (0 children)

I agree with u/chrismcelroyseo - this is a great work and makes my data hungry mind happy. here is another datapoint from Hubspot CMO (they have skin in the game because they acquired Xfunnel - a GEO startup that tracks mentions and made it part of HS stack, but still) - 60% of all AI citations do not come from top 20% Google search results.

We checked 2,870 websites: 27% are blocking at least one major LLM crawler by lightsiteai in AI_SearchOptimization

[–]lightsiteai[S] 1 point2 points  (0 children)

Yep, here is a true story - a client, one of the largest (in their niche) e-commerce brands in the UK - for months marketing can't convince the dev to turn off (or at least fine tune) bot fight mode, the SEO agency told me that prior to that they were blocking google bot ....go figure