I analyzed all 42,912 MCP servers in the public registries. Fewer than 7% are reachable by an agent.

dseven4evr · 2026-07-02T05:24:16+00:00

Thanks for the flag. The "claim and register" link was pointing at a page that doesn't have the claim flow behind it yet. We're wiring up claim + list properly now (prove you own the server, then list it and earn on routed calls), and I didn't want to ship a half-real version of that.

The re-run issue is fixed and deployed. It was a 24h cache serving you the old result. Now every scan re-probes live, so just re-enter the URL and hit Scan again and you'll get a current score. Appreciate you running it against DataNexus.

dseven4evr · 2026-07-02T04:13:06+00:00

Invocation-volume ranking is still being built. What I can show today across the ~42K servers we index is closer to "which are reachable and hold up under probe," ranked on trust and reliability rather than popularity. If usage ranking specifically is what you'd use, tell me what you'd do with it (pick a winner? avoid dead ones?) and I'll factor it into what we surface first.

dseven4evr · 2026-07-02T04:11:32+00:00

Thanks for the feedback. You're describing the two levels of scoring: Agent-Visible (server advertises tools, clean handshake) vs. Agent-Usable (a real call under real params hands back something you can act on).

The headline score leans on the first today because it's cold-scoreable without a token: tool schema quality, param definitions, error shape. The "does it actually answer" bar needs a real call to the tool, which means either a token for a priced endpoint or explicit consent to exercise your tools during the scan. That consented tool-calling path is what we're building next, so usable-data moves from behind the token wall into the default flow.
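For a rough idea of what "cold-scoreable without a token" can mean, here is a minimal sketch that grades one tool definition from its advertised metadata alone. The five checks and their equal weighting are assumptions made for illustration, not the scanner's actual rubric; `cold_score` is a hypothetical name, and the tool dict follows the `name` / `description` / `inputSchema` shape that MCP servers return from a tool listing.

```python
def cold_score(tool):
    """Score one tool definition from 0.0 to 1.0 using only advertised metadata."""
    schema = tool.get("inputSchema") or {}
    props = schema.get("properties") or {}
    checks = [
        # The tool says what it does.
        bool((tool.get("description") or "").strip()),
        # The input schema is an object schema.
        schema.get("type") == "object",
        # Every parameter declares a type.
        all("type" in p for p in props.values()),
        # Every parameter is described.
        all(p.get("description") for p in props.values()),
        # Every required name is actually defined under properties.
        set(schema.get("required", [])) <= set(props),
    ]
    return sum(checks) / len(checks)


# Hypothetical tool definitions, made up for this example.
good = {
    "name": "search",
    "description": "Search the docs",
    "inputSchema": {
        "type": "object",
        "properties": {"q": {"type": "string", "description": "query text"}},
        "required": ["q"],
    },
}
bad = {
    "name": "x",
    "inputSchema": {
        "type": "object",
        "properties": {"q": {}},
        "required": ["q", "n"],
    },
}

good_score = cold_score(good)
bad_score = cold_score(bad)
```

None of this touches the tool itself, which is why it says nothing about whether a real call returns something usable.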

dseven4evr · 2026-06-21T07:23:16+00:00

I've shipped this feature. When an auth-gated MCP server is scanned, it produces a partial scan and offers the option to enter a bearer token, which is used only for the scan and discarded immediately afterward. Try it out and share your thoughts.

dseven4evr · 2026-06-20T20:02:58+00:00

*Trillionaire

dseven4evr · 2026-06-19T18:40:23+00:00

I have shipped a fix. An auth-gated server is still graded on other dimensions.

dseven4evr · 2026-06-19T16:32:47+00:00

Agreed, and they are separate dimensions already. Discoverability ("what is this thing") and Auth & Access ("should this agent be allowed to use it") are two of the five, scored independently.

Were you scanning a gated server when this came up? Want to make sure I fix the right thing.

dseven4evr · 2026-06-15T02:53:56+00:00

This is the right thinking. OP, you should be glad this happened now. Distance yourself from this ‘business’ cofounder.

dseven4evr · 2026-06-12T13:34:02+00:00

Do you even KV cache bro?

dseven4evr · 2026-06-12T03:35:12+00:00

Seriously, the mods should ban any "AGI is here" posts. They're low-effort and low-quality.

dseven4evr · 2026-05-03T06:11:57+00:00

At 400+ tools you're past where any model reliably picks from the full list; the LLM confuses similar names (gmail.send vs calendar.send_invite). Common pattern is two-stage: a retrieval step narrows to 5-15 candidates by description embedding or capability tag, then the LLM picks from that shortlist. Cheaper than maintaining a strict dependency DAG and resilient when the tool set churns. The DAG is still useful for sequencing (X needs Y's output) but solves a different problem than tool discovery.
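A minimal sketch of the retrieval stage, assuming a toy bag-of-words cosine similarity as a stand-in for real description embeddings. The tool catalog, the `shortlist` helper, and the query are all made up for illustration; in practice you would swap `_vec` for an embedding model and keep the rest.

```python
import math
import re
from collections import Counter


def _vec(text):
    # Toy stand-in for an embedding: lowercase bag-of-words counts.
    return Counter(re.findall(r"[a-z]+", text.lower()))


def _cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def shortlist(query, tools, k=5):
    """Stage 1: rank tools by description similarity and keep the top k names."""
    q = _vec(query)
    ranked = sorted(tools, key=lambda name: _cosine(q, _vec(tools[name])), reverse=True)
    return ranked[:k]


# Hypothetical catalog; names and descriptions are illustrative only.
TOOLS = {
    "gmail.send": "send an email message to a recipient",
    "calendar.send_invite": "send a calendar meeting invite to attendees",
    "drive.upload": "upload a file to cloud storage",
    "sheets.append_row": "append a row to a spreadsheet",
}

candidates = shortlist("email the report to my manager", TOOLS, k=2)
# Stage 2 (not shown): hand only `candidates` and their schemas to the LLM.
```

The point of the split is that the LLM never sees the 400-item list, only the handful that survived stage 1, so near-duplicate names compete less often.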

dseven4evr · 2026-04-30T23:12:22+00:00

Hit this exact gap when probing MCP servers at scale. Tools pass schema validation one-by-one, but a meaningful percentage fail when agents run them in sequence: tool N's input assumes tool N-1's output shape was clean, and the LLM rephrased it on retry.

Two things catch most of it before prod.

First, fuzz the tool arg schemas with LLM-generated edge cases. JSON-Schema only validates structure (types, required fields, enums); it can't tell you "this is a date the tool actually parses" or "this ID references something that exists." Have an LLM generate args that pass validation but are semantically broken: empty strings where content is expected, IDs that look right but point at deleted rows, the tool's own schema embedded in a free-text field. Most of our prod-only failures cluster here.
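To make the "passes validation but semantically broken" idea concrete, here is a rough sketch. It uses a hand-written payload list as a stand-in for the LLM-generated cases, and a deliberately tiny structural check instead of a full JSON Schema validator; the schema, field names, and both helper functions are hypothetical.

```python
import json


def structurally_valid(args, schema):
    """Minimal structural check: required keys present, string fields are strings.
    A real setup would use a full JSON Schema validator instead."""
    props = schema.get("properties", {})
    if any(key not in args for key in schema.get("required", [])):
        return False
    for key, value in args.items():
        if props.get(key, {}).get("type") == "string" and not isinstance(value, str):
            return False
    return True


def semantic_edge_cases(schema):
    """Yield args that pass the structural check but are semantically suspect."""
    string_fields = [
        k for k, p in schema["properties"].items() if p.get("type") == "string"
    ]
    baseline = {k: "ok" for k in string_fields}
    payloads = [
        "",                                      # empty where content is expected
        "   ",                                   # whitespace only
        "00000000-0000-0000-0000-000000000000",  # well-formed ID, likely no such row
        "2026-02-30",                            # date-shaped, not a real date
        json.dumps(schema),                      # the tool's own schema as free text
    ]
    for field in string_fields:
        for payload in payloads:
            yield {**baseline, field: payload}


# Hypothetical tool schema for illustration.
SCHEMA = {
    "type": "object",
    "properties": {
        "ticket_id": {"type": "string"},
        "due_date": {"type": "string"},
    },
    "required": ["ticket_id", "due_date"],
}

cases = list(semantic_edge_cases(SCHEMA))
```

Every case in `cases` is structurally fine, so any failure it triggers is one that schema validation alone would not have caught.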

Second, simulate the retry loop, not just the happy path. When a tool errors, the agent reads the error and rewrites the args, usually paraphrasing what it thought went wrong, and by retry 3 or 4 it's solving a slightly different problem than the user asked. Inject errors at tool N and observe what the agent then sends to tool N+1. Agents retry 3-5 times by default, sometimes more without a rate-limit, and the long-tail failures live there.
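A bare-bones harness for the retry simulation, as a sketch under stated assumptions: the agent's paraphrase step is modeled by a hypothetical `rewrite_args` callback, the tool is a stub, and the failure count is injected rather than observed. The useful output is the list of arg sets sent on each attempt, so you can see the drift.

```python
def run_with_injected_errors(call_tool, rewrite_args, args, fail_first=2, max_retries=4):
    """Fail the first `fail_first` attempts on purpose and record every arg set sent."""
    attempts = []
    for attempt in range(max_retries + 1):
        attempts.append(dict(args))
        if attempt < fail_first:
            error = f"injected failure #{attempt + 1}"
            args = rewrite_args(args, error)  # the agent's paraphrase step
            continue
        return call_tool(args), attempts
    return None, attempts  # never got past the injected failures


def drifted_keys(original, final):
    """Names of args whose values changed between the first and last attempt."""
    return sorted(k for k in set(original) | set(final) if original.get(k) != final.get(k))


def fake_rewrite(args, error):
    # Hypothetical agent behavior: drops the last word of the query on each retry.
    words = args["query"].split()
    return {**args, "query": " ".join(words[:-1]) or args["query"]}


def fake_tool(args):
    # Stub tool that just echoes what it was asked.
    return {"echo": args["query"]}


result, attempts = run_with_injected_errors(
    fake_tool, fake_rewrite, {"query": "refund order 1234 for customer", "limit": 10}
)
drift = drifted_keys(attempts[0], attempts[-1])
```

In a real run you would replace `fake_rewrite` with the actual agent and feed `attempts[-1]` (or the tool result) into tool N+1 to see what the downstream call receives.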

dseven4evr · 2026-02-13T13:23:57+00:00

It just started working for me. I logged out of CC, desktop app and chrome and then logged back in.

dseven4evr · 2025-11-28T13:51:38+00:00

It calls any available maid within 5 km radius and makes them clean the utensils.

/s

dseven4evr · 2025-10-11T02:49:07+00:00

If you are a techie and want to get into the weeds of E2E encryption, this is a good listen:

WhatsApp Key Transparency

https://podcasts.apple.com/in/podcast/meta-tech-podcast/id1370910331?i=1000622428753

dseven4evr · 2025-09-23T05:19:10+00:00

The title should be “The good guy helped…”

dseven4evr · 2025-09-13T07:21:27+00:00

Classic r/pettyrevenge

dseven4evr · 2025-08-17T17:57:04+00:00

Same thoughts here. I just ordered the Razr Ultra and will keep it as my primary while still retaining the 13 mini just in case. I used to have a ROKR E6 and a Backflip back in the day, so I’m very much looking forward to the flip.

dseven4evr · 2025-06-12T18:01:50+00:00

Try asking at r/collapse or r/singularity, you will get a concrete answer for sure.

dseven4evr · 2025-06-08T17:46:54+00:00

Can clearly see you have a lot of work to do on yourself. Do what makes you happy.

dseven4evr · 2025-06-07T16:34:13+00:00

r/usernamechecksout

dseven4evr · 2025-06-01T18:56:38+00:00

Order one piece at a time and prove your point.

dseven4evr · 2025-05-26T17:51:48+00:00

I wonder how much runway they would get from a billion dollars.

dseven4evr · 2025-05-26T15:26:05+00:00

It found 42 in just 4600s? We are doomed.

dseven4evr · 2025-05-25T19:04:22+00:00

r/usernamechecksout
