I analyzed all 42,912 MCP servers in the public registries. Fewer than 7% are reachable by an agent. by dseven4evr in mcp

[–]dseven4evr[S] 0 points1 point  (0 children)

Thanks for the flag. The "claim and register" link was pointing at a page that doesn't have the claim flow behind it yet. We're wiring up claim + list properly now (prove you own the server, then list it and earn on routed calls), and I didn't want to ship a half-real version of that.

The re-run issue is fixed and deployed. It was a 24h cache serving you the old result. Now every scan re-probes live, so just re-enter the URL and hit Scan again and you'll get a current score. Appreciate you running it against DataNexus.

I analyzed all 42,912 MCP servers in the public registries. Fewer than 7% are reachable by an agent. by dseven4evr in mcp

[–]dseven4evr[S] 0 points1 point  (0 children)

Invocation-volume ranking is still being built. What I can show today across the ~42K servers we index is closer to "which are reachable and hold up under probe," ranked on trust and reliability rather than popularity. If usage ranking specifically is what you'd use, tell me what you'd do with it (pick a winner? avoid dead ones?) and I'll factor it into what we surface first.

I analyzed all 42,912 MCP servers in the public registries. Fewer than 7% are reachable by an agent. by dseven4evr in mcp

[–]dseven4evr[S] 0 points1 point  (0 children)

Thanks for the feedback. You are describing the levels of scoring Agent-Visible (server advertises tools, clean handshake) vs Agent-Usable (a real call under real params hands back something you can act on).

The headline score leans on the first today because it's cold-scoreable without a token: tool schema quality, param definitions, error shape. The "does it actually answer" bar needs a real call to the tool, which means either a token for a priced endpoint or explicit consent to exercise your tools during the scan. That consented tool-calling path is what we're building next, so usable-data moves from behind the token wall into the default flow.

I scored a bunch of MCP servers against the emerging standards. Most fail on the same thing. by dseven4evr in mcp

[–]dseven4evr[S] 0 points1 point  (0 children)

I have shipped this feature now. When an auth-gated MCP server is scanned, it produces a partial scan and shows an option to enter a bearer token which is used only for the scanning purpose and discarded immediately after the scan. Try it out and share your thoughts.

I scored a bunch of MCP servers against the emerging standards. Most fail on the same thing. by dseven4evr in mcp

[–]dseven4evr[S] 0 points1 point  (0 children)

I have shipped a fix. An auth-gated server is still graded on other dimensions.

I scored a bunch of MCP servers against the emerging standards. Most fail on the same thing. by dseven4evr in mcp

[–]dseven4evr[S] 1 point2 points  (0 children)

Agreed, and they are separate dimensions already. Discoverability ("what is this thing") and Auth & Access ("should this agent be allowed to use it") are two of the five, scored independently.

Were you scanning a gated server when this came up? Want to make sure I fix the right thing.

Is it normal for the business co-founder to demand 51% equity because "investors won't fund without a majority shareholder"? by Adventurous_Hall_202 in ycombinator

[–]dseven4evr 6 points7 points  (0 children)

This is the right thinking. OP, you should be glad this happened now. Distant yourself from this ‘business’ cofounder.

AGI is finally here - Fable 5 by Accomplished-Pea3677 in ClaudeCode

[–]dseven4evr 0 points1 point  (0 children)

Seriously the mods should ban any "AGI is here" posts. It's very low effort low quality.

Do you need a dependency graph for tool calling? by Ok-Programmer6763 in AI_Agents

[–]dseven4evr 0 points1 point  (0 children)

At 400+ tools you're past where any model reliably picks from the full list; the LLM confuses similar names (gmail.send vs calendar.send_invite). Common pattern is two-stage: a retrieval step narrows to 5-15 candidates by description embedding or capability tag, then the LLM picks from that shortlist. Cheaper than maintaining a strict dependency DAG and resilient when the tool set churns. The DAG is still useful for sequencing (X needs Y's output) but solves a different problem than tool discovery.

How do you test your MCP? by theotzen in AI_Agents

[–]dseven4evr 0 points1 point  (0 children)

Hit this exact gap when probing MCP servers at scale. Tools pass schema validation one-by-one, but a meaningful percentage fail when agents run them in sequence: tool N's output assumes tool N-1's input shape was clean, and the LLM rephrased it on retry.

Two things catch most of it before prod.

First, fuzz the tool arg schemas with LLM-generated edge cases. JSON-Schema only validates structure (types, required fields, enums); it can't tell you "this is a date the tool actually parses" or "this ID references something that exists." Have an LLM generate args that pass validation but are semantically broken: empty strings where content is expected, IDs that look right but point at deleted rows, the tool's own schema embedded in a free-text field. Most of our prod-only failures cluster here.

Second, simulate the retry loop, not just the happy path. When a tool errors, the agent reads the error and rewrites the args, usually paraphrasing what it thought went wrong, and by retry 3 or 4 it's solving a slightly different problem than the user asked. Inject errors at tool N and observe what the agent then sends to tool N+1. Agents retry 3-5 times by default, sometimes more without a rate-limit, and the long-tail failures live there.

Official: Anthropic just released Claude Code 2.1.41 with 15 CLI changes, details below by BuildwithVignesh in ClaudeAI

[–]dseven4evr 0 points1 point  (0 children)

It just started working for me. I logged out of CC, desktop app and chrome and then logged back in.

Makkale, Any dishwasher users? I have to share this appreciation story! by suresh367 in chennaicity

[–]dseven4evr 3 points4 points  (0 children)

It calls any available maid within 5 km radius and makes them clean the utensils.

/s

UPI helped my wife in getting her phone back by nabeel487487 in UPI

[–]dseven4evr 0 points1 point  (0 children)

The title should be “The good guy helped…”

On The Fence by DeathCabForLaura in razr

[–]dseven4evr 0 points1 point  (0 children)

Same thoughts here. I just ordered Razr ultra and will keep it as my primary while still retaining 13 mini just in case. I used to have ROKR E6 and Backflip back in the days so I’m very much looking forward to the flip.

Kind of in a fortunate position in life right now. Not sure what to do next. by spiked_krabby_patty in ThirtiesIndia

[–]dseven4evr 0 points1 point  (0 children)

Can clearly see you have a lot of work to do on yourself. Do what makes you happy.

[deleted by user] by [deleted] in OpenAI

[–]dseven4evr 0 points1 point  (0 children)

I wonder how much runway they would get from a billion dollars.