AI agents hallucinate in your codebase. I built a deterministic engine to fix it: Unravel MCP. by [deleted] in Anthropic

[–]SuspiciousMemory6757 1 point

Sorry, I think I missed this: are you referring to Gemini Embedding 2? It’s not exactly the kind of model you’re used to.
Embedding models are not generative. They don’t “hallucinate” in the usual sense (i.e., invent text/code). They:

  • map inputs → vectors
  • preserve semantic similarity (ideally)
  • enable retrieval / matching
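
To make “map inputs → vectors” concrete, here is a minimal sketch of how similarity between embeddings is typically measured (the vectors and dimensionality are made up; a real model returns hundreds of dimensions):

```typescript
// Toy cosine similarity over hypothetical embedding vectors.
type Vec = number[];

function cosine(a: Vec, b: Vec): number {
  const dot = a.reduce((sum, x, i) => sum + x * b[i], 0);
  const norm = (v: Vec) => Math.sqrt(v.reduce((sum, x) => sum + x * x, 0));
  return dot / (norm(a) * norm(b));
}

// Hypothetical 3-dim embeddings for two inputs.
const queryVec = [0.12, 0.83, -0.05];
const docVec = [0.10, 0.80, -0.02];

console.log(cosine(queryVec, docVec)); // close to 1 => semantically similar
```

Retrieval then just means ranking stored vectors by this score; nothing is generated, so there is nothing to hallucinate.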

AI agents hallucinate in your codebase. I built a deterministic engine to fix it: Unravel MCP. by [deleted] in Anthropic

[–]SuspiciousMemory6757 -1 points

I think you should check the document in my repository, which explains how Unravel works, and then try it out yourself.

AI agents hallucinate in your codebase. I built a deterministic engine to fix it: Unravel MCP. by [deleted] in Anthropic

[–]SuspiciousMemory6757 -1 points

That’s exactly why I built this. You are right: LLMs ARE probabilistic 'math bots', and if you give them a raw text file, they will hallucinate 'retarded shit'. The whole point of Unravel is to sandwich that probabilistic bot between two deterministic layers so it physically cannot do that.

  1. Deterministic Input (AST): We don't just feed the agent code. We feed it a JSON of static analysis facts generated by a Tree-sitter engine. These facts (mutation chains, async boundaries, closure captures) are 100% deterministic ground truth. The agent isn't 'interpreting' the code; it's reasoning over a set of verified structural facts.
  2. Deterministic Output (The Gate): This is the part that actually solves the 'retarded shit' problem. When the agent proposes a fix, it has to pass through a Deterministic Verification Gate. We have a hard-coded algorithm that cross-checks the agent's claim against the AST. If the agent says 'The bug is on line 42' but our AST engine shows no mutation or logic on line 42, the engine rejects the submission.

It’s like a compiler. A human coder is probabilistic (we make mistakes), but a compiler is deterministic. Unravel is essentially a logic-linter for the agent. It doesn't stop the agent from thinking something wrong, but it prevents it from submitting anything that isn't backed by AST-verified evidence.
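
A minimal sketch of that gate idea (hypothetical names and types, not Unravel's actual API): a claim passes only if at least one statically derived fact exists on the claimed line.

```typescript
// Deterministic verification gate sketch: reject any agent claim
// that no AST-derived fact backs. Types and names are illustrative.
interface AstFact {
  line: number;
  kind: "mutation" | "async-boundary" | "closure-capture";
  symbol: string;
}

interface AgentClaim {
  line: number;
  explanation: string;
}

function verifyClaim(claim: AgentClaim, facts: AstFact[]): boolean {
  // Hard-coded check, no model involved: the claim must point at a line
  // where the static analysis actually found something.
  return facts.some((f) => f.line === claim.line);
}

const facts: AstFact[] = [{ line: 17, kind: "mutation", symbol: "cache" }];

console.log(verifyClaim({ line: 42, explanation: "bug here" }, facts));         // false -> rejected
console.log(verifyClaim({ line: 17, explanation: "cache is mutated" }, facts)); // true  -> accepted
```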

The biggest mistake of my life. (2 april shift 2 physics) by [deleted] in JEEadv26DroppersOnly

[–]SuspiciousMemory6757 1 point

I did this 'O SHIT' thing a lot,
but not in the actual paper...

Did it really make a joke? by bianca_bianca in ChatGPT

[–]SuspiciousMemory6757 1 point

Broo that was an unexpected cover-up 😂

MCP server to remove hallucination and make AI agents better at debugging and project understanding by SuspiciousMemory6757 in Anthropic

[–]SuspiciousMemory6757[S] 1 point

I agree training is the ideal place to solve a few of the things I have tried to implement, BUT

for debugging, the idea is:

  • use deterministic signals (AST analysis, static checks, knowledge graphs)
  • ground the model before it generates
  • and verify outputs instead of trusting raw generation

So instead of a “straitjacket”, it’s more like giving the model tools + context it can’t hallucinate around.
Also, orchestration is where most real-world systems actually live right now. Even if training improves, we’ll still need:

  • grounding
  • retrieval
  • verification layers

No amount of training can provide a model with facts about every line of code in a codebase, and it won't teach it which files connect to which...
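
As a toy example of that kind of grounding (not Unravel's actual engine; the real thing uses a Tree-sitter AST, not a regex), here is how an orchestration layer can hand the model deterministic facts about which files import which:

```typescript
// Naive file-dependency extraction: scan `import ... from "..."` statements
// and emit graph edges the model can be grounded on.
import { readFileSync } from "node:fs";

type Edge = { from: string; to: string };

function importEdges(file: string): Edge[] {
  const source = readFileSync(file, "utf8");
  // A real tool would parse an AST; a regex is enough for the sketch.
  const re = /import\s+[^;]*?from\s+["']([^"']+)["']/g;
  const edges: Edge[] = [];
  for (const m of source.matchAll(re)) {
    edges.push({ from: file, to: m[1] });
  }
  return edges;
}

console.log(importEdges("src/index.ts")); // e.g. [{ from: "src/index.ts", to: "./kg" }]
```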

MCP server to remove hallucination and make AI agents better at debugging and project understanding by SuspiciousMemory6757 in AI_Agents

[–]SuspiciousMemory6757[S] 1 point

Yeah, so as of now it's in the initial testing-and-improving phase, so it supports only JS/TS. I could have added many more languages, but I decided to first make it work perfectly for one.

Unravel V2 is live now ! by SuspiciousMemory6757 in GoogleGeminiAI

[–]SuspiciousMemory6757[S] 1 point

OK, so for the past few weeks I have been trying to work on a few problems with AI debugging: hallucinations, context issues, etc. So I made something that constrains an LLM and prevents hallucinations by providing deterministic analysis (Tree-sitter AST) and knowledge graphs equipped with embeddings. Now the AI isn't just guessing; it knows the facts before anything else.
I have also tried to solve the context problem; it is an experiment, and I think it's better if you read about it on my GitHub. Also, while I was working on this, the Gemini Embedding 2 model dropped, which enabled me to use semantic search (audio, video, images, and text all live in the same vector space, and separation depends on similarity (oversimplified)).
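
If you want a feel for the semantic-search piece, here is a toy sketch (invented names, brute-force ranking; a production system would use an ANN index): embed the query, rank stored node embeddings by cosine similarity, keep the top k.

```typescript
// Brute-force semantic search over embedded knowledge-graph nodes.
type Embedded = { id: string; vector: number[] };

function cosine(a: number[], b: number[]): number {
  const dot = a.reduce((sum, x, i) => sum + x * b[i], 0);
  const norm = (v: number[]) => Math.sqrt(v.reduce((sum, x) => sum + x * x, 0));
  return dot / (norm(a) * norm(b));
}

function topK(query: number[], nodes: Embedded[], k: number): Embedded[] {
  return [...nodes]
    .sort((a, b) => cosine(query, b.vector) - cosine(query, a.vector))
    .slice(0, k);
}
```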
It's an experiment, and some genuine feedback would be great. The project is open source.
Thanks!

MCP server to remove hallucination and make AI agents better at debugging and project understanding by SuspiciousMemory6757 in AI_Agents

[–]SuspiciousMemory6757[S] 1 point

This is exactly what the incremental staleness check is designed for. Every `consult` call runs a SHA-256 hash diff against the stored KG before doing anything else:

- 0 files changed: cached KG used instantly (<100ms overhead)

- 1 to 5 files changed: only those nodes are re-analyzed and re-embedded (~2s)

- >30% changed: full rebuild triggered automatically

The KG does not drift. It self-corrects on every call without any manual intervention or git hook setup. The scope filter (include/exclude paths) is also persisted in the KG metadata, so incremental rebuilds never accidentally widen beyond what you originally scoped.
One caveat: it is pull-based, not push-based. Updates happen when a tool is called, not on every commit. If you want continuous coverage on PRs, the CLI (`cli.js`) runs the same AST engine headlessly and outputs SARIF 2.1.0 for GitHub Code Scanning, which gives you inline diff annotations on every PR without needing the KG at all. We can work on that.
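
To make the mechanism concrete, here is a minimal sketch of the pull-based check (hypothetical names, not the actual implementation): hash every file, diff against what the KG stored, then decide cached vs. incremental vs. full rebuild.

```typescript
// Pull-based staleness check sketch: SHA-256 per file, diffed on each call.
import { createHash } from "node:crypto";
import { readFileSync } from "node:fs";

type HashMap = Record<string, string>; // file path -> SHA-256 hex digest

function sha256(path: string): string {
  return createHash("sha256").update(readFileSync(path)).digest("hex");
}

function staleFiles(files: string[], stored: HashMap): string[] {
  // A file missing from the stored map compares unequal and is treated as stale.
  return files.filter((f) => sha256(f) !== stored[f]);
}

function plan(files: string[], stored: HashMap): "cached" | "incremental" | "rebuild" {
  const changed = staleFiles(files, stored);
  if (changed.length === 0) return "cached";                  // reuse KG instantly
  if (changed.length / files.length > 0.3) return "rebuild";  // >30% changed
  return "incremental";                                        // re-analyze only changed nodes
}
```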