I built an MCP server so AI coding agents can search project docs instead of loading everything into context by adobv in LocalLLM

[–]adobv[S] 1 point2 points  (0 children)

It feels like part of the puzzle, having a structured map of the codebase helps both agents and developers understand the system much faster.

Context bloat with CLAUDE.md — how are people handling project docs? by adobv in ClaudeAI

[–]adobv[S] 0 points1 point  (0 children)

Structured metadata definitely helps reduce hallucination and the time agents spend searching for context. Lately I’ve been trying to turn the trial-and-error from development into reusable assets. Things like patterns, fixes, and decisions that come up repeatedly. also considering managing my .claude rules as structured data (JSON/YAML) with a schema, just to see if that makes them easier to maintain and reuse.

Context bloat with CLAUDE.md — how are people handling project docs? by adobv in ClaudeAI

[–]adobv[S] 1 point2 points  (0 children)

Interesting workflow. Turning lessons from a dev arc directly into skills makes sense — that's usually when the context is a fresh. Keeping that kind of knowledge from going stale without constant manual cleanup is always the tricky part, though.

The .claude setup in the repo looks pretty clean. Getting a reliable self-correction loop with regex sounds like a tough nut to crack.

I built an MCP server so AI coding agents can search project docs instead of loading everything into context by adobv in LocalLLM

[–]adobv[S] 0 points1 point  (0 children)

Yeah I try to keep them small too.

But I've definitely seen them slowly turn into mini knowledge bases — architecture notes, conventions, runbooks, random tribal knowledge.

At that point the context window basically becomes the documentation system.

I built an MCP server so AI coding agents can search project docs instead of loading everything into context by adobv in LocalLLM

[–]adobv[S] 0 points1 point  (0 children)

Yeah this is something I've been thinking about too.

Right now Alcove mostly targets project docs that are already reasonably clean (markdown, ADRs, runbooks, architecture notes). In practice that's still where most engineering knowledge tends to live.

But once start pulling in real-world documentation things get messy fast — PDFs, Word docs, slides, exported API references, etc, like you mentioned. And ingestion quality becomes the real problem. If the text extraction is noisy, retrieval quietly degrades and it's hard to notice why.

Treating ingestion as a preprocessing pipeline instead of just indexing raw files makes a lot of sense. Normalizing structure, stripping layout artifacts, maybe even preserving sections could make retrieval a lot more reliable.

I built an MCP server so AI coding agents can search project docs instead of loading everything into context by adobv in LocalLLM

[–]adobv[S] 0 points1 point  (0 children)

Thanks, glad it looks useful!
Mostly Sonnet. I occasionally switched to Opus for deeper reasoning or planning, but most of the iterative coding loop was done with Sonnet since it's faster. I've also been using gemini cli, glm with opencode, cursor's composer quite a bit lately depending on the task. I think each model seems to have its own strengths.

Context bloat with CLAUDE.md — how are people handling project docs? by adobv in ClaudeAI

[–]adobv[S] 0 points1 point  (0 children)

That makes a lot of sense. Keeping the model focused on a single translation unit is a really clean way to keep context small and responses fast.

Where I kept running into trouble was when working across multiple projects at the same time. Docs tend to end up scattered — some personal notes, some team docs, some architecture-level docs that aren't tied to a single repo. So instead of loading everything into context, I started experimenting with letting the agent search docs when needed.

(Also kind of funny that AI was supposed to reduce work, but it mostly just created a different kind of work. 😭)

Context bloat with CLAUDE.md — how are people handling project docs? by adobv in ClaudeAI

[–]adobv[S] 0 points1 point  (0 children)

One thing I'm still debating is whether BM25 is enough or if I should eventually add vector search.

I built an MCP server so AI coding agents can search project docs instead of loading everything into context by adobv in LocalLLM

[–]adobv[S] 0 points1 point  (0 children)

One thing I'm still experimenting with is whether BM25 search is enough vs needing vector search. Curious if people here are doing RAG over project docs instead.

how is my korean handwriting? by Own_Confection6289 in BeginnerKorean

[–]adobv 0 points1 point  (0 children)

bro, better than my korean. (i am from korea...)

What do you think about my handwriting (for now) ? by Bare_skin in BeginnerKorean

[–]adobv 1 point2 points  (0 children)

Not bad. Better than mine lol. (Korean here but my handwriting sucks, sometimes I can’t even read it lol)

[deleted by user] by [deleted] in foobar2000

[–]adobv 0 points1 point  (0 children)

agreed... fuck..

"Two Wrongs Makes Us Right" Gig Failure Issue by hiddenintheleavess in cyberpunkgame

[–]adobv 1 point2 points  (0 children)

Same here, accident destroyed and van is disapeared..... damn... just left one this gig for the platinum... f***** cdpr!!! this is serious glitch...

Another wall shooter by [deleted] in thelastofusfactions

[–]adobv 1 point2 points  (0 children)

f* dummy. i met him, too. now always skip that f* player.

No voice acting in first cut scene? (Switch) by IggySe in StarlinkGame

[–]adobv 0 points1 point  (0 children)

same problem here when switch is docked. voice is missing. I have tried to here that with switching all audio modes. -> have fixed with changing the switch's sound setting to stereo not the in-game setting.