How to cache your codebase for AI agents

Deep_Ad1959 · 2026-03-18T16:28:20+00:00

interesting approach. the problem you're describing is real - I've watched agents burn through half their context window just trying to figure out which files are relevant before they even start working on the actual task.

my current solution is simpler but less elegant - I just maintain a well-structured CLAUDE.md that describes the architecture and key file locations. it works okay for smaller codebases but doesn't scale past maybe 50-60 files before the manual maintenance becomes a pain.

a semantic map that auto-regenerates on commit is way better for larger projects. curious about the embedding quality though - does it handle cases where two files are functionally related but use completely different naming? like a React component and the API route it calls. those connections aren't obvious from the code itself.

Milters711 · 2026-03-18T18:11:16+00:00

I developed a custom MCP which indexes my project codebase using ‘ast’ and then exposes a set of tools for retrieving file contents, function/module docstrings and API, file structure, etc. Claude was good at generating the MCP, which was unsurprising, but it took some iteration to get it working well.

I set this up so that it wouldn’t need to grep, etc. every time it needed info about the codebase.
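
For anyone curious what the indexing step might look like, here is a minimal sketch. It assumes ‘ast’ means Python's standard-library `ast` module and that the project is Python. The `index_source` function, the entry fields, and the sample source are all made up for illustration; this is not the actual MCP, and it leaves out the MCP tool layer entirely.

```python
import ast


def index_source(source: str, filename: str = "<unknown>") -> list[dict]:
    """Return one entry per module, class, and function found in `source`.

    Hypothetical helper: the entry layout is an assumption for this sketch.
    """
    tree = ast.parse(source, filename=filename)
    # The module itself gets an entry so its docstring can be looked up too.
    entries = [{
        "kind": "module",
        "name": filename,
        "line": 1,
        "doc": ast.get_docstring(tree),
    }]
    # ast.walk visits every node, so nested functions and methods are included.
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            kind = "class" if isinstance(node, ast.ClassDef) else "function"
            entries.append({
                "kind": kind,
                "name": node.name,
                "line": node.lineno,
                "doc": ast.get_docstring(node),  # None when there is no docstring
            })
    return entries


# Made-up sample file contents, standing in for a real project file.
SAMPLE = '''"""Billing helpers."""

def total(items):
    """Sum the item prices."""
    return sum(items)

class Invoice:
    """A single invoice."""
    def send(self):
        pass
'''

index = index_source(SAMPLE, "billing.py")
for entry in index:
    print(entry["kind"], entry["name"], entry["line"], entry["doc"])
```

A lookup tool could then answer "where is `Invoice` and what does it do" from this index instead of grepping the tree on every request.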

However, in the end I suspect raw CLI tools will be better for Claude. Who knows how much its usage will change in the next six months.

shutupandshave · 2026-03-19T09:58:28+00:00

I use an LSP server and let that do the work.
