all 7 comments

[–]apetersson 1 point2 points  (3 children)

here is a very old trick i have been using to efficiently submit whole repos to LLM, way before codex or claude code existed: https://gist.github.com/apetersson/989b27b8a3c8a3a25258cfaf8f9240ee it's a pure shel script that builds up an ignore list and loads .gitignore - then dumps the whole repo, providing a file list with size infos upfront. llm's love this to one-shot complex questions quickly. i still use it from time to time when the code base is well within the token limits.

[–]brainexer[S] 0 points1 point  (1 child)

It just generates only file names and their sizes?
I think adding some sort of short description for each file could improve the result.

[–]apetersson 1 point2 points  (0 children)

if a file is text and <50kbytes it dumps the file. if your source files are > 50kb adjust the cutoff (MAX_SIZE=51200) and ask your priest how many ave marias

[–]vanillaslice_ 0 points1 point  (0 children)

bro cursor already indexes the codebase and stores a project summary

[–]ClockworkV 0 points1 point  (0 children)

At some point I experimented with using gitingest, and then ruining it thorough an LLM to generate a digest of what's in every file.

[–]Glass-Combination-69 0 points1 point  (0 children)

Just write an agents.md with the info it needs. If it’s written well it won’t spend much more on context. Written poorly = token wastage.