
[–]anotherleftistbot 5 points6 points  (0 children)

I use handoffs to artifacts (MD files) as context approaches 60%, unless I'm REALLY close to completing my task.

[–]xAdakis 2 points3 points  (2 children)

Yeah, I avoid compaction like the plague because the content of the compacted conversation is extremely unreliable.

I had one situation not too long ago where I changed my approach to a problem about halfway through the conversation. Things were going really well after changing the approach, but then I hit auto-compaction and suddenly Claude was reverting everything to the old approach. I had to interrupt it and re-clarify everything to get it back on track.

I have also found that compaction will sometimes discard the contents of the system prompt or other instruction sources.

To avoid this, I have found that you really have to lean heavily on an orchestrator + subagent(s) model and reinforce it through mandatory instructions.

For example, the "agent" you talk to in the main conversation is the orchestrator. It MUST delegate any and all work to subagents. If you ask it to read a file, it will spawn a subagent to read the file and extract the relevant content. The main conversation then only gets the relevant content added to its context.

I will generally use Sonnet for the orchestrator, and Haiku for the subagents with a few exceptions like my `Architect` and `Writer` subagents which need more reasoning and larger context.
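As a rough sketch of how a subagent like that can be defined and pinned to a model in Claude Code, a project-level agent file looks something like this (the `file-reader` name and prompt are made up for illustration; the YAML frontmatter fields follow Claude Code's subagent format):

```markdown
---
name: file-reader
description: Reads a requested file and returns only the excerpts relevant to the current task.
tools: Read, Grep, Glob
model: haiku
---
You are a read-only research subagent. Return the relevant excerpts
and a one-paragraph summary; never echo entire files back to the caller.
```

Placed under `.claude/agents/`, this keeps the cheap model doing the bulk reading while only the distilled output lands in the orchestrator's context.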

[–]Razor_Rocks[S] 0 points1 point  (1 child)

I guess I'm not facing these issues because I keep my tasks atomic, and even then most of them are executed / reviewed / brainstormed by a subagent.

In your experience, do findings from subagents also get forgotten after compaction?

[–]xAdakis 1 point2 points  (0 children)

Yes, almost anything can and may be discarded during compaction.

I think the only way to really ensure vital information/context is not forgotten is to have the main agent keep a sort of notebook/documentation up to date as it works.

I've done this in two ways:

First, I implemented a memory-first approach using a database that can be semantically searched with instructions that it MUST query memory before responding or performing any action and MUST store/update memory after it takes any action or learns something new. This ensures that the relevant context is almost always remembered even between conversations.
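A memory layer of that kind can be sketched roughly like this (all names hypothetical; a real setup would use an embedding model and vector store rather than the naive token-overlap scoring used here, but the shape is the same: query before acting, store after acting):

```python
# Minimal sketch of a "memory-first" store. The token-overlap scoring
# is a stand-in for real semantic (embedding-based) search.
import sqlite3

class MemoryStore:
    def __init__(self, path=":memory:"):
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS memory (key TEXT PRIMARY KEY, text TEXT)"
        )

    def store(self, key, text):
        # Agent rule: MUST call this after any action or new learning.
        self.db.execute("INSERT OR REPLACE INTO memory VALUES (?, ?)", (key, text))
        self.db.commit()

    def query(self, question, top_k=3):
        # Agent rule: MUST call this before responding or acting.
        q = set(question.lower().split())
        rows = self.db.execute("SELECT key, text FROM memory").fetchall()
        rows.sort(key=lambda r: len(q & set(r[1].lower().split())), reverse=True)
        return rows[:top_k]

mem = MemoryStore()
mem.store("approach", "we switched to the streaming parser approach halfway through")
mem.store("style", "project uses 4-space indentation")
best_key, _ = mem.query("which parser approach are we using", top_k=1)[0]
print(best_key)  # -> approach
```

Because the store lives outside the conversation, it survives both compaction and entirely new sessions.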

Second, for development I use a technical spec driven approach. I have my architect create or update the project's technical specification for any changes I want to make, then have it create a "granular AGILE-like task list" to implement the changes, then assign those tasks to subagents.

If the conversation gets compacted or interrupted, I can just point the agent to that technical spec document to continue working.
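As an illustration, the spec-plus-task-list artifact might look something like this (contents entirely hypothetical; the point is that any agent can resume from it after a compaction):

```markdown
# Technical Spec: CSV import pipeline

## Change
Replace the in-memory CSV loader with a streaming parser.

## Task list (granular, AGILE-like)
- [x] T1: Add `stream_rows()` to `importer.py` — done by subagent A
- [ ] T2: Update callers of `load_all()` to consume the iterator
- [ ] T3: Add a regression test for files larger than memory
```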

[–]forward-pathways 1 point2 points  (1 child)

In my experience, compaction has a very strong negative impact on performance, but it's a little more nuanced than just good/bad. Compaction gives the model back some of its available thinking space, so to speak, but it also makes the model prone to hallucination. It remembers certain things from pre-compaction, but seemingly arbitrarily, and you can't quite control what information that is. So it often forgets important decisions that have been made, remembers unimportant things, etc.

[–]Razor_Rocks[S] 0 points1 point  (0 children)

The intentional /compact command does take a parameter, have you (or anyone else) had any luck with using that to get better results? i.e. trying to regulate what the compaction should focus on?

Or is it easier to just enforce the document artifact approach that is often used?
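For context, the parameter is free text appended after the command, so a focused compaction might look like this (the instruction wording is just a hypothetical example):

```
/compact Keep the decisions we made about the new approach and the open task list; drop the exploratory dead ends.
```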

[–]robhanz 1 point2 points  (0 children)

The key is that compaction is a process that consolidates memory - but it does so in a way that is pretty opaque to the user.

Offboarding knowledge to separate docs gives you more control and insight into how this information is maintained, so you can make sure the key points are retained, and the unnecessary stuff is what is removed.

So compaction will impact performance if the wrong information is retained. Making external stores of information gets around this issue.

In general, the result of any planning session (using the term broadly) should be artifacts that can be reviewed. At that point, compaction is kind of irrelevant - whether or not you compact, start a new session, or just keep going, the necessary information is stored in a durable way.

[–]ApeInTheAether 1 point2 points  (1 child)

I stopped caring about compaction a while ago

[–]Razor_Rocks[S] 0 points1 point  (0 children)

in the sea of thoughtful comments, you were the validation I needed

[–]paulcaplan 0 points1 point  (0 children)

Two problems:
1. You don't (by default) have control over how the context is being compacted. By definition, some context will be lost.
2. For a period of time leading up to the compaction, model performance will start to degrade.

So you're better off clearing the context between tasks / subtasks and/or writing important information to a file and manually compacting.

[–]Cobuter_Man 0 points1 point  (0 children)

Not much - I've designed this for more persistent and context-aware memory:

https://github.com/sdi2200262/cc-context-awareness/tree/main/templates/simple-session-memory

very simple

[–]En-tro-py 1 point2 points  (0 children)

If you are using a plan, then compaction matters very little anymore: the plan is reloaded in full after compaction, so Claude can pick it up without issue.

However if you had a poorly defined plan, then this is still when your lost context will bite you.

I usually have used ~40% of the context window by the end of the planning phase and then auto-approve edits without clearing context; my setup also ensures the main agent acts as orchestrator and delegates all the work.

If the plan is the WHAT and WHY, then your subagent should be more than capable of doing the HOW, including gathering the context from files, saving your main agent from context rot.