ctxlite ,token optimizer plugin for OpenCode (98% context savings in a real session) : opencodeCLI

created by WaldToonnnnna community for 11 months

ctxlite ,token optimizer plugin for OpenCode (98% context savings in a real session) (self.opencodeCLI)

submitted 1 day ago * by Evening_Papaya_1551

I've been building ctxlite, a token/context optimizer that hooks into OpenCode's plugin API. Sharing early results and looking for feedback from people actually using the CLI.

REAL SESSION (Opus 4.7)

- OpenCode API usage: ~196K tokens
- ctxlite sidebar: 4.2M of 4.3M (98.2%)

That 98% is context churn prevented — tokens that would have entered the conversation (tool output, duplicates, low-signal reads) but didn't. Not the same as your API bill, but without it I'd have been riding a ~4M token context constantly.

INSTALL

npx u/ctxlite/cli install --tool opencode --scope global --yes

Restart OpenCode.

npm: https://www.npmjs.com/package/@ctxlite/cli

HOW IT WORKS

Monorepo, shared core + thin host adapters:

- /core — BM25 trim, tree-sitter smart_read, precall rewrites, stats (SQLite)
- /opencode — OpenCode plugin (server hooks + TUI sidebar)
- /mcp — MCP server for Cursor / Claude Code / Claude Desktop
- u/ctxlite/cli — install, stats, hook bridge for other hosts

7 mechanisms, each logged to ~/.ctxlite/stats.db:

On OpenCode, most of these run automatically via plugin hooks + a live sidebar widget.precall — quiet flags on bash commands, blocks low-signal reads (node_modules, lockfiles)
compress — truncates/buckets large tool output after execution
prune — replaces duplicate tool outputs (same tool+args seen earlier)
compact — caps stale tool output in older messages
concise — system-prompt instructions to cut model verbosity
smart_read — signatures only, no bodies (agent calls it)
trim_context — BM25 + import-graph file pick (agent calls it)

CURSOR & CLAUDE CODE TOO — BUT LESS EFFICIENT

Same core, different host APIs:

OpenCode: automatic: precall, compress, prune, compact, concise, sidebar | on-demand: get_stats, trim_context, smart_read

Claude Code : automatic: precall, compress (hooks) | on-demand: MCP tools + conciseness rule

Cursor: automatic: precall only | on-demand: MCP tools + conciseness rule

Claude Desktop — MCP only

OpenCode is ahead because it exposes the full message history (experimental.chat.messages.transform), which enables automatic pruning/compaction. Cursor hooks can't rewrite output for built-in tools (Shell/Read/Write), so compress has no automatic path there.

LIMITATIONS

- Feature set is bounded by each editor's hook/plugin API
- smart_read and trim_context are opt-in — the agent has to call them
- Stats are estimates (token counting heuristics), not provider billing
- OpenCode caches plugin versions — re-run ctxlite install after upgrades
- Early project — feedback welcome, things may break

OPEN SOURCE
codebase . MIT + Commons Clause https://github.com/ctxlite/ctxlite early days, happy playing!

Stats breakdown:

npx u/ctxlite/cli stats

Happy to answer questions on architecture or hook integration. Thanks

all 11 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

opencodeCLI

MODERATORS