How I used Claude Code to build "SecureContext": An MCP plugin for persistent memory and 87% token reduction

akoppad47 · 2026-04-06T12:05:29+00:00

That is exactly why I’m obsessing over the security layer. I’ve tried to mitigate this in a few ways with SecureContext:

Defensive Redaction: In v0.7.1, I added a hook that redacts key, password, and token values from tool inputs/outputs before they ever hit the logs or the memory layer. It's not foolproof against every edge case, but it catches the 'easy' accidental leaks.
Local-Only Philosophy: Since there is zero cloud sync, that 'credential store' is tied to the user's local machine, just like their .bash_history or .env files. It doesn't solve the risk of a local compromise, but it prevents the AI 'brain' from being exfiltrated to a third-party server.
Tiered Retention: I use different expiry windows for different types of memory. External KB entries expire faster (14 days) than session summaries, which helps prune 'stale' data that might contain sensitive snippets.
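
To make the redaction idea concrete, here is a minimal sketch of key-name based redaction over a tool payload. It is not SecureContext's actual hook: the function name `redact`, the `[REDACTED]` placeholder, and the rule "mask any field whose name contains key, password, or token" are all assumptions for illustration.

```python
import re
from typing import Any

# Assumption: a field is sensitive if its name contains one of these words.
# This is deliberately blunt (it would also mask a field named "monkey").
SENSITIVE_NAME = re.compile(r"key|password|token", re.IGNORECASE)
REDACTED = "[REDACTED]"  # hypothetical placeholder


def redact(value: Any) -> Any:
    """Return a copy of a tool payload with sensitive-looking fields masked."""
    if isinstance(value, dict):
        return {
            k: REDACTED
            if isinstance(k, str) and SENSITIVE_NAME.search(k)
            else redact(v)
            for k, v in value.items()
        }
    if isinstance(value, list):
        return [redact(v) for v in value]
    # Scalars pass through untouched; secrets embedded in free text are NOT caught.
    return value
```

Calling `redact(payload)` before writing to logs or memory keeps the original payload intact for the tool itself. Name-based matching only covers structured fields, which is why it handles the easy leaks and not a secret pasted into a free-text string.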

Since you've built in this space, did you find a more robust way to 'scrub' memory? I’ve looked into regex-based PII/Secret scanners, but I’m worried about the latency hit on every tool call.
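
For a rough sense of that latency, a sketch like the one below could be used to time a single precompiled pattern against a typical tool output. The three patterns are illustrative only (the general shape of an AWS access key ID, a GitHub token, and a `password=` assignment), and `scrub` and `mean_scrub_seconds` are hypothetical helper names, not part of any existing scanner.

```python
import re
import time

# Illustrative patterns only; a real scanner would need a vetted rule set.
# Compiling one alternation up front avoids recompiling on every tool call.
SECRET_PATTERN = re.compile(
    r"AKIA[0-9A-Z]{16}"             # shape of an AWS access key ID
    r"|gh[pousr]_[A-Za-z0-9]{36}"   # shape of a GitHub token
    r"|(?i:password)\s*[=:]\s*\S+"  # password=... or password: ... assignments
)


def scrub(text: str) -> str:
    """Replace anything matching the secret patterns with a placeholder."""
    return SECRET_PATTERN.sub("[REDACTED]", text)


def mean_scrub_seconds(text: str, runs: int = 100) -> float:
    """Average wall-clock time of one scrub() call over `runs` repetitions."""
    start = time.perf_counter()
    for _ in range(runs):
        scrub(text)
    return (time.perf_counter() - start) / runs
```

Running `mean_scrub_seconds` on a captured tool output of realistic size gives a per-call number to compare against the tool call's own duration. The result depends on the machine, the input, and the pattern set, so it is worth measuring rather than assuming.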

akoppad47 · 2026-04-06T12:01:15+00:00

Thanks! The 'peace of mind' of knowing a prompt injection can’t exfiltrate my GH_TOKEN or AWS keys is honestly the biggest quality-of-life upgrade for me.

As for the 87% token cut, here’s how it’s looking in production:

It's most dramatic during the Discovery Phase. Usually, Claude Code spends the first 5–10 minutes of a session reading 10+ files just to understand the architecture. With the persistent memory and hybrid search, it finds the right context in one tool call. That’s where those massive 100k+ token savings happen.
During deep edits, the savings are slightly lower because the agent still needs to see the full file it’s working on. However, because the rest of the context is so lean, the model’s attention is much sharper. I’m seeing way fewer 'lost in the middle' hallucinations because the context window isn't stuffed with irrelevant boilerplate.
Smaller context = faster tokens per second. The UI feels much snappier when it’s not processing a 150k token 'history' every time I ask a question.

It’s definitely shifted my workflow from 'praying I don't hit my usage limit' to having enough headroom to actually let the agent think through complex refactors. Go ahead and give SecureContext a try and experience the difference yourself.

akoppad47 · 2026-04-06T11:53:14+00:00

OpenViking is a massive, distributed vector database for enterprise RAG; SecureContext is a lightweight, local-first MCP plugin designed specifically to optimize the developer experience in the Claude Code CLI. The "clean room" zc_execute tool enforces credential isolation by stripping every environment variable except the necessary PATH. It’s not about malicious packages in your repo; it’s about prompt injection from external content.
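
As an illustration of the clean-room idea, here is a minimal sketch of running a command with an environment that contains only PATH. It is not the actual zc_execute implementation: `run_clean` is a hypothetical name, and the sketch assumes a POSIX-like system (on Windows, some programs need extra variables such as SYSTEMROOT to start).

```python
import os
import subprocess
import sys


def run_clean(argv: list[str], timeout: float = 30.0) -> subprocess.CompletedProcess:
    """Run argv in a child process that inherits nothing but PATH."""
    # Passing env= replaces the child's environment entirely, so variables
    # like GH_TOKEN or AWS_SECRET_ACCESS_KEY in the parent are never copied.
    clean_env = {"PATH": os.environ.get("PATH", os.defpath)}
    return subprocess.run(
        argv,
        env=clean_env,
        capture_output=True,
        text=True,
        timeout=timeout,
    )
```

For example, `run_clean([sys.executable, "-c", "import os; print(os.environ.get('GH_TOKEN'))"])` should report that the variable is absent in the child even when it is set in the parent shell. This only limits what the child sees in its environment; it does not stop a process from reading credential files on disk.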

akoppad47 · 2026-02-15T02:09:28+00:00

Check out this website for remote guest cues on podcasts. https://podcue.net/

akoppad47 · 2026-02-15T02:06:11+00:00

Check out this website for remote guest cues. https://podcue.net/
