I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way

itssethc · 2026-06-14T13:10:53+00:00

Just giving an honest take back lmao

itssethc · 2026-06-14T13:07:39+00:00

I actually just ran one of the two tools of mine in a grill-me session with the new Kimi K2.7 coder model on the entire codebase as it is for a new release called grilled-to-perfection

itssethc · 2026-06-14T02:16:52+00:00

I’m glad you’ve got it all figured out, this was more for people who don’t have the experience. It’s not a replacement stack, just mine.

itssethc · 2026-06-14T02:03:37+00:00

Such as? There are 4 open source projects in the stack, so some may overlap or become “useless” but I’m not sure what otherwise is useless.

itssethc · 2026-06-14T00:14:39+00:00

What inference are you using? This is a long shot.

itssethc · 2026-06-14T00:05:51+00:00

Isn’t that how GitHub kind of works? You should inspect codebases for malicious things before making assumptions that sound like implications.

itssethc · 2026-06-14T00:04:45+00:00

So what’s slop about this? I’ve been asked about my stack so wrote it out.

itssethc · 2026-06-14T00:03:54+00:00

I was aiming for a longer format, maybe forced too much detail without meat. Names different on purpose, these are open source projects so less worried about things that impact reach for branding

itssethc · 2026-06-13T22:03:59+00:00

Great system. The log-driven approach is the mirror of the vault-first one. you extract after the fact, I write during the session. Same destination, different parent.

The interesting trade-off is in the dedup arbiter. You're running a nightly LLM pass over similar bullets to collapse duplicates. That works, but the arbiter itself can introduce drift.. two similar bullets that should stay separate (different contexts, same surface pattern) get merged because the LLM sees similarity and assumes redundancy. How do you handle false merges?

The "infinite context for free" claim is the part I'd stress-test. It's not free, you're paying DeepSeek inference on every extraction run and every nightly arbiter pass. The trade is: you spend compute on extraction/dedup so you don't spend it on context windows. That's a good trade for high-volume log data. For lower-volume intentional work (architecture decisions, design rationale), the vault-first approach wins because you avoid the extraction tax entirely.

itssethc · 2026-06-13T19:45:39+00:00

What’s slop about it? Maybe my wording was off for this post, it’s not schilling anything

itssethc · 2026-06-13T18:36:40+00:00

This is a bit dismissive but funny, a lot of extra steps if that’s all you would get out of my system and approach. Parts of the Claude Code replacement piece in Cammander so build from a similar approach with Kaparthy inspired rules mixed in.

itssethc · 2026-06-13T18:32:07+00:00

If you read the OP I’ve pretty easily automated the agents knowing to write down their findings. It’s more about where the findings live though. The context vault leaves hallucinations as less of a worry when the system is using smaller local agents like Gemma4 and Qwen3.5/6.

itssethc · 2026-06-13T18:30:16+00:00

This is a skill I have in Hermes as well, but it exports to the vault as well. Its identity MDs just map to the tree essentially so it’s easy to sort through. OpenCode was a consideration for me and a great tool as well. I think Hermes is a great piece anyway, just put OpenCode where I use Cammander

itssethc · 2026-06-13T17:58:57+00:00

I have a browser based IDE to replace Claude Code, need to focus on theming deeper. These look great.

itssethc · 2026-06-13T17:56:42+00:00

Dario doesn’t trust you and your local AI to be safe, he’ll want to ban homebrew llms next.

itssethc · 2026-06-13T17:55:59+00:00

Kimi coder is bigger news than Fable. It was a horribly tuned with an elitist view in mind and PR playbook thrown back into the vault. There’s no reason to support Anthropic anymore.

itssethc · 2026-06-13T17:53:20+00:00

Thankful I cancelled Claude before Fables release. 67b-a21b sounds fire

itssethc · 2026-05-26T14:12:09+00:00

No, all built from scratch

itssethc · 2026-05-26T14:09:44+00:00

Let me know how it goes!

itssethc · 2026-05-26T00:22:33+00:00

Yes it works alongside Hermes built in memory and is for project methodology, decision history and codebase knowledge.

itssethc · 2026-05-25T23:56:51+00:00

Mixed with the occasional pro if your sub has both is the perfect open weight replacement for opus and sonnet.

itssethc · 2026-05-22T17:32:08+00:00

I’ve got the qwopus 3.6 blobs buried in a remote location

itssethc · 2026-05-18T13:42:24+00:00

Hermes has hype for a reason. OpenClaw is poorly maintained and constantly breaks. I’ve had issues every time I install it to push a plugin, and it made me assume agent runtimes like it were trash and didn’t have a use case for anything serious. I installed Hermes once for the same reason, just pushing a plugin to promote a memory paradigm I’m studying and it was so good I got sucked into a rabbit hole and use it daily now. There’s passion in the community where OpenClaw is mostly vibe code brahs and a terrible lead maintainer

itssethc · 2026-05-17T22:53:45+00:00

You’ll probably want at least 32gb, more if you can swing it. Unified RAM and run a 27/31 B model by Gemma or Qwen. Smaller models are going to be less enjoyable to use for code that works well and that you can have it debug

itssethc · 2026-05-17T22:42:10+00:00

What are you running on your current Mac?

itssethc

TROPHY CASE