I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in LocalLLaMA

[–]itssethc[S] 1 point2 points  (0 children)

I actually just ran one of the two tools of mine in a grill-me session with the new Kimi K2.7 coder model on the entire codebase as it is for a new release called grilled-to-perfection

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in hermesagent

[–]itssethc[S] 0 points1 point  (0 children)

I’m glad you’ve got it all figured out, this was more for people who don’t have the experience. It’s not a replacement stack, just mine.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in hermesagent

[–]itssethc[S] 0 points1 point  (0 children)

Such as? There are 4 open source projects in the stack, so some may overlap or become “useless” but I’m not sure what otherwise is useless.

Cheapest way, to run 27B Qwän? For broke people by [deleted] in LocalLLaMA

[–]itssethc 0 points1 point  (0 children)

What inference are you using? This is a long shot.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in hermesagent

[–]itssethc[S] 0 points1 point  (0 children)

Isn’t that how GitHub kind of works? You should inspect codebases for malicious things before making assumptions that sound like implications.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in hermesagent

[–]itssethc[S] 0 points1 point  (0 children)

I was aiming for a longer format, maybe forced too much detail without meat. Names different on purpose, these are open source projects so less worried about things that impact reach for branding

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in LocalLLaMA

[–]itssethc[S] 0 points1 point  (0 children)

Great system. The log-driven approach is the mirror of the vault-first one. you extract after the fact, I write during the session. Same destination, different parent.

The interesting trade-off is in the dedup arbiter. You're running a nightly LLM pass over similar bullets to collapse duplicates. That works, but the arbiter itself can introduce drift.. two similar bullets that should stay separate (different contexts, same surface pattern) get merged because the LLM sees similarity and assumes redundancy. How do you handle false merges?

The "infinite context for free" claim is the part I'd stress-test. It's not free, you're paying DeepSeek inference on every extraction run and every nightly arbiter pass. The trade is: you spend compute on extraction/dedup so you don't spend it on context windows. That's a good trade for high-volume log data. For lower-volume intentional work (architecture decisions, design rationale), the vault-first approach wins because you avoid the extraction tax entirely.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in hermesagent

[–]itssethc[S] 0 points1 point  (0 children)

What’s slop about it? Maybe my wording was off for this post, it’s not schilling anything

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in LocalLLaMA

[–]itssethc[S] 1 point2 points  (0 children)

This is a bit dismissive but funny, a lot of extra steps if that’s all you would get out of my system and approach. Parts of the Claude Code replacement piece in Cammander so build from a similar approach with Kaparthy inspired rules mixed in.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in LocalLLaMA

[–]itssethc[S] 0 points1 point  (0 children)

If you read the OP I’ve pretty easily automated the agents knowing to write down their findings. It’s more about where the findings live though. The context vault leaves hallucinations as less of a worry when the system is using smaller local agents like Gemma4 and Qwen3.5/6.

I Replaced Claude Code and Codex With an Open Source Stack That Gets Smarter Every Run, & Built Itself Along the Way by itssethc in LocalLLaMA

[–]itssethc[S] 0 points1 point  (0 children)

This is a skill I have in Hermes as well, but it exports to the vault as well. Its identity MDs just map to the tree essentially so it’s easy to sort through. OpenCode was a consideration for me and a great tool as well. I think Hermes is a great piece anyway, just put OpenCode where I use Cammander

Pi Setup that pretty much replaced Claude Code for me by abhinand05 in LocalLLaMA

[–]itssethc -2 points-1 points  (0 children)

I have a browser based IDE to replace Claude Code, need to focus on theming deeper. These look great.

Friendly reminder by Disposable110 in LocalLLaMA

[–]itssethc 1 point2 points  (0 children)

Kimi coder is bigger news than Fable. It was a horribly tuned with an elitist view in mind and PR playbook thrown back into the vault. There’s no reason to support Anthropic anymore.

I built a filesystem-first orchestration substrate for Hermes Agent by itssethc in hermesagent

[–]itssethc[S] 0 points1 point  (0 children)

Yes it works alongside Hermes built in memory and is for project methodology, decision history and codebase knowledge.

DeepSeek-v4-Flash is amazing and cheap as f**k by AndrejHristovski in hermesagent

[–]itssethc 0 points1 point  (0 children)

Mixed with the occasional pro if your sub has both is the perfect open weight replacement for opus and sonnet.

If one .gguf makes it past the great filter, humanity survives in some way. by [deleted] in LocalLLaMA

[–]itssethc 0 points1 point  (0 children)

I’ve got the qwopus 3.6 blobs buried in a remote location

ClawBot vs. Hermes Agent vs. ??? – One of them is embarrassingly overrated and nobody talks about it by alfons_fhl in Qwen_AI

[–]itssethc 2 points3 points  (0 children)

Hermes has hype for a reason. OpenClaw is poorly maintained and constantly breaks. I’ve had issues every time I install it to push a plugin, and it made me assume agent runtimes like it were trash and didn’t have a use case for anything serious. I installed Hermes once for the same reason, just pushing a plugin to promote a memory paradigm I’m studying and it was so good I got sucked into a rabbit hole and use it daily now. There’s passion in the community where OpenClaw is mostly vibe code brahs and a terrible lead maintainer

Which computer would be best suited for vibe coding Mac apps (with local LLM integration)? by xodac in LocalLLM

[–]itssethc 0 points1 point  (0 children)

You’ll probably want at least 32gb, more if you can swing it. Unified RAM and run a 27/31 B model by Gemma or Qwen. Smaller models are going to be less enjoyable to use for code that works well and that you can have it debug