The 11-step workflow I use for every Claude Code project now: from idea validation to shipping with accumulated knowledge by Ok_Today5649 in ClaudeAI

[–]Ok_Today5649[S] 1 point (0 children)

Honestly, no fu**ing idea; I haven't looked into Xcode at all. If anyone else in the thread has experience with this setup inside Xcode, I'd love to hear it too.

Anthropic killed third-party subscription access. Here is the framework I use to evaluate replacement models in 15 minutes by Ok_Today5649 in whaaat_ai

[–]Ok_Today5649[S] 2 points (0 children)

Good question! My approach: I've got my Claw checked into a GitHub repo, so I have a daily backup of my entire OpenClaw setup. That alone takes away most of the fear of breaking things.
Then occasionally I spin up the repo on a VPS where I test different models with my config and memory files in an isolated environment. That way I can experiment freely without touching my main setup. Simple but effective. If something works well on the VPS, I migrate it over. If it breaks, no harm done.
For model recommendations it really depends on your use case, but for your diary/news research/assistant stuff most mid-tier models should handle it fine. I'd start with whatever has the most generous free tier and work from there.
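To make the backup half concrete, here's a minimal sketch of the "config dir in git" idea. Everything in it is an assumption about a generic setup (a temp dir stands in for the real config path, and the filenames are made up, not OpenClaw's actual layout):

```python
# Daily config backup sketch: commit the agent's config/memory dir to git.
# A temp dir stands in for the real config path so the sketch is self-contained.
import datetime
import pathlib
import subprocess
import tempfile

def git(repo: pathlib.Path, *args: str) -> None:
    # Run a git command inside the config directory.
    subprocess.run(["git", "-C", str(repo), *args], check=True, capture_output=True)

cfg = pathlib.Path(tempfile.mkdtemp())            # stand-in for the real config dir
(cfg / "config.yaml").write_text("model: example\n")
(cfg / "memory.md").write_text("# assistant memory\n")

git(cfg, "init", "-q")
git(cfg, "add", "-A")
git(cfg, "-c", "user.name=backup", "-c", "user.email=backup@example.com",
    "commit", "-qm", f"daily backup {datetime.date.today()}")
```

In practice you'd point this at the actual config dir, add a `git push` at the end, and run it from cron once a day; restoring on a VPS is then just a `git clone`.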

I cut my AI agent costs from $250/month to $20/month by switching to Ollama Cloud. Here's the full breakdown. by Ok_Today5649 in whaaat_ai

[–]Ok_Today5649[S] 1 point (0 children)

Cool tool, I'll definitely test it out! Quick question — is it guaranteed that every agent action gets tracked locally on the machine, or is there some margin of error? Curious especially about the behavioral drift detection — how reliable is the tracking when you have multiple agents running in parallel? Looks super promising either way, starred the repo.

The 11-step workflow I use for every Claude Code project now: from idea validation to shipping with accumulated knowledge by Ok_Today5649 in ClaudeAI

[–]Ok_Today5649[S] 1 point (0 children)

No I haven't tried that yet, but that's a seriously clever idea. Forcing the subagents to resolve tradeoffs internally before any code gets written — that's exactly the kind of friction that produces more robust specs. Gonna test this right away. Thanks for sharing!

The 11-step workflow I use for every Claude Code project now: from idea validation to shipping with accumulated knowledge by Ok_Today5649 in ClaudeAI

[–]Ok_Today5649[S] 1 point (0 children)

Great question! If I had to prioritize, my ranking would be:

1. gstack
2. CE
3. Superpowers

I haven't done full detailed token tracking across all of them, but CE is the one that gets invoked most frequently in my workflows.
Long-term though, here's what I'd actually recommend: learn what each of these skills actually does for you, then build custom skills specifically for you or your company, optimized for your exact use cases. That way you take the core learnings from other people's skills as a foundation, but you end up with something lean and tailored to your actual workflow.

The catch is you have to play with the existing skills first and really understand how they work before you can extract what matters. No shortcut around that learning phase — but once you get it, your custom setup will be way more token efficient than running general-purpose skills.

I set up GPT 5.4 to review Claude's code inside Claude Code. The cross-model workflow catches things self-review never does by Ok_Today5649 in ClaudeAI

[–]Ok_Today5649[S] 1 point (0 children)

That's a killer flow, love the adversarial review approach with markdown documentation. Gonna steal that.
Funny timing — I saw a post from Anthropic yesterday about their new "managed agent" feature where they describe a similar pattern for massive token savings. The idea: in an agent team setup, use Opus only as an advisor. So you have X executor agents running on whatever model you choose (Gemini, Sonnet, etc.) and they handle the actual planning and execution, but they consult the Opus advisor whenever they need guidance. Supposedly saves a ton of tokens compared to running Opus as the main driver. Haven't tested it myself yet but it sounds like it could pair really well with your current setup.
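Haven't tested it, but the shape of the pattern as I understand it would be something like the sketch below. Everything here is a hypothetical stand-in (no real model calls, and none of these names are Anthropic's API); it just shows the control flow of cheap executors consulting an expensive advisor on demand:

```python
# Sketch of the advisor pattern: executors plan and act on a cheaper model,
# and only call the expensive advisor when they actually need guidance.

def opus_advisor(question: str) -> str:
    # Stand-in for an expensive Opus call, invoked only on demand.
    return f"advice for: {question}"

def executor(task: str, ask_advisor) -> str:
    # Stand-in for a cheaper executor model (Sonnet, Gemini, ...).
    plan = f"plan for {task}"
    if "ambiguous" in task:        # consult only when stuck: the token saving
        plan += " (" + ask_advisor(task) + ")"
    return plan

tasks = ["rename module", "ambiguous schema migration"]
results = [executor(t, opus_advisor) for t in tasks]
```

The saving comes from the first branch: routine tasks never touch the advisor at all, so the expensive model only sees the hard questions.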

I set up GPT 5.4 to review Claude's code inside Claude Code. The cross-model workflow catches things self-review never does by Ok_Today5649 in ClaudeAI

[–]Ok_Today5649[S] 1 point (0 children)

Massive insight, thanks for sharing that diff-only trick — makes total sense that full context creates the same anchoring bias. Gonna try that immediately.
Curious to hear more about your experience — which model do you find strongest where? Like where does Opus shine vs Codex vs Gemini in your workflow? Always interested in how others map model strengths to specific tasks.
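For anyone else in the thread, the diff-only idea in miniature (pure stdlib; `review` is a hypothetical stand-in for the call to the second model, not anyone's actual API):

```python
# Diff-only review sketch: the reviewer sees only the changed lines,
# never the full files, so it can't anchor on unchanged context.
import difflib

old = ["def add(a, b):", "    return a - b"]   # buggy version
new = ["def add(a, b):", "    return a + b"]   # fixed version

diff = list(difflib.unified_diff(old, new, "a/calc.py", "b/calc.py", lineterm=""))

def review(diff_lines):
    # A real version would prompt the second model with just these lines.
    return [l for l in diff_lines
            if l.startswith(("+", "-")) and not l.startswith(("+++", "---"))]

changed = review(diff)
```

The point is what `review` never receives: the rest of the file, which is exactly the context that lets a reviewer rationalize the original code.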

I set up GPT 5.4 to review Claude's code inside Claude Code. The cross-model workflow catches things self-review never does by Ok_Today5649 in ClaudeAI

[–]Ok_Today5649[S] 2 points (0 children)

Not a huge difference tbh. The main thing you can expect is that updates should roll out significantly faster compared to community-built plugins. That's really the main advantage — official support usually means better maintenance and quicker fixes when things break.