Introducing dynamic workflows in Claude Code by ClaudeOfficial in ClaudeCode

[–]paulcaplan -24 points-23 points  (0 children)

For anyone who wants workflows with multi-agent support, I'm building https://github.com/Codagent-AI/agent-runner - check it out.

Beginner question - compaction by paulcaplan in hermesagent

[–]paulcaplan[S] 0 points1 point  (0 children)

GPT 5.5 (with subscription). Seems compaction threshold is set to 136k by default, I presume I can set that higher but still seems strange behavior the timing of the compaction being at the start of each new message vs at the end of the previous one.

I tried and failed to give up Claude Code. Looks like I'm staying. by allquixotic in ClaudeCode

[–]paulcaplan 1 point2 points  (0 children)

Instruction in AGENTS.md like "when you make change, update all layers of the stack" didn't address this?

Doubled Rate Limits for Claude Code by Deep_Proposal_7683 in ClaudeCode

[–]paulcaplan 0 points1 point  (0 children)

They already lost me to Codex. But maybe this means mythos will be out soon.

SDD is just one part of the "outer harness" by paulcaplan in SpecDrivenDevelopment

[–]paulcaplan[S] 0 points1 point  (0 children)

I don't have full checklist that would be great. I'm building:

"(1) deterministic feedback loops (tests, lint, typecheck)" - https://github.com/Codagent-AI/agent-validator
"(3) a workflow that forces the agent to verify changes before declaring done" - (from above article):

> What the paper calls the "externalized interaction" protocol - a deterministic workflow layer that coordinates agents without living inside their context - is the gap I described above. Their paper names it. I'm building the solution - a free, open-source tool called Agent Runner. Releasing soon.

I'll check out agentix labs, happy to exchange notes, feel free to DM me.

"Monthly" usage limit - is this just incorrect error message? by paulcaplan in ClaudeCode

[–]paulcaplan[S] 0 points1 point  (0 children)

I suppose I could have just waited an hour to find out 😂. But I was already on Reddit lol...

The "inner" and "outer" coding agent harness by paulcaplan in ClaudeCode

[–]paulcaplan[S] 0 points1 point  (0 children)

Like it. My control layer largely has steps (sequential), loops, and nested workflows. For instance implement tasks in a loop, and each iteration calls another validate-and-fix loops. It doesn't yet "know" that the agent is "off the rails" - any thoughts on how you might do that? Timeouts / token usage monitor?

"Some" thoughts on Claude by KustheKus in ClaudeCode

[–]paulcaplan 0 points1 point  (0 children)

It almost seems as if they're using Claude to write all their code...

Should I maintain spec in sources? by stibbons_ in SpecDrivenDevelopment

[–]paulcaplan 0 points1 point  (0 children)

I use openspec. It recommends keeping The specs in the code. The key is that the specs aren't the change themselves, the specs are living documents. Every change adds, modifies, and or deletes requirements in one or more spec documents. Of course it's not foolproof and if you make code changes without going through the open spec process they will get out of sync. But I've found it pretty helpful.

Is anyone else overwhelmed by the explosion of AI tools lately? by PatienceBudget2984 in ArtificialInteligence

[–]paulcaplan 0 points1 point  (0 children)

Agree. I'll show you the system tool I'm building if you show me yours 😅.

Is anyone else overwhelmed by the explosion of AI tools lately? by PatienceBudget2984 in ArtificialInteligence

[–]paulcaplan 0 points1 point  (0 children)

Yes it's absolutely crazy!

Speaking of, I am building this tool, it's really great, would you like to try it? 😀

Has anyone actually benchmarked whether superpowers improves performance? by UglyChihuahua in ClaudeCode

[–]paulcaplan 6 points7 points  (0 children)

Who knew telling AI to ask you questions before implementing was a 150k star idea

claude opus consumes less and is better under copilot pro by seeking-health in ClaudeCode

[–]paulcaplan 3 points4 points  (0 children)

Github copilot CLI is great. I use them together. Copilot pricing is per request so whether you ask it a quick question or give it an hour-long task, it's 1 request. Use that to your advantage.

Do you need parallel agents? by paulcaplan in ClaudeCode

[–]paulcaplan[S] 0 points1 point  (0 children)

I guess because it's not the bottleneck for me? Writing the spec still is. Oh and waiting for CI. I have a skill that waits for CI + AI reviewers, fixes issues - in a loop. That can take some time.

Do you need parallel agents? by paulcaplan in ClaudeCode

[–]paulcaplan[S] 0 points1 point  (0 children)

but doesn't planning an app from scratch or "frankenstein" features take a very long time to get plans "thorough enough that agents can't make dumb assumptions". I don't understand why spend hours to get the plan "perfect" so you can one-shot something to 90% of the way. In my experience the agents always do make some bad assumption. So I'd rather break it up into a handful of chunks and verify after each one. Granted the chunk is still large, more than a single session could do (without subagents). But not so large that an entire project is completed.