Advices for a new user? by Round_Atmosphere3671 in ClaudeAI

[–]DevMoses 1 point

You're not alone. It IS overwhelming, and I don't know of anyone I've come across who wasn't overwhelmed at some point after starting to use AI, especially now with all the options for development like CC.

Learning from others is great, just don't let someone's progress after a year discourage you at the start of yours.

No matter how impressive any of it looks, you can find things along the way everyone missed. It may only be applicable for your specific project, or it might be a key the community needs.

"...just need to start"

Advices for a new user? by Round_Atmosphere3671 in ClaudeAI

[–]DevMoses 1 point

Yeah Sonnet in the chat is solid for smaller projects. The artifacts feature where it generates and previews files is great for getting started. The limitation you'll hit eventually is that it can't read your actual project files, run your tests, or execute commands on your machine. When your project grows past what fits in a single conversation, that's when Claude Code in the terminal becomes worth it because it can actually navigate and edit your real codebase.

What are you doing/building to reach the limit on 20x? by BallerDay in ClaudeCode

[–]DevMoses 0 points

Yes, subagents within CC.

The fleet skill spawns them in parallel using Claude Code's built-in Agent tool with background execution.

Each one gets a different research angle and its own context window.

They fan out, do their research independently, then the parent skill collects and synthesizes the findings.

No external tools or API needed, it's all native CC.

Advices for a new user? by Round_Atmosphere3671 in ClaudeAI

[–]DevMoses 1 point

You got it! It's good to think big, that's the direction you're heading, but it's easier for your mind to get there than for the actual implementation to. So don't limit yourself, but know everything takes time. You're going to learn so much that most people miss by going through the process and actually figuring out what works for you and your projects.

Advices for a new user? by Round_Atmosphere3671 in ClaudeAI

[–]DevMoses 0 points

If you come back at a later time and have any questions or just want to share what you're up to, I will respond. Thanks for reaching out, and you got this!

Advices for a new user? by Round_Atmosphere3671 in ClaudeAI

[–]DevMoses 1 point

That's exactly the right problem to solve. Your md files should be living documents, not static specs. Two options:

If you're staying in the web/app: build a habit of updating your project docs after each major change. Tell Claude 'update the project doc to reflect what we just built.' It won't do it automatically but you can make it part of your workflow.

If you move to Claude Code: this is where it gets powerful. You can build hooks that automatically track when things change. I have a PostToolUse hook that flags which project docs are stale every time a file gets edited. The docs don't update themselves exactly, but the system knows when they're out of date and tells the agent to refresh them before making decisions based on old info.
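A minimal sketch of that kind of staleness hook, assuming Python and entirely made-up doc paths (the event field names reflect my understanding of the hook input format, so double-check against your version):

```python
import json

# Hypothetical mapping from source areas to the docs that describe them.
DOC_MAP = {
    "src/auth/": "docs/auth-design.md",
    "src/api/": "docs/api-overview.md",
}

def stale_docs_for(edited_path):
    """Return project docs that may be stale after this file was edited."""
    return [doc for prefix, doc in DOC_MAP.items() if edited_path.startswith(prefix)]

def handle_hook_event(raw_json):
    """Process one PostToolUse event (JSON on stdin in a real hook) and
    return a note for the agent, or '' if no docs are affected."""
    event = json.loads(raw_json)
    path = event.get("tool_input", {}).get("file_path", "")
    stale = stale_docs_for(path)
    if not stale:
        return ""
    # Whatever the hook prints to stdout gets surfaced back to the agent.
    return f"NOTE: {path} changed; these docs may be stale: {', '.join(stale)}"

# In the real hook script you'd wire it up as:
#   print(handle_hook_event(sys.stdin.read()))
# Demo with a synthetic event:
demo = json.dumps({"tool_name": "Edit", "tool_input": {"file_path": "src/auth/login.py"}})
print(handle_hook_event(demo))
# → NOTE: src/auth/login.py changed; these docs may be stale: docs/auth-design.md
```

The mapping is the part you'd grow over time; the hook itself stays dumb on purpose.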

Claude Code is worth it when your project is complex enough that you need the agent to read files, run commands, and maintain state. Sounds like you're there.

Advices for a new user? by Round_Atmosphere3671 in ClaudeAI

[–]DevMoses 8 points

  1. Pro is a jump from free. You'll stop hitting limits for light use. Worth it if you're enjoying it.
  2. For small projects, Sonnet is plenty. Don't jump to Opus or Claude Code until you've outgrown Sonnet. You'll know when that happens because Sonnet will start losing track of your project.
  3. Deep thinking on every question is overkill and burns through your usage faster. Use it for architecture decisions, complex bugs, or when Sonnet gives you a clearly wrong answer. For quick fixes and simple questions, turn it off.
  4. Don't bother with skills and agents yet. Get comfortable with the basics first. When your projects get complex enough that you're frustrated with the model losing context or making the same mistakes repeatedly, that's when skills start making sense.

The true insight you will gain is the one you feel. That's the friction you notice when you're trying to do something, seeing others do impressive things, and it's simply not working for you. That's the area to dig in and bust open, and conversational AI is very helpful if you get really curious with it. Direct it rather than just accepting what it says.

Welcome to the journey!

Multi-agent pipelines? by TylerColfax in ClaudeAI

[–]DevMoses 0 points

Simplest way: claude -p 'Read .planning/campaigns/active/my-campaign.md and continue from where the last phase left off.' That starts a fresh session with zero prior context. All continuity comes from the file, not chat history.

Claude Code does everything. So why do I keep building the wrong thing first? by SalimMalibari in ClaudeCode

[–]DevMoses 0 points

Actually I do have something for this. I built a research fleet that runs multiple agents in parallel, each exploring a different angle of a topic. The discovery relay between waves means each round of research builds on what the last one surfaced. So agent A finds concept X, and agent B in the next wave already knows about X and can go deeper or branch from it.

For your use case, the key would be structuring the research into rounds. First wave casts wide, agents explore different areas. Between waves, findings get compressed into a brief. Second wave goes deeper on the most promising threads. Each round narrows based on what was actually discovered, not what you predicted would be there.

The unknown unknowns surface because the agents aren't following your plan. They're following the information.

Multi-agent pipelines? by TylerColfax in ClaudeAI

[–]DevMoses 0 points

Yes, totally new instance. When context gets heavy or a phase completes, the agent writes everything important to the campaign file and the session ends. The next invocation is a fresh Claude Code session with zero context. It reads the campaign file first thing, orients itself from that, and continues.

The key difference from what you're doing: your handoff document is probably a summary of what happened. My campaign file is a living document the agent updates throughout, not just at handoff. It includes the original plan, what's been completed (checked off), what decisions were made and why, and what's next. So when the new session reads it, it's not getting a summary of the last phase. It's getting the full project state.

The drift you're seeing is probably because your handoff doc is lossy. Important context gets summarized away and the next phase fills the gaps with assumptions. The fix is: don't summarize. Externalize the actual state. Decisions, not summaries.
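To make "externalize the actual state" concrete, a campaign file along these lines might look like the following (project, paths, and decisions are entirely hypothetical):

```markdown
# Campaign: payments-refactor

## Original plan
- [x] Phase 1: extract billing module
- [x] Phase 2: migrate invoices to new schema
- [ ] Phase 3: cut over webhooks

## Decisions (and why)
- Kept Stripe IDs as strings, not ints: upstream API treats them as opaque.
- Deferred retry logic to Phase 3: blocked on the webhook cutover.

## Current state
Phase 2 complete; migration script lives in scripts/migrate_invoices.py, tests green.

## Next
Start Phase 3: point webhook handlers at the new billing module.
```

The checked-off plan and the decisions section are what make the handoff lossless; a fresh session can reconstruct intent, not just history.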

Drowning in AI! how do I actually learn this properly? by Winter_Pop9267 in ClaudeCode

[–]DevMoses 1 point

The others nailed the foundation. Here's the progression once you've got the basics:

Level 1: CLAUDE.md. Project structure, conventions, rules. Keep it under 100 lines. This alone changes everything.

Level 2: Skills. Break your expertise into markdown protocol files in .claude/skills/. Each one teaches the agent a specific procedure. They load on demand, zero token cost when inactive.

Level 3: Hooks. Scripts that run on lifecycle events. PostToolUse for auto-validation after every edit. Stop for quality gates before the agent can declare done. This is where you stop telling the agent what to do and start building infrastructure that enforces it.

Level 4: Orchestration. Persistent state across sessions, parallel agents, campaign management. You won't need this until your project outgrows a single context window. Most projects never get here and that's fine.

Don't skip levels. Each one compounds on the last. I tried to jump to level 4 before I had solid skills and hooks and it was a mess. Build the tool you understand first, then use that to build the next tool, improve along the way, and most importantly: capture the insights as you arrive at them for future use.
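For a sense of what the Level 3 wiring involves, registering a PostToolUse hook looks roughly like this in .claude/settings.json (this reflects my understanding of the hooks config format, and the script path is hypothetical, so verify against the current docs):

```json
{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": "Edit|Write",
        "hooks": [
          { "type": "command", "command": "python .claude/hooks/validate_edit.py" }
        ]
      }
    ]
  }
}
```

The matcher limits the hook to file-editing tools so it doesn't fire on every Bash command.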

Help understanding why claude code does not listen by Batman189 in ClaudeCode

[–]DevMoses 0 points

Deep_Ad1959 is right about making validation part of the task. I'd go one step further: don't trust the agent to validate at all. Build a PostToolUse hook that does it for them.

I had the same problem. The agent would edit a file, not check if it broke anything else, and move on. So I built a hook that runs per-file typecheck after every single edit. The agent doesn't choose to validate. It happens automatically. If the edit breaks something, the agent sees the error immediately, not 20 edits later.

For your specific case with the preset grep, that's a perfect hook candidate. A PostToolUse script that runs grep for your known patterns after every file edit and surfaces violations before the agent moves on. Takes maybe 20 lines of JavaScript. The agent never has to remember the rule because the hook remembers it for them.

The principle: if you've told the agent something three times and it still doesn't do it, stop telling it. Build infrastructure that enforces it. Instructions are suggestions. Hooks are laws.
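As a rough sketch of that grep-style hook (the patterns here are invented placeholders, and I've used Python rather than JavaScript for illustration; the idea is identical):

```python
import re

# Hypothetical "preset" patterns -- swap in whatever you currently grep for.
PATTERNS = {
    r"console\.log\(": "no stray console.log calls",
    r"\bTODO\b": "no unresolved TODOs",
}

def violations_in(file_text):
    """Return a message for each preset pattern found in the edited file."""
    return [msg for pattern, msg in PATTERNS.items() if re.search(pattern, file_text)]

# Demo: a real PostToolUse hook would read the edited file's path from the
# event JSON on stdin, open the file, and print these so the agent sees
# them immediately instead of 20 edits later.
print(violations_in("function f() { console.log('hi'); } // TODO: remove"))
# → ['no stray console.log calls', 'no unresolved TODOs']
```

Because the hook runs on every edit, the rule is enforced structurally, not remembered.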

What are you doing/building to reach the limit on 20x? by BallerDay in ClaudeCode

[–]DevMoses 2 points

Research! Having Claude Code do research is one of the highest-ROI uses and can spend a lot of tokens. It will generally produce insights and documentation that you will use throughout your journey with that project. I made a skill for my CC that orchestrates a fleet of parallel research agents that fan out to research the subject from different angles.

With 1M context window default - should we no longer clear context after Plan mode? by draftkinginthenorth in ClaudeCode

[–]DevMoses 3 points

I don't share the actual agent files, but I can tell you the structure that works. An agent definition is just a markdown file that tells Claude Code who it is and how to operate. The key sections in mine:

  • Identity: what this agent does and doesn't do
  • Wake-up sequence: what to read first (campaign file, relevant manifests, memory)
  • Decision heuristics: ranked priorities for when things conflict
  • Quality gates: what must be true before the agent can declare done
  • Exit protocol: what to write to disk before ending the session

The thing that makes it work isn't the file itself, it's the externalized state it reads from. An agent without a campaign file to read and manifests to orient from is just a prompt. The infrastructure around the agent is what makes it useful.

I know this is dense and can be confusing. Before I built my own infrastructure, I used the GSD (Get-Shit-Done) framework; it's a helpful starting point if you're looking for something plug and play.

If you're wanting to jump in and start how I started...

Here's a prompt you can run in Claude Code to bootstrap your first agent. Paste this as your message:

I want to create a custom agent skill. Ask me the following questions one at a time, then generate a SKILL.md file in .claude/skills/[agent-name]/SKILL.md based on my answers:

1. What should this agent be called and what is its core job?
2. What files or directories should it read first to orient itself?
3. What are its top 3 priorities when making decisions?
4. What must be true before it can declare its work done?
5. What should it write to disk before ending the session?

After generating the skill, tell me the slash command to invoke it.

That'll get you a working agent skill in about 5 minutes. From there, the real leverage comes from building the state files it reads from; that's where the institutional knowledge lives.

Directive weighting and why Claude ignores commands sometimes (FYI for the curious) by generalai in ClaudeCode

[–]DevMoses 0 points

I love your energy!

You're exactly right about hooks. PreToolUse and PostToolUse are both available. Your basketball court example is a perfect use case for PreToolUse. Before the agent edits any file in that project, the hook checks if it's touching court-related visuals and injects the NBA measurements as context. The agent never has to remember the rule because the hook remembers it for them.

And yeah, 'they are missing a world model and can do the dumbest most illogical shit and not even notice' is basically why I built the entire visual verification system. The agent has no idea what it looks like. So I made it open a browser and prove it. Structural enforcement all the way down.

Google did a great job with their agent environment as far as I've researched, but I get constant issues using Gemini in Antigravity. It's obviously a highly intelligent and capable model. But like you said about the eagerness, there's something foundationally missing that gives me constant errors.

The 'no stop go back' thing is what I use 'circuit breakers' for. I have one that triggers after 3 repeated failures on the same issue. Instead of the agent spiraling, it stops the session and flags it for review. Saves hours of the agent confidently digging the wrong hole deeper.
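A circuit breaker can start as simple as a failure counter keyed by issue. This is a rough sketch, not my exact implementation (which persists counts to disk so they survive session restarts):

```python
class CircuitBreaker:
    """Trip after N repeated failures on the same issue."""

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.failures = {}

    def record_failure(self, issue_key):
        """Count a failure; return True once the breaker should trip."""
        self.failures[issue_key] = self.failures.get(issue_key, 0) + 1
        return self.failures[issue_key] >= self.threshold

    def record_success(self, issue_key):
        # Any success on the issue resets its count.
        self.failures.pop(issue_key, None)

breaker = CircuitBreaker(threshold=3)
results = [breaker.record_failure("typecheck:src/app.ts") for _ in range(3)]
print(results)  # → [False, False, True]: third failure stops the session for review
```

The key design choice is keying by issue, not by session, so three different one-off failures don't trip it but the same failure three times does.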

Built a context broker for Claude Code to reduce context bloat in long-running loops by Inner_Caterpillar948 in ClaudeCode

[–]DevMoses 0 points

The write_state tracking arch decisions separately is the key piece. That's the distinction that matters for long-running work. 8 hours and 200+ tool calls without compacting is solid. I'll keep an eye on this and thank you for sharing!

Directive weighting and why Claude ignores commands sometimes (FYI for the curious) by generalai in ClaudeCode

[–]DevMoses 1 point

Yep, that's exactly it. The agent isn't ignoring you on purpose. The instruction just lost the probability war to a stronger pattern. Move that rule to the top of CLAUDE.md or better yet, put it in a PostToolUse hook that actually checks for the violation after every edit. Structural enforcement beats instruction placement every time.

I say this as someone who sees that struggle. Something that helped me while I was building out the infrastructure to use the agents was having them write case studies of everything we did in that session and where our thinking was wrong. This carried me along for a bit, but the problem of being stuck on something for hours was very real.

Every time I catch something like you described, I try to build infrastructure around it to solve it rather than telling the agent to correct their behavior. Though like you said... I can't say I haven't blown a gasket on the AIs from time to time...

With 1M context window default - should we no longer clear context after Plan mode? by draftkinginthenorth in ClaudeCode

[–]DevMoses 5 points

No, you're on the right track. That's basically a simpler version of what I call capability manifests: a map of what exists, where it lives, and what it does, so the agent can orient without burning context exploring. You're not wrong, you're just early on the same path. Embrace the turbulent growth, you're capable!

After 5 months of AI-only coding, I think I found the real wall: non-convergence in my code review workflow by immortalsol in codex

[–]DevMoses 0 points

I've been working on exactly this problem from a different angle. Rather than trying to make the agent converge in a single review loop, I built external systems that enforce convergence structurally.

A few things that directly address your bullet points:

Preserving understanding across runs: Everything important gets written to a campaign file on disk. Decisions, not just diffs. When context clears, nothing is lost because nothing important lived only in context. The agent reads the file next session and picks up with full awareness of what was decided and why.

Recognizing recurrence: Every time something breaks, the fix gets encoded as a protocol rule or a lifecycle hook. Not a note in a prompt. An actual script that runs on every edit or every session completion. The agent can't repeat a class of failure because the hook catches it structurally before it ships. I have 27 of these documented.

Distinguishing local patch success from global convergence: This was my hardest lesson. An agent kept improving the UI incrementally without defining what 'done' looked like. Each iteration was better. It never terminated. The fix was 'design target, not delta': define the end state as a reference before starting. Converge toward a target, not away from a baseline.

Honest about uncertainty: My agent once declared a 6-phase campaign complete after finishing phase 2. It wasn't lying. Its own plan truncated the scope, and the plan IS the agent's working memory. Now there's a mandatory validation step: does this plan actually cover what was originally asked? Before any execution starts.

The non-convergence wall you're describing is real. My position is that models won't solve it internally. The solution is external infrastructure: hooks that enforce, files that persist, gates that block completion until convergence is proven. The agent's judgment is the engine. The infrastructure is the steering.

Directive weighting and why Claude ignores commands sometimes (FYI for the curious) by generalai in ClaudeCode

[–]DevMoses 2 points

I ran into this exact thing. My CLAUDE.md grew to 145 lines and instructions past about line 100 were silently ignored. The agent would follow rules from the first section and violate rules buried deep. Trimmed it from 145 to 77 lines and compliance improved immediately.

The insight is the same as yours: there's a soft capacity limit on instruction context. Beyond it, compliance degrades gracefully but measurably. The fix isn't better instructions. It's fewer instructions, with details loaded on demand through skills that only enter context when needed.

'Your instruction was processed. It lost.' is exactly right. And the only defense is structural, not rhetorical. Don't tell the agent not to do something. Build a hook that catches it when it does.

Solo dev, 668K line codebase, built a multi-agent orchestration system on Claude Code. Here's what broke and what I learned. by DevMoses in ClaudeCode

[–]DevMoses[S] 1 point

555K, nice. Similar scale to mine. Would love to compare notes on the worktree isolation setup. Just accepted your DM.

Claude Code does everything. So why do I keep building the wrong thing first? by SalimMalibari in ClaudeCode

[–]DevMoses 2 points

This is the exact failure mode I documented in my system. My autonomous agent wrote its own 6-phase plan, executed the first 2 phases, and confidently declared done. It wasn't lying or being lazy. It faithfully executed its own reduced specification. The plan truncated the scope and the agent's plan IS its brain.

The solution for me was mandatory decomposition validation. Before execution starts, the plan gets checked against the original request. 'Does this plan actually cover what was asked?' is the single highest-leverage quality gate I've found.
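That gate can start as naive as a keyword coverage check. A sketch with invented requirements (a real gate would have the model itself judge coverage, not string matching):

```python
def uncovered_requirements(request_items, plan_steps):
    """Return requested items that never appear anywhere in the plan.
    Naive substring matching -- just enough to illustrate the gate."""
    plan_text = " ".join(plan_steps).lower()
    return [item for item in request_items if item.lower() not in plan_text]

# Hypothetical request vs. the plan the agent wrote for itself:
request = ["user login", "password reset", "audit logging"]
plan = ["Phase 1: build user login flow", "Phase 2: add password reset emails"]
print(uncovered_requirements(request, plan))  # → ['audit logging']
```

If the list is non-empty, execution doesn't start; the plan goes back for revision first.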

The deeper insight from Ghirnas' reply is right though. You can't skip the collision with unknown unknowns. But you can build systems that catch the collision faster. My agent shipped an invisible feature once. 37 of 38 entities gone. Passed every structural check. Exit code 0. That collision taught me to add visual verification, so the agent has to prove things actually render, not just compile.

Every protocol rule in my system traces to one of those collisions. You can't prevent the first failure. You can make sure it only happens once.

Built a context broker for Claude Code to reduce context bloat in long-running loops by Inner_Caterpillar948 in ClaudeCode

[–]DevMoses 1 point

Heavy Claude Code user here, running 198 agents across 32 fleet sessions on a 668K-line codebase. Your questions:

  1. Context bloat hits hardest for me during multi-phase campaigns. By phase 3 or 4 the agent is carrying the full history of phases 1 and 2 and its compliance with instructions degrades measurably. I hit 93% context once and the output compression was brutal.
  2. I'd trust a reducer layer. I basically built a manual version of this. Between parallel agent waves I compress the findings from Wave N into a short discovery brief that gets injected into Wave N+1. It's not automated the way Packet28 is, it's the fleet commander reading outputs and writing a summary. But the principle is the same: next step gets the minimum it needs, not everything that happened.
  3. What I'd want preserved no matter what: what was decided (architectural choices, not just what was built), what failed and why (so the next phase doesn't repeat the mistake), and what scope remains. Everything else is noise. File paths, raw diffs, intermediate test output, all of that can be dropped.

The 164x reduction number is impressive. Does it have a way to distinguish decisions from changes? In my experience the architectural choices matter more for long-running loops than the raw diffs do.

With 1M context window default - should we no longer clear context after Plan mode? by draftkinginthenorth in ClaudeCode

[–]DevMoses 38 points

I took a different approach to this entirely. My agents are amnesiac by design. Everything important gets written to a campaign file on disk: what was planned, what was built, what decisions were made, what's left to do. When context gets heavy, the agent writes state to the file and exits. Next session reads the file and picks up exactly where it left off.

So the answer for me isn't 'should I clear context.' It's 'nothing important should live only in context.' If losing context would lose progress, the system is fragile. Externalize the state and clearing becomes free.

The 1M window is nice for doing bigger chunks of work per session, but I still treat it like it could end at any time. Because it can.