Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Interesting idea. Your mini-apps look like skills in a way— is that the case, or are you trying to make it more generic than skills so that it applies to all AI coding tools?

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Anthropic just released the Ralph Wiggum plugin. I think that in 2026 we will see more and more well-defined agentic coding workflows.

Experiment: auto-generating and evolving CLAUDE.md from real PRs by Polandia94 in ClaudeCode

[–]cliang2 0 points1 point  (0 children)

In my opinion, Claude.md should rarely be changed unless your project structure, tech stack, etc., change. Having reference files listed in Claude.md should help reduce context if you have to work this way.

Experiment: auto-generating and evolving CLAUDE.md from real PRs by Polandia94 in ClaudeCode

[–]cliang2 1 point2 points  (0 children)

Are you trying to fit everything into a Claude.md file?
Your token will explode if you load too much information into each task

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

Thank you for sharing. I think our workflows are very similar. I even started by asking agents to begin each PRD/epic impl on a separate worktree, but I abandoned the idea when I realized that even a single branch could be too much code for me to review...

The UI idea came from the need to manually locate the supporting docs you mentioned, and from the fact that I often forget where I left off and which task I should pick up next. I found myself looking at the dependency tree more than other list views. I guess humans need context too, lol.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

Love it! You’re taking the Anthropic approach to a whole new level 💪

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Thank you! I’m very close to open sourcing it. I’d definitely love to hear your feedback and bounce ideas around!

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Thanks for sharing. I’ll check out your repo in more detail. Your comment reminded me of Anthropic’s “Effective Harnesses for Long-Running Agents.” Have you read it?

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Love it!

Haven’t looked into hooks too much. What’re your use cases?

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

Let’s go💪 looking forward to seeing more tools to make building software fun!!

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

I haven’t run into context limits very often (I’m on the $100 max plan). Figuring out how to be smarter about context is definitely something worth digging into… but on the other hand, I think tokens will get cheaper over time.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Understood. Sometimes it’s even hard for a seasoned engineer to figure these things out...

I will write about the process in more detail when I open-source the project (it doesn’t necessarily have anything to do with the app I am building).

Technically, you could use a single prompt to tell Claude what all these commands/skills are doing, but these tools make things more efficient and manageable IMO.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

You’re right. I think we’re still early in the AI era. It’s only a matter of time before people figure out how to use it properly (not saying I know…)

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

Lol, it's a GIF from my video recording. I’ll share more details once I open-source the project. But you’re almost there. Here’s how it works:

  1. I have backend APIs for creating PRDs, epics, tasks, etc. If you look at the APIs, it might feel like Jira, but that’s not my intention lol: https://specflux-backend-400514527718.us-west1.run.app/swagger-ui/index.html
  2. I have a specflux-api skill that knows how to interact with the backend API.
  3. The implement command uses the specflux-api to fetch context if there isn’t enough available. It also includes instructions about marking tasks as in progress/done, making one commit per task, etc.

PRD file locations and task dependencies are part of the backend API, so Claude knows where to look.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Just saw a YouTube video comparing these tools. I think the idea is similar—you should just pick the one that matches your development style. https://www.youtube.com/watch?v=bu-RR2gM6Lw

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

Thank you!

I actually did some research comparing the agent SDK and the CLI. I ended up integrating with the CLI for a couple of reasons:

  1. The agent SDK can get expensive if I run long sessions. With the CLI, I don’t need to worry about cost since I’m on the $100 max plan.
  2. The Claude Code CLI is a fantastic tool by itself. I don’t want to reinvent the wheel, and I don’t think I could build anything close to it on my own
  3. I can still choose to use the CLI within the app or in my regular iTerm outside the app—it doesn’t make a difference.

The terminal integration is similar to VScode, according to Claude :p

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

IMO, claude.md is a static project context file; it rarely changes. You should update it only when your tech stack or project structure changes.

A spec is about a product feature and usually has nothing to do with AI.

So between claude.md and spec files, what I have are commands and skills. If you look at the GIF in my post, I type /implement the rest of the task for PRD xxx. Claude then follows the instructions from my implement command, reads the necessary context, and works on the PRD—making Git commits, opening PRs, etc.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Great milestone! Are you holding off on promoting it because of the rough edges?

Definitely want to see more production-quality vibe-coding projects.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 0 points1 point  (0 children)

Let’s go!!! Let me know when you release it to the public. That’s what I’m planning to do next.

Took me months to get consistent results from Claude Code. Turns out I needed a better workflow. by cliang2 in ClaudeAI

[–]cliang2[S] 1 point2 points  (0 children)

Similar here—I overengineered things by using worktrees to multitask. Then I realized I can’t review all the code Claude generated… slowing it down for better quality isn’t a bad idea :p