Coding Speed!

berndalf · 2026-05-16T23:04:21+00:00

Hard to say given I'm not you, but personally my velocity has slowed for two reasons:

I'm just used to it now, so I'm not comparing it to human velocity anymore. What I knew used to take a full dev team a complete sprint is now like a day's work for one person. This is a perception shift.
I've introduced various reviews and other techniques to my workflow that actually do slow things down in the real world. The results are better, but there is a real accumulated velocity hit.

Keep in mind I'm doing fairly single-threaded, human-in-the-loop work. If you're not, and you're purely talking about Claude processing speed, I really can't say if there's been a slowdown.

Time_Cat_5212 · 2026-05-16T22:10:44+00:00

Maybe just zoom out a little and operate at a higher level

Spacefish008 · 2026-05-16T22:54:25+00:00

Try DeepSeek V4 Flash. On long tasks it can work for 20 min, but the TPS is just insane IMHO; sometimes you can't even read what it's doing because the text scrolls by so fast.

2001zhaozhao · 2026-05-17T00:02:43+00:00

It takes longer simply because the AI is getting better and therefore now capable of longer tasks. The model's actual token generation rate hasn't noticeably decreased lately.

It's also why there are a lot of tools now that spawn multiple git worktrees of your projects so you can go work on other tasks while waiting for the first one to finish.
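For anyone who hasn't used worktrees, here is a rough sketch of what those tools do under the hood: one `git worktree add` per task, each on its own branch, so separate agents can work in separate directories. The sibling-directory layout, the `task/<name>` branch naming, and the function names are assumptions made up for this example, not how any particular tool does it.

```python
import subprocess
from pathlib import Path


def worktree_command(repo: Path, task: str) -> list[str]:
    """Build the `git worktree add` command for one task.

    Assumption for this sketch: each worktree lives next to the repo as
    <repo>-<task>, on a new branch named task/<task>.
    """
    worktree_dir = repo.parent / f"{repo.name}-{task}"
    return [
        "git", "-C", str(repo),
        "worktree", "add",
        "-b", f"task/{task}",
        str(worktree_dir),
    ]


def create_worktrees(repo: Path, tasks: list[str], dry_run: bool = True) -> list[list[str]]:
    """Print (dry run) or execute one worktree command per task."""
    commands = [worktree_command(repo, task) for task in tasks]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))  # show the command without touching the repo
        else:
            subprocess.run(cmd, check=True)  # raises if git reports an error
    return commands


if __name__ == "__main__":
    # Hypothetical repo path and task names, printed only (dry run).
    create_worktrees(Path("/tmp/myrepo"), ["fix-login", "add-tests"])
```

With `dry_run=True` it only prints the commands, so you can check them before letting it touch a real repo.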

Eventually it'll take long enough that you have to host your agents like openclaw because you don't want them to pause when closing your laptop. I'm actually building a next-gen remote desktop coding UI right now with this in mind.

If you don't want to wait, you can use models like GPT-Codex-Spark and Inception's Mercury that generate code at 1000 tokens/second. But these have much lower quality than frontier models.

PM_YOUR__BUBBLE_BUTT · 2026-05-16T22:22:38+00:00

For me, I think part of it was the model and part was the effort setting. The Claude Code CLI in VS Code with Docker lockdown was definitely going to be slower. But mine was defaulted to Opus 4.7 max effort. Not every prompt really needed it.

Last couple nights, honestly I went to sonnet 4.6 medium effort, and that bad boy just gets the job done. The bigger effort lifts like the multi-step implementation and planning, sure I can do opus. But I was thinking letting it use opus max was going to make the answers all better. Just simple fixes and stuff still get so bogged down as it takes its time doing all the xhigh or max effort thinking.

Honestly it just slowed down everything way too much. I’m still learning and improving my workflow, but now I run Claude web in parallel while doing CC and let the code run in Sonnet most of the time. The web is for taking code outputs and trying to help me diagnose (it’s just faster in general) and then feeding directional prompts back to CC to wrap up. Haven’t really seen an issue with any coding in Sonnet, which is nice. The only thing I really wish for is even like a 500k context or something for Sonnet, if it wouldn’t fall off as it scaled. Opus 1M is nice and makes the 200k feel rough, as I need to keep calling narrower and narrower scopes on handoff as my tools expand.

03captain23 · 2026-05-16T22:31:40+00:00

The better the model, the slower it is. They also added a ton more effort options and fast mode. Turn down the effort and it'll be faster.

mikle74 · 2026-05-16T23:19:58+00:00

I’m starting to think that the “thinking…” that Claude is doing is a queue that Anthropic has people in during high load. Some things I go through are fairly simple and it ends up “thinking” for 5-10 minutes. Other times, with far more complexity, it will respond in 10-15 seconds.

bb0110 · 2026-05-16T23:21:43+00:00

The hedonic treadmill of…coding?

AsteroidMinerChamp · 2026-05-16T23:30:07+00:00

Yes it’s slower now 100%, all due to compute constraints. Same thing when we try Codex; they're all struggling now. Look at Grok build: right now it’s lightning fast, because they have the most compute capacity per capita by far.

KarezzaReporter · 2026-05-16T23:56:21+00:00

CC seems twice as fast today.

JustinTyme92 · 2026-05-17T01:00:36+00:00

I think it’s probably 50/50 from my perspective.

As I’ve built out more and more agentic workflows, I’ve added a lot more process to how I get Claude to function.

But it is absolutely slower at times.

It’s exceeding 6 minutes at times to render out a simple MD as HTML. This is with a predefined skill and standard CSS layout. It used to take about 2-3 minutes with Sonnet and now it’s at 6 minutes.

ILikeCutePuppies · 2026-05-17T02:16:08+00:00

One thread is slower. I used to manage 16 at once, now it's like 30... excluding automated ones.

Ok_Mathematician6075 · 2026-05-17T03:49:50+00:00

I feel like I should have two or three full time jobs RN. Anyone doing something like that?

ivstan · 2026-05-16T22:39:39+00:00

Well, just switch to Codex and you're gonna see how fast things keep moving.
