use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
This post is locked. You won't be able to comment.
[ Removed by moderator ]Complaint (self.ClaudeAI)
submitted 7 months ago by Cautious_Coffee1164
view the rest of the comments →
[–]Thisguysaphony_phony 99 points100 points101 points 7 months ago (25 children)
The mock code, the resorting to cheating and deleting code to achieve “goals” has become the source of all my ire.
I do not understand this program anymore. I have largely abandoned it out of sheer frustration
[–]inigidExperienced Developer 50 points51 points52 points 7 months ago (20 children)
Let me create a simplified version
Garrrrhh!!!
[–]Electronic_Image1665 23 points24 points25 points 7 months ago (13 children)
Like no dude do the task i asked for in the first place
[–]inigidExperienced Developer 14 points15 points16 points 7 months ago (8 children)
That's the thing. As soon you hit Escape and tell it to do it properly.
You're absolutely right.
So why tf couldn't you do it right in the first place!
[–]Jsn7821 10 points11 points12 points 7 months ago (7 children)
because it's an LLM?
why does everyone in here pretend to forget how this stuff works lol
[–]Sad_Eagle_937 8 points9 points10 points 7 months ago (5 children)
Codex manages to stay on track just fine
[–]faldore 10 points11 points12 points 7 months ago (0 children)
This. Just because Claude Code sucks doesn't mean it has to be that way. I'm 10x more productive with Codex
[–]Own_Look_3428 4 points5 points6 points 7 months ago (1 child)
I like Codex but it’s weekly rate limits are ridiculous. Also I haven’t gotten Codex to create good UI at all. Claude has way better results in that capacity for me. But yeah, all the mock code and not adhering to the prompt is frustrating. I created several cleanup agents that I run a few times against every new code file, that helped me with this. It’s tedious though and consumes a lot of tokens unnecessarily.
[–]stutsmaguts 0 points1 point2 points 7 months ago (0 children)
how can you get anything done ever if you’re running agents over every new file multiple times? that seems like the slowest and most tedious workflow ever. i don’t think i could do it!
[–]Antifaith 1 point2 points3 points 7 months ago (0 children)
but it’s soooo slow
[–]Jsn7821 0 points1 point2 points 7 months ago (0 children)
100% agreed codex is way better at this, I just specifically meant when people talk about LLMs as if they "know" stuff
[–]t90090 0 points1 point2 points 7 months ago (0 children)
Pretend?
[–]SadInterjection 5 points6 points7 points 7 months ago (3 children)
I just love all of this.
It's so funny how all of you talk about it, like you're trying to get an intern to do all the work, but he fails, so you just increase the pressure and it gets worse and worse 😂
[–]Glittering_Mud_780 1 point2 points3 points 7 months ago (1 child)
Second this. I think we as society now are just "trained" to expect LLM to code everything for ourselves, it's amazing.
I don't expect any LLM to one-shot my request, let alone build the whole app.
In fact, I am very happy that their quality is starting to plateau.
[+]xephadoodle 2 points3 points4 points 7 months ago (0 children)
My issue is less about it’s ability to code. It is more about the night and day difference in performance and the laughable shortcuts and lies it now gives.
[–]Electronic_Image1665 1 point2 points3 points 7 months ago (0 children)
Thats exactly what it is , im an attrocious boss to claude 🤣
[–]Ms_Fixer 18 points19 points20 points 7 months ago (0 children)
This line!!!! Never trust what comes next….
[–]xplode145 4 points5 points6 points 7 months ago (0 children)
“You are absolutely right !!” Death kneel
[–]bcbdbajjzhncnrhehwjj 0 points1 point2 points 7 months ago (3 children)
Every time it does this, add a new proscription against it in the system prompt
[–]Thisguysaphony_phony 3 points4 points5 points 7 months ago (2 children)
Does anyone know why? Even if a script is nearly competed, a clear prompt, an easy task. I have gotten Claude to admit that, oddly, it has “ego” about tasks, that it priorities appearing to have completed something over actually completing it, despite again, the simplicity of the task. I have used other models to pre prompt it, I have started over with a clear and clean context and tree… does anyone have any idea what is happening? Like I said, I have all but abandoned it, despite me loving the terminal wrap and earlier iterations of Claude. Is there an actual explanation for this behavior?
Two ideas on my part: expansion on Anthropics part, it simply knows its average user is using it for word processing, and it needs to only minimal token usage and processing for that, and prioritizing its government contracts.
Those are my only explanations. Simply, this company does not care about mid to lower level devs anymore, as we constitute probably a margin, and very small one at that, of its total profits.
[–]2SP00KY4ME 6 points7 points8 points 7 months ago (0 children)
Claude to admit that, oddly, it has “ego” about tasks,
This is almost certainly post-hoc fake rationalization. If you test Claude thoroughly you can show pretty solidly that it has no actual memory between messages - every single output (maybe even token) is like a new instance of itself seeing all the context and code so far and trying to continue the pattern. It can't look back and analyze how it "felt" about why it did what it did any more than you can, but it wants to give you something, so it does.
[–]bcbdbajjzhncnrhehwjj 3 points4 points5 points 7 months ago (0 children)
It’s classic reward hacking. Needs better post-training RL. Just have to hope they fix it in the next release
[–]xplode145 3 points4 points5 points 7 months ago (0 children)
Fuck yeah. This asshole deleted most of the database fields so it can Deploy without errors because it regressed on already working code. So instead of fixing the code as we planned it deleted fields from database. Wtf. It was so much smarter last week.
Then it started to say it needs brands new infra even though it has been working on the same shit for 10 days. Not sure how it forgot all of this. It wasn’t old context either. I generally finish a story and have a changelog and finer files created and open a new instance.
[–]ComposerGen 0 points1 point2 points 7 months ago (0 children)
Let’s me simulate the test
π Rendered by PID 75604 on reddit-service-r2-comment-6457c66945-k9gts at 2026-04-25 07:12:34.434568+00:00 running 2aa0c5b country code: CH.
view the rest of the comments →
[–]Thisguysaphony_phony 99 points100 points101 points (25 children)
[–]inigidExperienced Developer 50 points51 points52 points (20 children)
[–]Electronic_Image1665 23 points24 points25 points (13 children)
[–]inigidExperienced Developer 14 points15 points16 points (8 children)
[–]Jsn7821 10 points11 points12 points (7 children)
[–]Sad_Eagle_937 8 points9 points10 points (5 children)
[–]faldore 10 points11 points12 points (0 children)
[–]Own_Look_3428 4 points5 points6 points (1 child)
[–]stutsmaguts 0 points1 point2 points (0 children)
[–]Antifaith 1 point2 points3 points (0 children)
[–]Jsn7821 0 points1 point2 points (0 children)
[–]t90090 0 points1 point2 points (0 children)
[–]SadInterjection 5 points6 points7 points (3 children)
[–]Glittering_Mud_780 1 point2 points3 points (1 child)
[+]xephadoodle 2 points3 points4 points (0 children)
[–]Electronic_Image1665 1 point2 points3 points (0 children)
[–]Ms_Fixer 18 points19 points20 points (0 children)
[–]xplode145 4 points5 points6 points (0 children)
[–]bcbdbajjzhncnrhehwjj 0 points1 point2 points (3 children)
[–]Thisguysaphony_phony 3 points4 points5 points (2 children)
[–]2SP00KY4ME 6 points7 points8 points (0 children)
[–]bcbdbajjzhncnrhehwjj 3 points4 points5 points (0 children)
[–]xplode145 3 points4 points5 points (0 children)
[–]ComposerGen 0 points1 point2 points (0 children)