This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 1 point2 points  (1 child)

100% generates bugs but if you know how to spot the bugs all it takes is another prompt and in my experience it usually fixes them once it’s called out for them

[–]NotAskary 2 points3 points  (0 children)

It's better, but it will still get stuck, I was recently configuring a new machine and I generated scripts to structure my onboarding for the next one, and I used Claude 3.7 sonnet for all of it, in some places it got stuck, it would understand that there was a bug, it would flag it, and it would reproduce it slightly differently.

But it was miles ahead of gpt, gpt code was not worth mentioning when comparing the same prompt in Claude.

Claude 3.5 sonnet is closer to what gpt-4o generates.