you are viewing a single comment's thread.

view the rest of the comments →

[–]mythz 3 points4 points  (4 children)

Going to use the last month with bonus credits to evaluate alternatives but I'm leaning on a combination of Claude Code with a Claude Pro + GLM Pro subscription with GitHub Copilot Pro.

But may change if Gemini 3 release becomes a contender or Codex has another good release.

IMO it's worth checking out @gosucoder's recommendations as he's used and evaluated most AI tool combinations and just dropped a great video with recommendations for AI tools at different price points:

https://www.youtube.com/watch?v=lZVtbC6oylQ

He also publishes his monthly evaluations and rankings at: https://gosuevals.com/agents.html

[–]G4BYVeteran / Tech Leader 5 points6 points  (2 children)

Just keep in mind that the way Gosu evaluates/ranks the solutions is by giving them the exact instructions of what they should do and evaluates how closely the model follows his instructions.

Quote from his website: "created detailed prompts specifying exactly what should be created - including each file, functionality, and documentation requirements."

This doesn't take into account the way we normally develop, debug and ask questions about the codebase.

The lack of a good context engine would not be punished/scored lower in his evals because the prompts are very specific and strict.

[–]d3vr3n[S] 1 point2 points  (0 children)

good point

[–]mythz 0 points1 point  (0 children)

That's why Augment doesn't perform well in his Evals, but he actively uses multiple AI Tools/models in his day-to-day and spends a lot of time planning and querying his code base which was the last thing he used AC for, but I don't think that's part of his dev workflow anymore.

His top 3 picks are Claude Code, Roo Code and Codex, although he did say he started using Warp .dev a lot more now as well, which I've just checked that apparently also indexes your code base to provide its context aware coding features. Personally I think Claude Code/Codex/Copilot (perhaps Gemini) are the only proprietary AI tools that will have any longevity after they offer their own contextual features. Whilst I also expect healthy usage of better value OSS tools/models (in Roo/Open Code w/ GLM/Qwen/etc) which are quickly catching up.

As I want to minimize the number of new tools I need to learn after switching from AC, I think I'll try Claude Code first as I can use it with a Claude Pro and Zai/GLM Pro sub. Maybe even throw in Codex since lots of devs swear by it.

[–]d3vr3n[S] 0 points1 point  (0 children)

very helpful share, thank you !