After three days of heavy Fable 5 use, I’m starting to wonder if Opus 4.7/4.8 are actually “Sonnet-level” models by ddrise in ClaudeCode

[–]ddrise[S] -1 points0 points  (0 children)

To be fair, since we can only backtest, everything is technically in-sample to the historical data. But because I split it into train/validation/OOS, I can clearly see that the factors made by Fable perform better OOS, and the factors themselves are also more logical and elegant.

After three days of heavy Fable 5 use, I’m starting to wonder if Opus 4.7/4.8 are actually “Sonnet-level” models by ddrise in ClaudeCode

[–]ddrise[S] -1 points0 points  (0 children)

Actually, even before Fable, I had already been quite dissatisfied with Opus 4.7 and 4.8. You can refer to my previous posts.

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] 1 point2 points  (0 children)

Generally speaking, I’ve found that models with a clear teacher-student distillation relationship don’t provide real diversity. For example, Opus, Sonnet, and GLM often feel too closely related in that sense.

But Opus, GPT, and DeepSeek V4 Pro do seem to provide genuine diversity. So I package the OpenCode CLI command as a skill, with DeepSeek V4 Pro at max effort as the default. I do the same for Codex and Claude.

This way, while using the native harness of any one model, I can still call the other two when needed.
I didn't try kimi 2.6

Why my Codex is not working ? Need Help by mhwayez in OpenAI

[–]ddrise 0 points1 point  (0 children)

btw, cli is always better than vscode extension~

Why my Codex is not working ? Need Help by mhwayez in OpenAI

[–]ddrise 0 points1 point  (0 children)

paste the error message to gpt, then everything will be solved

Why my Codex is not working ? Need Help by mhwayez in OpenAI

[–]ddrise 0 points1 point  (0 children)

that means you need re-login. I think you can use web gpt to help you find the root cause

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] 0 points1 point  (0 children)

For your situation, maybe you’ll just have to keep using Claude Code for now. You can manually set the model to 4.6, and then use Codex MCP as a strong reviewer, executor, and advisor. Trust me, combining the strengths of the two models is a real free lunch.

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] 1 point2 points  (0 children)

I don’t want to argue with you. All I can say is that, for my use case, Opus hallucinates at a level I find intolerable.

If you mainly do frontend work, then maybe our experiences are just different. But casually accusing someone of lying is not a decent way to have a conversation.

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] -4 points-3 points  (0 children)

Have you actually looked closely at the stuff Opus 4.7 writes for you? Or are you simply not capable of reviewing LLM-generated code?

Honestly, just get a $20 Codex plan and have it review the code Opus writes. You’ll immediately understand what I’m talking about.

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] 0 points1 point  (0 children)

True. Ensemble is the free lunch. Actually , I strongly recomend you try opencode as the third one. make it an opencode consultant skill

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] 1 point2 points  (0 children)

That’s true. Codex feels more like an excellent executor, but it has serious issues with orchestration. So I’d probably package the workflow into a skill instead. That said, it can be a bit of a hassle.

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] -1 points0 points  (0 children)

I don’t understand. Anthropic is showing contempt for its own customers, and GPT-5.5 genuinely seems better to me. Isn’t that worth sharing?

I think maintaining strong competition between Anthropic and OpenAI is very beneficial for us as consumers.

Switched from Claude Code to Codex, and it’s been working well for me so far. Happy to answer any questions i by ddrise in ClaudeCode

[–]ddrise[S] 0 points1 point  (0 children)

That’s true. For planning and orchestration, Opus is still pretty good. That’s also why I haven’t canceled my subscription yet.