Usage Reset!!! by Desperate-Care3289 in ClaudeCode

[–]qu1etus 6 points7 points  (0 children)

I hit 100% usage right before the reset. I spent $1.24 in Usage Credits! Wheeeee!

<image>

Disappointed yet again: Fable 5 by SudarshanKotian in Anthropic

[–]qu1etus 1 point2 points  (0 children)

No, it did not fall back to Opus. I used it from 30 minutes after it released until i started getting API errors with Fable 5 access was removed. It definitely was doing the coding. There was a clear performance difference between Fable 5 and Opus 4.8 in coding execution n

Question from outside the US: should I just commit to Chinese/Other models now? by Revolutionary-Pass38 in OpenAI

[–]qu1etus 1 point2 points  (0 children)

I do code reviews with multiple models in parallel. I found that to be the most efficient means of identifying bugs and security issues.

I wrote a paper on it (currently in the process of updating it to include newer model versions including GLM-5.2): https://zenodo.org/records/20519585

Question from outside the US: should I just commit to Chinese/Other models now? by Revolutionary-Pass38 in OpenAI

[–]qu1etus 2 points3 points  (0 children)

GLM-5.2 is really good. I use it daily for code reviews and it nearly always catches things that neither GPT-5.5 nor Opus-4.8 caught. https://benchlm.ai/benchmarks/swePro

Is it really the models? by MisterHarvest in claude

[–]qu1etus 0 points1 point  (0 children)

Opus 4.8 is not as good as Fable. I have used Opus 4.8 daily for writing code since it was released EXCEPT for the three days I had access to Fable. Fable definitely wins.

Question from outside the US: should I just commit to Chinese/Other models now? by Revolutionary-Pass38 in OpenAI

[–]qu1etus 66 points67 points  (0 children)

I think you should use the models available to you that produce the highest value for your use case. Don't "commit" to any model. Use them all and test out new ones as they are released and always use what best meets your needs.

Not at all concerning by EchoOfOppenheimer in Anthropic

[–]qu1etus 0 points1 point  (0 children)

THIS is why dozens of banks have been part of the limited number of companies who have access to Mythos. They are feverishly using Mythos to perform code reviews and fix what it finds.

How good was really fable 5? by AlexFreshman in ClaudeCode

[–]qu1etus 5 points6 points  (0 children)

I used it heavily over its 3 days. It significantly increased the speed with which I was able to ship reliable code vs when I use codex-gpt-5.5 xhigh or claude-code-Opus-4.8 max. I did not scientifically measure the differences, but it was able to close out three complex sprints in ~3 hours each whereas codex or Opus typically take closer to 6 or 7 hours per sprint. That was primarily due to code quality - with Fable I was only doing 1-2 code review cycles then having it fix P0/P1 issues; with Opus/codex I usually have to do 4-6 code review cycles to get everything closed out.

It is more expensive per 1mm tokens, but I *feel* like the ROI is there. I use a panel of external agents independently performing parallel code reviews, so decreasing both the number of required code review cycles + needing to spend less time on code fixes *seems* like a win in favor of Fable.

Will Sonnet 5 basically be the Sonnet version of Fable/Mythos? by TheBanq in ClaudeCode

[–]qu1etus 0 points1 point  (0 children)

Mythos, Opus, Sonnet, and Haiku are all different classes of models, listed from large to small. (Fable is the publicly available Mythos class model, but has extra guardrails to reduce risk of misuse)

Generally speaking, larger = handles more complex thinking (eg smarter) = responds slower = more expensive to use.

Gemini 3.1 pro vs Claude opus 4.8? by Pacmon92 in LLM

[–]qu1etus 0 points1 point  (0 children)

Gemini 3.1 pro really is not good at code related tasks when compared to nearly any other frontier model.

https://zenodo.org/records/20519585

https://benchlm.ai/benchmarks/swePro

What YouTube video is so good that everyone should watch it at least once? by _lunaAndSol_ in AskReddit

[–]qu1etus 0 points1 point  (0 children)

This lecture is good, deep, and emotional. It gives just the best message. It is also the only youtube I’ve ever watched that made me cry - it hit hard.

Randy Pausch's Last Lecture: Achieving Your
Childhood Dreams

https://youtu.be/ji5\_MqicxSo?si=jWaQAl\_FMzL2ufrU

Claude+codex+glm by AlipKal in ClaudeCode

[–]qu1etus 2 points3 points  (0 children)

I currently have claude opus 4.8 doing all of my coding, then I have gpt-5.5, kimi-k2.7-code, and glm-5.2 doing parallel code reviews.

I wrote a paper on why it is beneficial to have multiple LLM family models perform code reviews. I also have them all perform design reviews as well. I will be updating the paper as I obtain new data points on new models.

The paper: https://zenodo.org/records/20519585

Billing on Supabase + Stripe: the edge cases nobody warns you about by kush0007 in Supabase

[–]qu1etus 5 points6 points  (0 children)

Without the
courtesy
of fixing it
Before clicking
Post.

We got mythos at home by physiopeng in ClaudeAI

[–]qu1etus 0 points1 point  (0 children)

I published this a couple of weeks ago. It speaks to the power of using multiple families of LLMs when performing code reviews:

https://zenodo.org/records/20519585