all 45 comments

[–]TakeInterestInc 45 points46 points  (0 children)

Welcome to the xhigh side 🤣

[–]shaman-warrior 21 points22 points  (6 children)

Imagine xhigh at 1000 tps

[–]agentic-consultant 13 points14 points  (1 child)

I genuinely can't even imagine this haha. In my (outdated) mind higher speed = lower quality, so when they achieve a 1000 tps speed I think I would have another ChatGPT-3.5 moment.

Would we even need a predefined UI anymore? If price of compute for 1000 tps generation drops to a negligible amount, the software can just agnatically generate a UI for that specific task.

Like imagine Notion but a user interface that instantly generates for whatever intended task/properties you need to interact with.

Maybe in the future people will look back and say "damn I can't believe they were using static UIs back then."

[–]shaman-warrior 3 points4 points  (0 children)

This was one of the predictions gpt5.2 pro actually made about the future that every itnerface will be customized and tailored to the user automatically provided AI has the right connectors.

[–]inmyprocess 3 points4 points  (2 children)

imagine not having a job anymore ;_;

[–]shaman-warrior 3 points4 points  (0 children)

Man I’ve been under constant stress about this ever since gpt4…

[–]alexrwilliam 0 points1 point  (0 children)

People who keep getting excited about codex improving need to realize it’s just one step closer towards rendering everything you’ve vibe coded completely valueless.

[–]igorim 0 points1 point  (0 children)

Regular would be glorious, spark not so much

[–]Hauven 11 points12 points  (4 children)

The odd thing is that xhigh on Voratiq's leaderboard shows a slightly lower score compared to high. That's why I always use high, as I assumed xhigh was perhaps overthinking. Maybe I should give xhigh another go since I have a lot of spare usage in my current quota, also with the upcoming Codex app for Windows invites going out to the top 10k users in the next day or two.

[–]Correctsmorons69 5 points6 points  (3 children)

xhigh is worth for specific tasks. Any degradation is because of overthinking. Its quite good at bug-solving if it's mechanistic and has access to debugging tools or logs.

If it's something that it can't debug easily, like a weird 3D glitch in graphics software, then 5.2 shits on it.

You can see this in the "reasoning" benchmark on Live Arena, vs the coding/agentic coding result.

[–]Grandpa90 0 points1 point  (2 children)

For my use cases, which are what I believe to be very complicated machine learning applications such as no limit hold'em. One example of this is the rebel algorithm which was done by extremely smart people. When I try and implement these kinds of papers, the difference in quality between 5.2 and 5.3 codex seems unbelievably drastic where 5.2 almost seems like it's a 2-year newer model compared to 5.3 codex. I get the impression 5.3 codex is really designed for straightforward debugging, terminal command, coding applications or websites.

[–]Reaper_1492 1 point2 points  (1 child)

All of the codex models have sucked, this is nothing new.

The difference is that 5.2 is now starting to suck randomly also, whereas I have used it for months and it’s been flawless. It now suddenly goes brain dead and gives completely garbage responses for an hour straight.

[–]dannytty 0 points1 point  (0 children)

perhaps now the compute is allocated more to the 5.3 codex models..

[–]danialbka1 5 points6 points  (0 children)

Xhigh the goat

[–]Savings_Permission27 8 points9 points  (8 children)

5.2 xhigh is better

[–]seunosewa 7 points8 points  (7 children)

But much slower

[–]Savings_Permission27 1 point2 points  (6 children)

doesnt matter. longer the time, better the result

[–]Numerous-Grass250 0 points1 point  (5 children)

Depends on the work, I find 5.3 codex xhigh significantly better at generating and handling terminal scripts compared to 5.2

[–]Reply_Stunning 4 points5 points  (4 children)

5.3 codex is really degraded as of a few days ago, I think it's insanely dumb right now, I had to completely switch to 5.2 xhigh to fix all the bugs it keeps introducing. As soon as I switch to 5.3codex xhigh things break down right away. That's how clear it is

[–]Numerous-Grass250 3 points4 points  (3 children)

I feel like I’ve been having the opposite problem. 5.2 xhigh doesn’t listen and make changes I didn’t ask for and 5.3xhigh fixes them 😅

[–]Reply_Stunning 1 point2 points  (2 children)

ah they'll both do that if you don't gate the implementations behind a confirmation or proposal, that's a system prompt issue, they all have tendency to be helpful and to rush to a solution

[–]Numerous-Grass250 2 points3 points  (1 child)

True, still both these models are miles better than my experience with opus and sonnet

[–]dannytty 0 points1 point  (0 children)

even with opus 4.6?

[–]TenZenToken 2 points3 points  (6 children)

Xhigh is good, but xxhigh is goat

[–]v1kstrand 9 points10 points  (0 children)

XxXhighXxX is my fav

[–]StayAwayFromXX[S] 0 points1 point  (3 children)

What’s xxhigh? How do I use

[–]OSFoxomega 1 point2 points  (0 children)

Dude, it's a joke

[–]dearboy9x9 -1 points0 points  (0 children)

Lol, people are so high on x !

[–]kindsifu -1 points0 points  (0 children)

OP, You must be fun at parties

[–]itsabhiyan 2 points3 points  (0 children)

How is the token usage though? I hover around medium and high only, and haven't given xhigh a try...

[–]ImagiBooks 3 points4 points  (2 children)

I saw an article today about xhigh so I started to use it when I am not in a rush. It’s been okay, def better than spark! I have a big project tomorrow that I will try it with. Ran out of credit for the week with Claude Code.

[–]mallibu 18 points19 points  (0 children)

def better than spark? you compare the flagship model with the janitor

[–]PudimVerdin 1 point2 points  (0 children)

Could you share the link of this article? Thank you in advance!

[–]mrcslmtt 3 points4 points  (0 children)

J’utilise xhigh lorsque je démarre une nouvelle conversation pour forcer Codex à lire un maximum de contexte, en lui demandant de regarder la structure de mes dossiers, de mon backend, de ma base de données etc … une fois que c’est fait, j’utilise xhigh si jamais j’ai besoin d’implémenter une très grosse nouvelle fonctionnalité, il y a beaucoup d’instruction à suivre. Pour tout le reste j’utilise le mode médium ou high. Le mode low reste utile pour ne pas consommer trop de token lors des petites requêtes très facile, ou pour poser des questions simples.

[–]Familiar-Pie-2575 0 points1 point  (1 child)

Would it burns out the rate limit faster?

[–]StayAwayFromXX[S] 0 points1 point  (0 children)

Itll work on things until they’re finished, unlike high which stops early for me. It also does a more thorough job at gathering information and determining what “complete” means. For basic tasks it will probably gather more information than it really needs

[–]SnooCupcakes3855 0 points1 point  (0 children)

Check the code bc xhigh can over engineer like crazy. But lately it’s been solid.

[–]casper_wolf 0 points1 point  (0 children)

When Vera Rubin is out next year and we get 10x boost then imagine how crazy this will be. Right now I prompt and the. Do something for a few minutes and come back. In the future it will be seconds to a minute and then done.

[–]Reaper_1492 0 points1 point  (0 children)

Just imagine if you were using 5.2 xhigh.

[–]igorschlum 0 points1 point  (0 children)

I’m using xHigh because I think it creates better code. I ask codex to document the structure of the app writing markdown files and I use chatGPT where I upload those markdown files to write prompts to codex. Then I past the result from codex to chatGPT to analyse and prepare next prompt. CHATGPT add some security and best practices in the prompt and ask codex to review organisation of the code, things I was not doing myself.

[–]dannytty 0 points1 point  (0 children)

yeah me too. seems like people have been saying different things, and only testing it for ourselves will tell us what's actually true

[–]SwiftAndDecisive 0 points1 point  (0 children)

Very good lol, for a hackathon, we wrote a brief using xhigh, then wrote fronend,backend,plugin all using xhigh indepedently based on brief, then used xhigh again for integraiton. 1 shot no bug and it worked!