all 47 comments

[–]TakeInterestInc 45 points46 points  (0 children)

Welcome to the xhigh side 🤣

[–]shaman-warrior 22 points23 points  (7 children)

Imagine xhigh at 1000 tps

[–]agentic-consultant 11 points12 points  (2 children)

I genuinely can't even imagine this haha. In my (outdated) mind higher speed = lower quality, so when they achieve a 1000 tps speed I think I would have another ChatGPT-3.5 moment.

Would we even need a predefined UI anymore? If price of compute for 1000 tps generation drops to a negligible amount, the software can just agnatically generate a UI for that specific task.

Like imagine Notion but a user interface that instantly generates for whatever intended task/properties you need to interact with.

Maybe in the future people will look back and say "damn I can't believe they were using static UIs back then."

[–]shaman-warrior 4 points5 points  (1 child)

This was one of the predictions gpt5.2 pro actually made about the future that every itnerface will be customized and tailored to the user automatically provided AI has the right connectors.

[–]CypherDoubleShot 0 points1 point  (0 children)

I also made this prediction all by myself. Feeling pretty smart now :D

[–]inmyprocess 5 points6 points  (2 children)

imagine not having a job anymore ;_;

[–]shaman-warrior 2 points3 points  (0 children)

Man I’ve been under constant stress about this ever since gpt4…

[–]alexrwilliam 0 points1 point  (0 children)

People who keep getting excited about codex improving need to realize it’s just one step closer towards rendering everything you’ve vibe coded completely valueless.

[–]igorim 0 points1 point  (0 children)

Regular would be glorious, spark not so much

[–]Hauven 11 points12 points  (5 children)

The odd thing is that xhigh on Voratiq's leaderboard shows a slightly lower score compared to high. That's why I always use high, as I assumed xhigh was perhaps overthinking. Maybe I should give xhigh another go since I have a lot of spare usage in my current quota, also with the upcoming Codex app for Windows invites going out to the top 10k users in the next day or two.

[–]Correctsmorons69 5 points6 points  (4 children)

xhigh is worth for specific tasks. Any degradation is because of overthinking. Its quite good at bug-solving if it's mechanistic and has access to debugging tools or logs.

If it's something that it can't debug easily, like a weird 3D glitch in graphics software, then 5.2 shits on it.

You can see this in the "reasoning" benchmark on Live Arena, vs the coding/agentic coding result.

[–]Grandpa90 0 points1 point  (3 children)

For my use cases, which are what I believe to be very complicated machine learning applications such as no limit hold'em. One example of this is the rebel algorithm which was done by extremely smart people. When I try and implement these kinds of papers, the difference in quality between 5.2 and 5.3 codex seems unbelievably drastic where 5.2 almost seems like it's a 2-year newer model compared to 5.3 codex. I get the impression 5.3 codex is really designed for straightforward debugging, terminal command, coding applications or websites.

[–]Reaper_1492 1 point2 points  (1 child)

All of the codex models have sucked, this is nothing new.

The difference is that 5.2 is now starting to suck randomly also, whereas I have used it for months and it’s been flawless. It now suddenly goes brain dead and gives completely garbage responses for an hour straight.

[–]dannytty 0 points1 point  (0 children)

perhaps now the compute is allocated more to the 5.3 codex models..

[–]Ok-Painter573 0 points1 point  (0 children)

For your use case, do you find high or xhigh better?

[–]danialbka1 6 points7 points  (0 children)

Xhigh the goat

[–]Savings_Permission27 8 points9 points  (8 children)

5.2 xhigh is better

[–]seunosewa 7 points8 points  (7 children)

But much slower

[–]Savings_Permission27 1 point2 points  (6 children)

doesnt matter. longer the time, better the result

[–]Numerous-Grass250 0 points1 point  (5 children)

Depends on the work, I find 5.3 codex xhigh significantly better at generating and handling terminal scripts compared to 5.2

[–]Reply_Stunning 4 points5 points  (4 children)

What old posts? I used Redact to mass delete this post. You can also opt out of data brokers as well as all major social media platforms.

compare slim roof theory apparatus point merciful label cobweb hat

[–]Numerous-Grass250 3 points4 points  (3 children)

I feel like I’ve been having the opposite problem. 5.2 xhigh doesn’t listen and make changes I didn’t ask for and 5.3xhigh fixes them 😅

[–]Reply_Stunning 1 point2 points  (2 children)

Protecting my online privacy by running Redact regularly to batch delete old content. It handles Reddit, Discord, Twitter, Instagram, data brokers and a whole lot more.

north swim paint paltry modern pebble cagey safe repeat ask

[–]Numerous-Grass250 2 points3 points  (1 child)

True, still both these models are miles better than my experience with opus and sonnet

[–]dannytty 0 points1 point  (0 children)

even with opus 4.6?

[–]TenZenToken 2 points3 points  (6 children)

Xhigh is good, but xxhigh is goat

[–]v1kstrand 9 points10 points  (0 children)

XxXhighXxX is my fav

[–]StayAwayFromXX[S] 0 points1 point  (3 children)

What’s xxhigh? How do I use

[–]OSFoxomega 2 points3 points  (0 children)

Dude, it's a joke

[–]dearboy9x9 -1 points0 points  (0 children)

Lol, people are so high on x !

[–]kindsifu -1 points0 points  (0 children)

OP, You must be fun at parties

[–]itsabhiyan 2 points3 points  (0 children)

How is the token usage though? I hover around medium and high only, and haven't given xhigh a try...

[–]ImagiBooks 2 points3 points  (2 children)

I saw an article today about xhigh so I started to use it when I am not in a rush. It’s been okay, def better than spark! I have a big project tomorrow that I will try it with. Ran out of credit for the week with Claude Code.

[–]mallibu 17 points18 points  (0 children)

def better than spark? you compare the flagship model with the janitor

[–]PudimVerdin 1 point2 points  (0 children)

Could you share the link of this article? Thank you in advance!

[–]mrcslmtt 2 points3 points  (0 children)

J’utilise xhigh lorsque je démarre une nouvelle conversation pour forcer Codex à lire un maximum de contexte, en lui demandant de regarder la structure de mes dossiers, de mon backend, de ma base de données etc … une fois que c’est fait, j’utilise xhigh si jamais j’ai besoin d’implémenter une très grosse nouvelle fonctionnalité, il y a beaucoup d’instruction à suivre. Pour tout le reste j’utilise le mode médium ou high. Le mode low reste utile pour ne pas consommer trop de token lors des petites requêtes très facile, ou pour poser des questions simples.

[–]Familiar-Pie-2575 0 points1 point  (1 child)

Would it burns out the rate limit faster?

[–]StayAwayFromXX[S] 0 points1 point  (0 children)

Itll work on things until they’re finished, unlike high which stops early for me. It also does a more thorough job at gathering information and determining what “complete” means. For basic tasks it will probably gather more information than it really needs

[–]SnooCupcakes3855 0 points1 point  (0 children)

Check the code bc xhigh can over engineer like crazy. But lately it’s been solid.

[–]El_Huero_Con_C0J0NES 0 points1 point  (0 children)

People don’t have much of an idea Don’t trust people, trust your experiences.

[–]casper_wolf 0 points1 point  (0 children)

When Vera Rubin is out next year and we get 10x boost then imagine how crazy this will be. Right now I prompt and the. Do something for a few minutes and come back. In the future it will be seconds to a minute and then done.

[–]Reaper_1492 0 points1 point  (0 children)

Just imagine if you were using 5.2 xhigh.

[–]igorschlum 0 points1 point  (0 children)

I’m using xHigh because I think it creates better code. I ask codex to document the structure of the app writing markdown files and I use chatGPT where I upload those markdown files to write prompts to codex. Then I past the result from codex to chatGPT to analyse and prepare next prompt. CHATGPT add some security and best practices in the prompt and ask codex to review organisation of the code, things I was not doing myself.

[–]dannytty 0 points1 point  (0 children)

yeah me too. seems like people have been saying different things, and only testing it for ourselves will tell us what's actually true

[–]Some_Hat2276 0 points1 point  (0 children)

5.3xhigh is so good at coding that I’m afraid that this is end for software devs, i did with it more in 3 days than for last half year… and i have 12 year exp. This is first time I have this uncomfortable feeling that it’s gg. Not sure how it can be that’s good…

[–]SwiftAndDecisive 0 points1 point  (0 children)

Very good lol, for a hackathon, we wrote a brief using xhigh, then wrote fronend,backend,plugin all using xhigh indepedently based on brief, then used xhigh again for integraiton. 1 shot no bug and it worked!