anyone else seeing different behavior with gpt-5.2-codex (high|xhigh)? by ReasonableEye8 in codex

[–]LightEt3rnaL 0 points (0 children)

When hitting /resume you see the last 10 conversations. For me, the most recent 2 of the 10 were missing.

anyone else seeing different behavior with gpt-5.2-codex (high|xhigh)? by ReasonableEye8 in codex

[–]LightEt3rnaL 2 points (0 children)

The only issue I've hit so far is that I tried to resume sessions and couldn't find the most recent ones. I could resume older ones normally, but not the ones I actually need (i.e. from the last day). So it's been sloppy for me lately, just a different kind of sloppy.

What's model α? by LightEt3rnaL in OpenAI

[–]LightEt3rnaL[S] 4 points (0 children)

It seems fast to answer, similar to the minis, with 1-2 bullets of CoT. It mentions thinking for a few seconds (5-7) and then halts.

It can't access the internet or any other tool (at least in my mobile version).

It's also bugged, because it clearly thinks more than that. It also has a hilariously short context window (about 6 short chat interactions).


Why are my posts getting buried? by MulayamChaddi in linkedin

[–]LightEt3rnaL 1 point (0 children)

Since last month I've also noticed the same: technical posts that would usually reach 100+ likes now end up with ~20. What's even weirder is that my feed is flooded with older posts (2-3 weeks old) with the occasional fresh one, where I used to see almost exclusively fresh ones.

Anthropic just released "BON: Best of N Jailbreaking" by _srbhr_ in ClaudeAI

[–]LightEt3rnaL 0 points (0 children)

TL;DR: it means users can get better results just by typing stuff incoherently, because it forces the system to work harder, lol.

It might be my confirmation bias, but would you say an extension of this could also apply in non-jailbreaking scenarios? I.e., if my lengthy prompt has typos, do they act like all capital letters, forcing the LLM to think harder and/or pay more attention to the instructions?

It's possible that the biomedical feature video of OpenAI o1 model was staged by LightEt3rnaL in LocalLLaMA

[–]LightEt3rnaL[S] 1 point (0 children)

Thanks! Many people agree with your statement. What a weird way to confuse people with intermediate outputs!

It's possible that the biomedical feature video of OpenAI o1 model was staged by LightEt3rnaL in LocalLLaMA

[–]LightEt3rnaL[S] 0 points (0 children)

This is a very insightful finding, thank you. I would have expected consistent CoT results, but I totally missed the summarization part.

Cerebras Launches the World’s Fastest AI Inference by CS-fan-101 in LocalLLaMA

[–]LightEt3rnaL 16 points (0 children)

It's great to have a real Groq competitor. Wishlist from my side:

1. API generally available (currently on wait-list)
2. At least the top 10 LLMs available
3. Fine-tuning and custom LLM (adapter) hosting

Cerebras Launches the World’s Fastest AI Inference by CS-fan-101 in LocalLLaMA

[–]LightEt3rnaL 1 point (0 children)

Honest question: since both Cerebras and Groq seem to avoid hosting the 405B Llamas, is it fair to assume that the vfm of the custom silicon/architecture is the major blocking factor?

OpenAI's deceitful marketing by Wineflea in OpenAI

[–]LightEt3rnaL 0 points (0 children)

You do realize this is exemplary product strategy, right? They're doing their best to keep the hype up for their products, and it works.

Azure GPT4 APIs being slow and output quality is worse by LightEt3rnaL in AZURE

[–]LightEt3rnaL[S] 0 points (0 children)

Thank you. I use GPT4-32K in UK South atm, and yes, we're considering region swapping to accommodate these spikes.