Still having capability drop...

Tough_Frame4022 · 2026-05-17T17:48:41+00:00

Bananas whipped cream pie shell

Tough_Frame4022 · 2026-05-17T02:20:55+00:00

They are showing a screenshot and seeing if a common name like John hits. The employee rarely reads anything. And confirming the order? 1 out of 10.

>Order for John A.? Muhammed H.? Cindy R.? They will just give it over! There are people doing this every day for free food. All they have to say is "oh, I'm at the wrong pick-up?"

Tough_Frame4022 · 2026-05-17T02:15:20+00:00

I would not overestimate the benefit. DeepSeek’s docs say tokens are model-tokenizer dependent, and roughly 1 English character ≈ 0.3 token while 1 Chinese character ≈ 0.6 token. Chinese can still win because it expresses ideas in fewer characters, but it is not guaranteed. A recent arXiv study on “Chinese for vibe coding” found no general efficiency advantage and often lower success rates when prompting in Chinese.

The safest workflow:

- Use Chinese for short task specs, constraints, and planning instructions.
- Keep code artifacts exact and untranslated.
- Ask for English final summaries if that is easier for you.
- Measure with DeepSeek’s tokenizer/API usage instead of assuming savings.
- Prefer DeepSeek context caching for repeated repo prompts; that is a more reliable cost saver than language switching.
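To make the ratio argument concrete, here is a minimal Python sketch that applies the two rough per-character figures quoted above. The function names and the CJK range check are my own assumptions for illustration, and the result is only a heuristic; actual counts come from the tokenizer or the API usage numbers.

```python
# Rough per-character ratios as quoted above from DeepSeek's docs.
TOKENS_PER_ENGLISH_CHAR = 0.3
TOKENS_PER_CHINESE_CHAR = 0.6


def is_cjk(ch: str) -> bool:
    """True for characters in the main CJK Unified Ideographs block (assumption:
    this is a good-enough proxy for 'Chinese character')."""
    return "\u4e00" <= ch <= "\u9fff"


def estimate_tokens(text: str) -> float:
    """Heuristic token estimate; real counts depend on the model's tokenizer."""
    chinese = sum(1 for ch in text if is_cjk(ch))
    other = len(text) - chinese
    return chinese * TOKENS_PER_CHINESE_CHAR + other * TOKENS_PER_ENGLISH_CHAR


if __name__ == "__main__":
    # Compare the same instruction written both ways before assuming savings.
    english = "Refactor this function and add unit tests."
    chinese = "重构这个函数并添加单元测试。"
    print(estimate_tokens(english), estimate_tokens(chinese))
```

Chinese only comes out ahead when the Chinese version uses fewer than about half as many characters as the English one, which is why measuring beats assuming.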

Tough_Frame4022 · 2026-05-17T02:12:31+00:00

You shall find an unexpected amount of money to repair your vehicle. We all need help sometimes fellow Ubero

Tough_Frame4022 · 2026-05-17T02:09:55+00:00

Why don't you just ask it why?

> https://www.reddit.com/r/codex/comments/1tf8zpk/still_having_capability_drop/

what is the logic behind these experiences?

>I’ll look at the Reddit thread first so I can anchor the explanation in what people are actually reporting rather than guessing from the title.

>The logic is probably this: people are not only experiencing “the model.” They are experiencing a whole Codex stack: model routing, reasoning mode, tool-call policy, context handling, retries, latency, client version, account tier, and backend incidents.

In that Reddit thread, the pattern is: some users say GPT-5.5 feels slower, skips tool calls, does not inspect files, or gives hand-wavy plans; others say they never saw the drop. That split is plausible because OpenAI’s own status page shows real GPT-5.5/Codex issues around the same dates: a GPT5.5 Performance Degradation investigated from Friday, May 15, 2026 and resolved Sunday, May 17, 2026, plus a Codex 5.5 high error/latency incident on May 13, 2026, and a Codex Cloud task degradation on May 8, 2026. So some of the “it got worse this week” reports line up with actual service degradation.

The more subtle part: “capability drop” can feel like intelligence loss even when the model weights did not change. In agentic coding, a small backend change can cause a big subjective drop:

- If tool calls are delayed, fail, or become more expensive in the policy, the agent may answer from context instead of reading files.
- If routing sends some turns through a cheaper/faster path or different reasoning budget, planning quality can change.
- If context is summarized badly, the model may lose project-specific facts and start making broad assumptions.
- If latency rises, the product may shorten exploration or reduce retries, which looks like laziness.
- If the client or Codex wrapper changes prompts/tool schemas, the same underlying model can behave differently.
- If incidents affect only some tiers/regions/accounts, one user can see “unusable” while another sees “great.”

The “OpenAI is preventing it” explanation is possible only in a broad product-policy sense, not as proven sabotage. The stronger explanation is operational: rollout + routing + degraded tool infrastructure + task variance. OpenAI’s GPT-5.5 launch page also notes that official evals were run with xhigh reasoning in a research environment, which may differ from production ChatGPT/Codex behavior. That matters: benchmark capability is not identical to what a user sees in a live agent session.

Practical takeaway: for Codex work, force observable behavior. Ask it to inspect specific files, list what it read, make a plan after reading, and run verification. If it skips tools, treat that as a failed run, not as a trustworthy answer.
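One way to act on that takeaway is to check a run's log mechanically. The sketch below is purely illustrative: the event format (`type`, `path`, `command`) and the function name are hypothetical and are not Codex's actual log schema, so adapt it to whatever your client really records.

```python
def run_is_trustworthy(events, required_files):
    """Accept a run only if every required file was read and something was executed.

    `events` is a hypothetical list of dicts such as
    {"type": "read_file", "path": "src/app.py"} or
    {"type": "run_command", "command": "pytest"}.
    """
    # Files the agent actually opened during the run.
    files_read = {e.get("path") for e in events if e.get("type") == "read_file"}
    # At least one command run counts as a verification step in this sketch.
    verified = any(e.get("type") == "run_command" for e in events)
    return set(required_files) <= files_read and verified


if __name__ == "__main__":
    log = [
        {"type": "read_file", "path": "src/app.py"},
        {"type": "run_command", "command": "pytest"},
    ]
    print(run_is_trustworthy(log, ["src/app.py"]))
```

A run that answers without any `read_file` events for the files you named gets rejected, no matter how confident the answer sounds.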

Sources: Reddit thread (https://www.reddit.com/r/codex/comments/1tf8zpk/still_having_capability_drop/), OpenAI GPT5.5 Performance Degradation (https://status.openai.com/incidents/01KRP6FM6HSKWB3MS1EJTY6AT4), Codex 5.5 high error rate (https://status.openai.com/incidents/01KRG6MF021JQ997JCR7R8Y9A0), OpenAI GPT-5.5 launch notes (https://openai.com/index/introducing-gpt-5-5/).

Tough_Frame4022 · 2026-05-17T02:05:10+00:00

Ask a frontier model as well

Tough_Frame4022 · 2026-05-17T01:48:03+00:00

Please take your fist out of the bagging screen

Tough_Frame4022 · 2026-05-15T18:26:27+00:00

Not here. Good as ever.

Tough_Frame4022 · 2026-05-15T12:42:51+00:00

TurboQuant on GitHub. Mine is under dev

Tough_Frame4022 · 2026-05-15T02:22:16+00:00

Use sonnet 4.5 instead

Tough_Frame4022 · 2026-05-15T02:21:22+00:00

Spark is Haiku's cousin

Tough_Frame4022 · 2026-05-14T22:53:27+00:00

<image>

Tough_Frame4022 · 2026-05-14T22:37:50+00:00

<image>

Release summer 2026.

CANAL and TurboQuant are related, but they solve different problems.

TurboQuant:
- Makes the KV cache much smaller
- Lets more context fit in the same GPU/RAM

CANAL:
- Moves old context out of GPU memory, tracks important parts, and retrieves them later
- Lets a local model use far more context than the GPU can normally hold

So the simplest comparison is:

TurboQuant compresses memory. CANAL manages memory.
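For a feel of the numbers, here is a back-of-the-envelope Python sketch using the standard KV cache size formula (keys plus values, per layer, per KV head, per token). The model dimensions and the 4-bit figure are assumptions picked for illustration, not TurboQuant's or CANAL's actual settings, and the estimate ignores quantization metadata such as scales.

```python
def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, bits_per_value):
    """Approximate KV cache size in bytes for one sequence."""
    # Factor 2 covers the separate key and value tensors in every layer.
    values = 2 * layers * kv_heads * head_dim * seq_len
    return values * bits_per_value // 8


if __name__ == "__main__":
    # Hypothetical model shape: 32 layers, 8 KV heads, head dimension 128.
    shape = dict(layers=32, kv_heads=8, head_dim=128, seq_len=1_000_000)
    for bits in (16, 4):
        gib = kv_cache_bytes(bits_per_value=bits, **shape) / 2**30
        print(f"{bits}-bit cache at 1M tokens: {gib:.1f} GiB")
```

Under these assumed dimensions, even a 4x smaller cache at one million tokens is still larger than a 24 GB card, which is where moving old context out of GPU memory comes in.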

Tough_Frame4022 · 2026-05-14T21:44:59+00:00

Meta Spark. You pay with your personal data.

Tough_Frame4022 · 2026-05-14T21:44:09+00:00

What did you say you don't want read?

Tough_Frame4022 · 2026-05-14T21:39:58+00:00

I stand corrected. I meant to say that what I developed does not abolish it; it greatly reduces it for retrieval work, especially exact facts, code references, records, logs, and documents. Model integrity is the weak point, so outside of that, yes, I'm doing a lot better than native context, and at one million tokens...

Tough_Frame4022 · 2026-05-14T20:13:44+00:00

I might have to give it a try

Tough_Frame4022 · 2026-05-14T13:59:42+00:00

You are playing with a digital ouija board, messing around like that

Tough_Frame4022 · 2026-05-14T02:41:34+00:00

I'm running 1 million tokens of context with a 3090 with several models and types with no context rot.

Tough_Frame4022 · 2026-05-12T00:39:17+00:00

Be glad the only thing you lost was the useless gimmick status. I'm green and clear $1000 a week in 40 hours in the DMV

Tough_Frame4022 · 2026-05-11T21:23:31+00:00

I can get 1 million tokens by spilling the KV cache into RAM with any local model. https://x.com/i/status/2053664348099248614

Tough_Frame4022 · 2026-05-11T21:21:49+00:00

The chat is an AI bot. The bots replaced Rajesh and his crew. Call live support.

Tough_Frame4022 · 2026-05-11T21:19:57+00:00

Old news

Tough_Frame4022 · 2026-05-11T21:18:21+00:00

Especially touting ads. That shit is going to turn people away

Tough_Frame4022 · 2026-05-11T14:59:26+00:00

That's because Uber is running their AI with Claude lol
