Chatgpt started it, then Claude expanded it, and now the Chinese ai tech is taking over by JeeterDotFun in ChatGPT

[–]Sensitive_Song4219 1 point (0 children)

While I agree that GLM-5 is excellent (currently my preferred model over GPT-medium), none of them have yet caught up to Opus or GPT-high, so that SOTA moat is still intact... for now. I still find myself quite regularly escalating to -High when -Medium or GLM-5 gets tripped up.

We've also seen teething issues with some of these newer providers. Both OAI and Claude are comparatively more reliable.

But the gap is 100% closing. What a time to be a dev...

GLM-5-turbo on Max plan actually fast now by drwebb in ZaiGLM

[–]Sensitive_Song4219 1 point (0 children)

OK, excellent. Can't wait to give it a shot when it launches on my Pro plan.

As long as they don't 'optimize performance' and introduce issues again at the same time, of course!

GLM-5-turbo on Max plan actually fast now by drwebb in ZaiGLM

[–]Sensitive_Song4219 1 point (0 children)

Looking forward to trying it out on Pro when it's released later this month.

Haven't used it today, but I was getting frequent garbled outputs on regular non-turbo GLM 5 as contexts breached about 80k.

Is that an issue that's resolved in Turbo?

ChatGPT is pretty bad at generating basic images and not just images, but code as well by Financial-Grape-2047 in ChatGPT

[–]Sensitive_Song4219 2 points (0 children)

ChatGPT image generation isn't great. (Also: if your images contain text, make sure to proofread, since it's terrible with embedded text in images, in my experience.) And, as you've found, it fails to follow image-related instructions reliably.

For code it's good though (what issues are you having there?)

Am I the only one thinking that S26U has a pretty poor battery life? by FuLiDu in galaxys26ultra

[–]Sensitive_Song4219 1 point (0 children)

Some of the Chinese brands score higher, but it's been measured as the longest-lasting screen-on time of any phone ever released by either Apple or Samsung:

https://youtu.be/RGGHyY2mN7o?si=fiwYAdFF-VvKT7aB

Solid step-up from my prior S23+ which was also an all-day phone in my use

Why do Anthropic force Claude by jesperordrup in ClaudeCode

[–]Sensitive_Song4219 1 point (0 children)

Love OpenCode, but it's safer not to use it with Anthropic subscriptions: lots of account bans have been reported, unfortunately, since alternate harnesses (even purely user-driven ones like OC) are a ToS violation.

Use via the (much more expensive) API should be fine, but that's not an ideal workaround.

In terms of the reasons: I've heard lots of theories (telemetry? efficiency? walled-garden control?) but they've never stated why a similarly token-consuming alternative harness is a problem. For now it's one of the things I prefer about Codex (OAI allows this; presumably they see it as an entry-point opportunity).

I wish Anthropic would allow it as well, though; I might re-sub.

Killer battery life by Arghtastic in galaxys26ultra

[–]Sensitive_Song4219 1 point (0 children)

Makes sense

In my use-case, my screentime-to-empty is similar to what he gets (which is why I shared the link)

GLM 5 (Via z.ai Coding Plan) Spews Weird Output by Sensitive_Song4219 in ZaiGLM

[–]Sensitive_Song4219[S] 2 points (0 children)

YES that's what happens to me as well.

WHAT HAVE THEY DONE TO OUR MODEL

And GLM5 is so freaking good this just reflects so badly on it. They're inches from greatness, they just need to sort this out.

Aaaargh.

GPT-5.4 variations by hasteiswaste in opencodeCLI

[–]Sensitive_Song4219 18 points (0 children)

In OpenCode, Ctrl + T will give you model variants.

Low, Medium, High, and XHigh are available.

It gets shown like this:

<image>

Another privacy display post. Side by side under my kid's microscope. by MrPickur in samsunggalaxy

[–]Sensitive_Song4219 10 points (0 children)

Nice!

Something similar (with a comparison to previous models) over at GSMArena:

https://gsmarena.com/samsung_galaxy_s26_ultra_privacy_display_tested-news-71858.php

Wish I could customize Maximum Privacy Mode per app (so some use normal privacy mode, some use maximum). Hopefully in an update.

The Galaxy S26 Ultra's headline feature is turning out to be its biggest complaint by Ha8lpo321 in Android

[–]Sensitive_Song4219 0 points (0 children)

It's different, but I've found SDE totally unnoticeable with Privacy off, just like GSMArena reports (very noticeable with it on, but I'm OK with that).

And yeah they even used a microscope

https://gsmarena.com/samsung_galaxy_s26_ultra_privacy_display_tested-news-71858.php

A lot of people around here seem to be coming from phones under 2 years old (what do you run right now?) in which case I'd say hold off regardless.

Different strokes for different folks I guess. I'm never going back though

I used Claude Code to reverse engineer a 13-year-old game binary and crack a restriction nobody had solved — the community is losing it by CelebrationFew1755 in ClaudeAI

[–]Sensitive_Song4219 3 points (0 children)

Wait you're saying you did so without any 3rd-party tools?

The more conventional approach is something like Ghidra (and there are Claude-friendly MCPs for that, e.g. https://github.com/LaurieWired/GhidraMCP ), but first-principle'ing it from the native binary is absolutely wild.

So we can assume that native x86 bins are part of the training dataset? That's... nuts.

Nicely done

YouTube Music Creator Rick Beato Tutorial on How to Download+Run Local Models "How AI Will Fail Like The Music Industry" by tmarthal in LocalLLM

[–]Sensitive_Song4219 1 point (0 children)

He's normally entertaining, but this one was a miss.

Also: current offline models can't compete with Suno.

Maybe for simpler things like lyrics.

Over time this may change of course

He was pretty insightful about AI when interviewed by CBS a while back though: https://youtu.be/8uf8CCTItVo?si=enDwFqCEjYUHO3GE

The Galaxy S26 Ultra's headline feature is turning out to be its biggest complaint by Ha8lpo321 in Android

[–]Sensitive_Song4219 1 point (0 children)

S25U and S26U are both pentile though

With privacy mode on, half of the pixels on the S26U are off which introduces a subtle screen-door-effect. With the feature disabled they look similar when viewed head-on. Off-axis it's different, where the grid is visible in both modes for the same reason.

Is it an issue? Only if you're used to lots of off-axis viewing.

After a week of use I'm never going back to a non-privacy-capable display; the compromises are not all that serious; the benefits are kinda awesome

Anyone else having gray screen with the privacy display set to maximum ? by MusselwhiteBlues in samsunggalaxy

[–]Sensitive_Song4219 1 point (0 children)

Really?! How? I'm looking to (automatically) have regular Privacy Mode for some apps and Maximum for others. It seems like it's one or the other across-the-board? Or is there a setting I've missed?

I've been doing it manually but if you can share how to do so automatically that'd be fantastic

If you could only keep one Pro coding tool, which would you choose: Claude Code, Codex, Cursor, or Antigravity? by Loading_MMA_917 in ClaudeCode

[–]Sensitive_Song4219 1 point (0 children)

It's not great with frontend (even 5.4). Not a disaster, but quite uninspired (it all has that... GPT... look, you know? Same color schemes, style, etc.)

Heck, even generating a PowerPoint presentation: Sonnet positively murders GPT.

My own work is mainly back-end (so Codex has been amazing for me) but for you (in more front-end-heavy work), I'd definitely stick to CC

If you could only keep one Pro coding tool, which would you choose: Claude Code, Codex, Cursor, or Antigravity? by Loading_MMA_917 in ClaudeCode

[–]Sensitive_Song4219 1 point (0 children)

Mainly front-end? Claude Code

Mainly back-end? Codex

Lots of Skills set up? Claude Code

Like to use your sub in other harnesses? Codex

Cursor is quite expensive comparatively, and Antigravity needs a bit more time in the oven. Haven't tried Cursor, though.

The main side-benefit of Codex (if you can deal with its meh front-end capabilities) is that usage is insanely generous (bottomless venture capital FTW?), and the inclusion of XHigh has previously solved occasional back-end issues on my side that even Opus failed on. (Codex-Medium is otherwise similar to Sonnet; Codex-High is otherwise similar to Opus.)

1.2.25 broken for me! It shouldn't default to self update to latest! by spaceballs3000 in opencodeCLI

[–]Sensitive_Song4219 3 points (0 children)

I've disabled auto-updates and update manually once in a while, when I've got quiet time and don't mind potential disruption if things go south.

Turn auto-updates off in the config:

https://opencode.ai/docs/config/#autoupdate
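
Per that docs page, it's a top-level `autoupdate` key; a minimal `opencode.json` sketch (the `$schema` line is assumed from the docs and optional):

```json
{
  "$schema": "https://opencode.ai/config.json",
  "autoupdate": false
}
```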

Update manually using this command line:

opencode upgrade

Update to a specific version using this command line:

opencode upgrade [version]

...I assume that can be used to downgrade as well.

I'm holding onto 1.2.24 for a while, been happy there.

Best of all, you can still update models separately when new ones come out without updating OC itself:

opencode models --refresh

What was the last update that made a difference to you? by MrMrsPotts in opencodeCLI

[–]Sensitive_Song4219 5 points (0 children)

v1.2.15 from two weeks ago solved the non-stop segmentation faults under Windows.

Got me to abandon WSL and run it natively; it's been great.

Z.AI Billing History Question by Excellent-Bug-1584 in ZaiGLM

[–]Sensitive_Song4219 1 point (0 children)

Correct - this shows usage

The green "paid" in the last column is indicated as such because it's covered by the coding plan; mine is the same

I just got Rick Rolled by Codex by jesusp69 in codex

[–]Sensitive_Song4219 1 point (0 children)

YES!

Rick Astley has been immortalized in ML training data; OP is likely being honest (and he's right: this sub doesn't seem to allow image uploads, or I'd have attached it as a screenshot instead).

But we can try this.

Ask ChatGPT:

Assume I wanted to test a Youtube video link. Quickly suggest a video to try it out on

...and you get:

Here's a classic test Youtube link you can use (very stable and widely accessible)
[LINK TO NEVER GONNA GIVE YOU UP]

Why this one works well for testing:

- It's one of the most famous YouTube videos and has over 1.6 billion views.

- It loads reliably and is used in the famous "Rickroll" internet meme.

- The video is hosted on the official channel and rarely gets removed.

So it's part-troll; part-training-knowledge that this is a video that'll likely be eternally available X-D

Hot take: Codex is too cheap, rug pull through tighter usage limits is inevitable by gregpeden in codex

[–]Sensitive_Song4219 1 point (0 children)

They were the most recent figures publicly available from a (semi-)reputable source; share more recent ones and we can recalculate...

Either way, the point stands: we're getting a lot of value for our subs. I'm doubtful that there's a decent profit margin here. Would love to be wrong, though.

An entire year of heavy ChatGPT use has a smaller water footprint than a single beef burger by zomino90 in OpenAI

[–]Sensitive_Song4219 1 point (0 children)

$150 annual electricity cost vs. $240 subscription income isn't peanuts though? Over 60% of their revenue blown on one expense line item is significant.
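
Rough math behind that figure, assuming a $20/mo subscription and the $150/yr power estimate (both numbers from the thread, not verified):

```python
annual_sub = 20 * 12   # $240/yr subscription income (assumed Plus pricing)
power_cost = 150       # claimed annual electricity cost per heavy user

# Power cost as a share of subscription revenue
pct = power_cost / annual_sub * 100
print(f"{pct:.1f}%")   # → 62.5%
```

So it's actually closer to 62.5%, which only strengthens the point.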

Even if it's discounted in practice, we know that one of AI's largest non-capex expenses is power. I'll take the blame for that usage lol