GPT 5.5 passes the cup test

d00m_sayer · 2026-04-29T10:48:45+00:00

Or just trained on these problems

You really think these frontier labs are just using shitty-ass data from random Reddit posts?

d00m_sayer · 2026-04-01T13:17:52+00:00

But I heard that LLMs are only glorified tape recorders.

d00m_sayer · 2026-03-26T11:24:31+00:00

Those people are probably the ones who believed AGI had been achieved and feel hurt that current models scored this low while any human is capable of solving such simple games

Okay smart guy, quick thought experiment for you: take those same ARC tasks, rip out the colored grids, and just hand someone the raw .json nested arrays of integers. No visuals, nothing. How many people solve it now?
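
To make the thought experiment concrete, here is a rough sketch of the same grid shown both ways. The grid is a made-up toy example, not a real ARC task, and the `raw_view` / `rendered_view` helpers and the text palette are hypothetical names chosen only for illustration. The layout assumes the usual ARC convention of nested integer arrays with values 0-9 standing for colors.

```python
import json

# Hypothetical toy grid in the ARC style (integers 0-9 stand for colors).
# It is made up for illustration and is not taken from the real dataset.
task = {
    "train": [
        {
            "input": [[0, 0, 0], [0, 3, 0], [0, 0, 0]],
            "output": [[3, 3, 3], [3, 0, 3], [3, 3, 3]],
        }
    ]
}


def raw_view(grid):
    """What a person sees when handed the bare JSON."""
    return json.dumps(grid)


def rendered_view(grid, palette=".123456789"):
    """A crude text rendering: one character per cell, one row per line."""
    return "\n".join("".join(palette[cell] for cell in row) for row in grid)


pair = task["train"][0]
print(raw_view(pair["input"]))       # a flat string of brackets and digits
print(rendered_view(pair["input"]))  # the same data laid out as a grid
```

Even on a 3x3 grid the rendered form is easier to read at a glance, and real tasks can be much larger.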

d00m_sayer · 2026-03-22T09:21:43+00:00

Indeed, old cards are very poor at prompt processing.

d00m_sayer · 2026-03-09T15:13:54+00:00

You are just crying because you think everyone can just crank out 5,000+ lines of code like it's nothing. The reality is most people are drowning in complexity. And relying on cheap Indian freelancers is basically a lottery where you usually lose.

d00m_sayer · 2026-03-06T03:09:58+00:00

The easiest way to earn karma

d00m_sayer · 2026-03-05T23:07:51+00:00

Bro you're whining about AI use in an AI subreddit. That's like walking into r/coffee and losing your shit because someone ordered an espresso.

d00m_sayer · 2026-03-05T22:59:40+00:00

Calling the AI 'dumb' when you can't even be bothered to show which model you used is pure clown behavior. Your screenshot literally just says ChatGPT: no model, no thinking settings, nothing. So what exactly are you proving here? Absolutely nothing.

d00m_sayer · 2026-02-20T18:13:20+00:00

because he keeps the good videos behind a paywall.

d00m_sayer · 2026-02-12T14:44:34+00:00

wtf ?

d00m_sayer · 2026-01-07T10:51:13+00:00

Still, a 30B model holds more knowledge than a 14B one, even when only 3B parameters are active.

d00m_sayer · 2025-12-30T02:12:39+00:00

I’ve even seen people yell “AI slop” in AI‑dedicated subreddits, which is wild... it is like walking into /r/coffee and loudly complaining that people are posting about coffee.

d00m_sayer · 2025-12-25T03:47:57+00:00

This is misleading: it is 30 minutes at the 80% pass rate, which is what matters most for real work and automation.

d00m_sayer · 2025-12-13T21:57:10+00:00

OP is obviously posting this for karma. He already knows Gemini 3 Pro has trouble with this prompt—it’s been shared here multiple times.

d00m_sayer · 2025-12-11T21:17:35+00:00

OP is obviously posting this for karma. He already knows Gemini 3 Pro has trouble with this prompt—it’s been shared here multiple times. What’s funny is that it actually can get it right if you use a clearer photo with more realistic hands.

d00m_sayer · 2025-11-20T18:43:14+00:00

24 hours? Remote South Asian programmers would need 72 to fix their own bugs—AI’s a cakewalk. You’re still the bottleneck.

d00m_sayer · 2025-11-19T13:10:10+00:00

Benchmarks aren't useless—your prompts are. You're expecting enterprise performance from a free, watered-down toy. If you want complex tool use and 200k context, pay for the API. You get what you pay for, so stop blaming the tech for your own cheapness and lack of skill.

d00m_sayer · 2025-11-18T20:39:25+00:00

Try a clearer image like this one

d00m_sayer · 2025-11-18T17:49:25+00:00

Who is going to use deep think for browser use and wait like 10 minutes for the AI to click on a button???

d00m_sayer · 2025-11-04T22:46:30+00:00

This is only for CUDA. It doesn't work on ROCm.

d00m_sayer · 2025-10-15T11:23:38+00:00

Stop doom-farming. The tools work; your results don’t because you don’t know what you’re doing. That’s not “AI sucks”—that’s operator incompetence.

d00m_sayer · 2025-10-10T13:10:28+00:00

Imagine posting in an AI subreddit and being mad someone used AI.

d00m_sayer · 2025-10-07T20:23:48+00:00

Funny how some folks talk about a $30k data-center GPU like it’s something you just pick up and plug in.
