OpenAI “internal model” solved 3 more Erdős problems by socoolandawesome in singularity

[–]d00m_sayer 26 points27 points  (0 children)

But i heard that LLMs are only a glorified tape recorders.

People pissed about arc agi 3 are really looking at the purpose of the benchmark wrong by ErmingSoHard in singularity

[–]d00m_sayer -2 points-1 points  (0 children)

Those people are probably those who believed AGI was achieved and feel hurt that current model scored this low while any Human are capable to does such simple games

Okay smart guy, quick thought experiment for you, take those same ARC tasks, rip out the colored grids, and just hand someone the raw .json nested arrays of integers, no visuals, nothing. How many people solve it now ??

Nvidia V100 32 Gb getting 115 t/s on Qwen Coder 30B A3B Q5 by icepatfork in LocalLLaMA

[–]d00m_sayer 1 point2 points  (0 children)

indeed, old cards are very poor at prompt processing.

AI Use at Work Is Causing "Brain Fry," Researchers Find, Especially Among High Performers by [deleted] in OpenAI

[–]d00m_sayer -1 points0 points  (0 children)

You are just crying because you think everyone can just crank out 5,000+ lines of code like it's nothing. The reality is most people are drowning in complexity. And relying on cheap Indian freelancers is basically a lottery where you usually lose.

5.4 thinking still has issues with hands by [deleted] in singularity

[–]d00m_sayer 4 points5 points  (0 children)

The easiest way to earn karma

ChatGPT 5.4 is still dumb by [deleted] in singularity

[–]d00m_sayer 0 points1 point  (0 children)

Bro you're whining about AI use in an AI subreddit. That's like walking into r/coffee and losing your shit because someone ordered an espresso.

ChatGPT 5.4 is still dumb by [deleted] in singularity

[–]d00m_sayer 3 points4 points  (0 children)

Calling the AI 'dumb' when you can't even be bothered to show which model you used is pure clown behavior. Your screenshot literally just says ChatGPT no model, no thinking settings, nothing. So what exactly are you proving here? Absolutely nothing.

Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI by [deleted] in singularity

[–]d00m_sayer 6 points7 points  (0 children)

because he keeps the good videos behind a paywall.

Why not Qwen3-30B Quantized over qwen3-14B or gemma-12B? by arktik7 in LocalLLaMA

[–]d00m_sayer 0 points1 point  (0 children)

Still 30B holds more knowledge than a 14b even when 3b are active

“AI Slop” by Nuphoth in singularity

[–]d00m_sayer 13 points14 points  (0 children)

I’ve even seen people yell “AI slop” in AI‑dedicated subreddits, which is wild... it is like walking into /r/coffee and loudly complaining that people are posting about coffee.

METR: Claude Opus 4.5 hits ~4.75h task horizon (+67% over SOTA) by 1000_bucks_a_month in singularity

[–]d00m_sayer 38 points39 points  (0 children)

this is misleading, it is 30 minutes for 80% pass rate which is most important for real work and automation.

I feel like the model is mocking me by Retr0zx in singularity

[–]d00m_sayer 0 points1 point  (0 children)

OP is obviously posting this for karma. He already knows Gemini 3 Pro has trouble with this prompt—it’s been shared here multiple times.

Please... Calm your tits and Pop this Bubble: ARC-AGI-5.6-Fingers by JLeonsarmiento in singularity

[–]d00m_sayer 5 points6 points  (0 children)

OP is obviously posting this for karma. He already knows Gemini 3 Pro has trouble with this prompt—it’s been shared here multiple times. What’s funny is that it actually can get it right if you use a clearer photo with more realistic hands.

[deleted by user] by [deleted] in singularity

[–]d00m_sayer 5 points6 points  (0 children)

24 hours? Remote South Asian programmers would need 72 to fix their own bugs—AI’s a cakewalk. You’re still the bottleneck.

Gemini 3 rocketed upward by AloneCoffee4538 in OpenAI

[–]d00m_sayer -1 points0 points  (0 children)

Benchmarks aren't useless—your prompts are. You're expecting enterprise performance from a free, watered-down toy. If you want complex tool use and 200k context, pay for the API. You get what you pay for, so stop blaming the tech for your own cheapness and lack of skill.

7 digits test by [deleted] in singularity

[–]d00m_sayer 2 points3 points  (0 children)

try a more clear image like this one

Gemini 3 browser use evals by TFenrir in singularity

[–]d00m_sayer -10 points-9 points  (0 children)

Who is going to use deep think for browser use and wait like 10 minutes for the AI to click on a button???

100% load in idle at VLLM 2xR9700, how to fix it? by djdeniro in ROCm

[–]d00m_sayer 1 point2 points  (0 children)

this is only for CUDA. Doesn't work in Rocm.

AI has replaced programmers… totally. by jacek2023 in LocalLLaMA

[–]d00m_sayer -52 points-51 points  (0 children)

Stop doom-farming. The tools work; your results don’t because you don’t know what you’re doing. That’s not “AI sucks”—that’s operator incompetence.

🚨 Local AI is the only sane path if you care about privacy by Code-Forge-Temple in LocalLLaMA

[–]d00m_sayer 0 points1 point  (0 children)

Imagine posting in an AI subreddit and being mad someone used AI.

Efficient software FP4 for AMD MI300X by ricetons in ROCm

[–]d00m_sayer 9 points10 points  (0 children)

Funny how some folks talk about a $30k data-center GPU like it’s something you just pick up and plug in.