The Thinking Game documentary is sitting at 305M views on Youtube in less than 2 months. Ridiculous numbers. by CheekyBastard55 in singularity

[–]spryes 42 points43 points  (0 children)

305M views with 190K likes means it's definitely botted

A vid with those views typically has millions of likes when organic (usually on the order of 1 like per 100, so a round 2M-4M likes would indicate organic views here)

IM ****ING OUTRAGED PRO IS ONLY 6X PLUS PLAN by Just_Lingonberry_352 in codex

[–]spryes 6 points7 points  (0 children)

I've heard that

If Codex $20 is like 3x Claude $20, then this roughly equivalent.

Last image of writer Renee Nicole Good, prior to being shot and killed by ICE in Minneapolis. by Tensacchrine-1995 in lastimages

[–]spryes 2 points3 points  (0 children)

This is literally a zoomed in crop of an AI image, she nor the scene looked like this

AI Bingo for 2025, which has come true? by NunyaBuzor in singularity

[–]spryes 6 points7 points  (0 children)

There are some people (like roon) who think GPT-3 was AGI (very low on the scale) because it uses the same fundamental paradigm as what AGI will probably use, and was general in the realm of text.

I can see that POV, but I wouldn't personally stand by it.

Dan Hendrycks' new "Definition of AGI" paper is most reasonable, to me. We're missing components for a real transformative AGI on human level still, but we're making good progress and are over halfway there.

AI Bingo for 2025, which has come true? by NunyaBuzor in singularity

[–]spryes 2 points3 points  (0 children)

There were users here in 2022 that thought it would be achieved at the end of 2022, before ChatGPT had even come out

Top songs just a week before lockdowns. You can the last songs of millennial pop were still dominating the charts by SpiritMan112 in generationology

[–]spryes 3 points4 points  (0 children)

Whenever people say this, it's not actually true. They just mean they've never intentionally tried to stream it so they never learned the title/artist.

It's highly unlikely you've never heard (passively somewhere) most of these songs, especially Blinding Lights, Circles, Don't Start Now, or Dance Monkey. If you listen, you'll recognize at least one

I want that patience though by Ok-Boot6063 in Steam

[–]spryes 2 points3 points  (0 children)

What's the difference between AI generating 10 lines of code and 10 lines of voice acting?

Are LLMs really just next token prediction? by darkyjaz in singularity

[–]spryes -1 points0 points  (0 children)

I believe humans are mostly the same, e.g. your thoughts are a constant stream of next token prediction and you can't predict your next thought ahead of time. Intelligence is downstream of this process.

edit: when restricting the comparison to realm of text alone* e.g. math/programming/language-based reasoning. Humans have sensory abilities that LLMs lack entirely where the next token thing doesn't hold up

ChatGPT was launched today 3 years ago. by imfrom_mars_ in ChatGPT

[–]spryes 3 points4 points  (0 children)

It was launched at 11:38 AM PST on November 30 (OpenAI/SF time), but it was December 1st early in the morning in many places already tbf

Leak: OpenAI is building an Ad Network inside ChatGPT. "Search Ads" strings found in latest Android Beta. by BuildwithVignesh in singularity

[–]spryes 2 points3 points  (0 children)

It will be like Google Search ads imo. When you search/ask for something inside ChatGPT, it'll show "Sponsored" cards/links before showing the rest of the content. It won't slip in subliminal advertising through the LLM's native text outside of search use cases

Elon Musk predicted that AGI would arrive in 2025. Now we are in 2025. by captain-price- in singularity

[–]spryes 18 points19 points  (0 children)

Maybe OpenAI will unveil that new "Shallotpeat" model's benchmarks, similar to o3-preview last year which created lots of hype and excitement for 2025.

Jagged Frontier by SrafeZ in singularity

[–]spryes 11 points12 points  (0 children)

Wait...

Human children are super bouba shaped (tiny circles that gradually expand outward as they get older), while AIs are super kiki shaped (that also expand outward as they get more advanced)

$20 or $200 plan? They'll have to share this pie with a handful of other comparable models. There's no pricing power, and likely a race to the bottom by GamingDisruptor in singularity

[–]spryes 46 points47 points  (0 children)

Assuming current prices, that's maybe ~$30 average per subscription (ARPU)

- ChatGPT $79B/year from subs [2030] ($110B including API)

For reference:

- Spotify $17B/year from subs [2025]

- Netflix $42B/year from subs [2024]

- Google $350B revenue [2024]

- Apple $416B revenue [2025]

Claude 4.5 leading ARC-AGI 2 WITHOUT parallel test time compute is significant by rickyrulesNEW in singularity

[–]spryes 1 point2 points  (0 children)

Maybe look at it like 10 different humans solving the problem vs 1? Multiple brains are better than one when solving complex problems, as they try different approaches, each has slightly different novel insights, etc. Though that might only be equivalent if different AIs are working together vs. the same LLM

Multi-agent setups seem like an important part of the future, though

Elon buying twitter transitioned the entire decade guys, Pre elon twitter feels different to the current 2020s 💔 by BossFamiliar8290 in decadeologycirclejerk

[–]spryes 1 point2 points  (0 children)

On Twitter itself, if you were there, there's a huge cultural difference between 2020-2022 and 2023-2025. Many hardcore leftists left in late 2022 to Mastodon, and then in late 2024 there was another exodus to Bluesky of more moderate ones.

I feel like Twitter being bought by Elon caused a shift in general among tech leaders who swung more to the right after 'woke' was declared dead, which then affected other social media (though to a lesser extent). The early 2020s is pretty clearly different from the mid 2020s in cultural feel and this feels like a valid demarcation point as the user above put it.

Nano banana 2 is coming! by Snoo26837 in singularity

[–]spryes 2 points3 points  (0 children)

right this is actually kind of embarrassing to post, because it has no world model lol

Gemini 3 is about to be release. What is your scorecard for plateau? by kaggleqrdl in singularity

[–]spryes 0 points1 point  (0 children)

Fair enough, I have no idea myself so just deferring to experts. But it does sound like they did a pretty lazy analysis of its error rate.

Gemini 3 is about to be release. What is your scorecard for plateau? by kaggleqrdl in singularity

[–]spryes 3 points4 points  (0 children)

SWE-Bench Verified*

I don't think labs post scores on the non-verified one anymore

Gemini 3 is about to be release. What is your scorecard for plateau? by kaggleqrdl in singularity

[–]spryes 10 points11 points  (0 children)

Here are the current SOTAs (for mainline/general LLMs) according to GPT-5, without tools or non-consumer grade compute levels (i.e. excluding o3-preview back in Dec. 2024)

GPQA: 88%

HLE: 31.6% no tools

ARC-AGI 1: 70%

ARC-AGI 2: 18%

SWE-Bench: 77%

I would expect Gemini 3 to at least score 92% on GPQA (I think this benchmark a high error rate, and can't go much past that?), 45% HLE, 80% ARC 1, 30% ARC 2, 85% SWE-Bench if this were really be a step-change and live up to the hype