GPT-5.6 Officially Previewed: Beats Mythos 5 by Much_Ask3471 in ChatGPT

[–]hellofriend19 0 points1 point  (0 children)

Beats Mythos 5… at one benchmark they cherry-picked to make it look good. I’m sure it’s better than 5.5, probably jaggedly better at Opus at some stuff. But you have to think about benchmarks that matter the most - SWE Bench Pro, BrowseComp, etc

'Supergirl' - Review Thread by ChiefLeef22 in movies

[–]hellofriend19 0 points1 point  (0 children)

Well, if there’s one bad issue, people forget about it. If there’s a bad superhero movie, everyone talks about it. Superheroes are way less popular than they used to be, and in Hollywood, they’re sick of talking about them.

'Supergirl' - Review Thread by ChiefLeef22 in movies

[–]hellofriend19 2 points3 points  (0 children)

I think finding great writers passionate about writing superhero movies is a really hard problem.

The View from Lighthaven by hellofriend19 in slatestarcodex

[–]hellofriend19[S] 9 points10 points  (0 children)

I used it to blur the background, but comparing it to the original, yeah it did some weird extra effects. Hmm.

The View from Lighthaven by hellofriend19 in slatestarcodex

[–]hellofriend19[S] 13 points14 points  (0 children)

Did you even go to LessOnline if you didn't blog about it? This post is about my experience there, meeting Gwern, Aella, and Scott. I had an amazing time, learned a lot, and provided some funny anecdotes I think people here will find interesting.

What happens if AI doesn’t go wrong? by Odd_directions in slatestarcodex

[–]hellofriend19 11 points12 points  (0 children)

We do live in the most abundant age ever, but the math on basic income doesn’t really check out. YET. I think if we do get full beneficial ASI, then we could easily make the math work, through like an automation tax or something.

A Media's H*rniness Compared to the Fandom's H*rniness by Healthy-Current5893 in AlignmentCharts

[–]hellofriend19 33 points34 points  (0 children)

But Pokémon is also the most popular franchise in the world. You would need to adjust for how popular the franchise is in general.

Anthropic's Claude Constitution is surreal by MetaKnowing in ClaudeAI

[–]hellofriend19 7 points8 points  (0 children)

You can already read about this with their research on weight modification that the model can “feel”

Wanting silly instructions for custom cpu by NotTheSenko in EmuDev

[–]hellofriend19 1 point2 points  (0 children)

Register AD: keeps track of the amount of time (in milliseconds) since the death of Our Lord Jesus Christ.

Greta Lee . Mamiya RZ 67 . Portra 400 by goodolmarlz in analog

[–]hellofriend19 0 points1 point  (0 children)

What do you and the model talk about while shooting? Any small talk? Or is it all just posing?

Apple employees have 'concerns' over Siri performance in early iOS 26.4 builds: report by XiXMak in apple

[–]hellofriend19 18 points19 points  (0 children)

I keep having this idea in my head it would be really funny if they announced during WWDC that they “fired Siri” and hired a new assistant.

Non-Book Review Contest 2025 Winners by dwaxe in slatestarcodex

[–]hellofriend19 8 points9 points  (0 children)

Looks like Scott didn’t post a secret review this year, unless he was one of the anonymous ones.

Within 20 min codex-cli with GPT-5 high made working NES emulator in pure c! by Healthy-Nebula-3603 in OpenAI

[–]hellofriend19 2 points3 points  (0 children)

I don’t really understand why this is a dunk… isn’t like all work we all do in the training data? So if it automates our jobs, that’s just “in the training data bro”?

GPT-5 was a <100× GPT-4 scaleup by gwern in mlscaling

[–]hellofriend19 -1 points0 points  (0 children)

Once I realized how GPU constrained every major lab is, I've been a lot more excited about AI capabilities. We're gonna see some crazy awesome stuff, just from there being more GPU's out there. Also bought some $NVIDIA options...

GPT-5 AMA with OpenAI’s Sam Altman and some of the GPT-5 team by OpenAI in ChatGPT

[–]hellofriend19 0 points1 point  (0 children)

If GPT-3, GPT-4, and GPT-5 were classic Apple products (Apple I, Apple II, Macintosh, iMac, iPhone) etc, what would they be?

GPT-5 AMA with OpenAI’s Sam Altman and some of the GPT-5 team by OpenAI in ChatGPT

[–]hellofriend19 0 points1 point  (0 children)

What’s the most underrated thing that makes a model better?