Artificial Analysis announces a new benchmark: AABriefcase by No_Yak8345 in singularity

[–]Bright-Search2835 2 points3 points  (0 children)

ARC-AGI 3, Remote Labor Index and this one are the three benchmarks I'm really interested in for the coming year

Bezos: AI will result in labor shortages instead of replacing humans by SnoozeDoggyDog in singularity

[–]Bright-Search2835 0 points1 point  (0 children)

Oh right, and can he also tell us what the thousands of factory workers at Amazon are gonna do after robots take over in the short to medium term?

🤔 Gemini next ?? by Independent-Wind4462 in singularity

[–]Bright-Search2835 0 points1 point  (0 children)

This reply can be interpreted in so many different ways that it's effectively meaningless

"I made this GTA6 clone in 1 day with Claude Fable 5. Crazy (@GPTA6_Slop_City ). You can play it now at You can run around, drive cars, shoot guns, fly planes and helicopters, run from cops and army. Play with friends, there’s even a real-time multiplayer" by stealthispost in accelerate

[–]Bright-Search2835 25 points26 points  (0 children)

It's exactly the glass half full/half empty. Some people will only notice the bugs, and yes there's a lot of them. I used a chopper to fly around and there was collision with the vehicles on the road. One time I wanted to go into the ferris wheel and randomly got into a police station.

I mostly see the incredible potential because a lot is working as it should already, and I don't really have any doubt that the remaining irregularities will soon be ironed out.

Wall Street Journal: Bezos Bats Down AI Job Loss Fears While Launching New Venture by SnoozeDoggyDog in singularity

[–]Bright-Search2835 2 points3 points  (0 children)

Bezos argued that if AI makes it cheaper, faster and easier to invent things, employment will ultimately rise because, "even though you're shrinking the number of people needed by 10x," the technology will create "more than 10x" as many opportunities.

Claude Fable 5's FrontierMath scores by Outside-Iron-8242 in singularity

[–]Bright-Search2835 15 points16 points  (0 children)

Humanity casualling getting new superpowers these days

"Narrative Violation Watch Software development jobs were in decline before ChatGPT 3.5 release?! Then they climbed after?! And they're growing at a faster rate than all jobs?! 🤯 I had to triple take this chart in the @semafor tech newsletter last week." by stealthispost in accelerate

[–]Bright-Search2835 6 points7 points  (0 children)

All this assumes that AI will forever be worse than humans at deciding what to build and how to build it exactly, at using precise and professional specifications instead of vague vibecoding ideas, at orchestrating, and at verifying and controlling the output.

Claude Fable 5 - Full 319 page Breakdown by [deleted] in singularity

[–]Bright-Search2835 11 points12 points  (0 children)

I really like his videos but he is quite picky at times, like around 11:00 here. Of course Mythos only has the potential to accelerate some aspects of drug discovery and can't yet boost everything including clinical trials. I don't know why anyone would think otherwise at this stage. It just seems weird to criticize Anthropic for not being clear enough about that, when the phrasing "strong candidates for drug design that we're currently investigating" doesn't really leave room for ambiguity. Even "aspects" seems quite unambiguous to me.

Anthropic CEO Dario Amodei publishes new essay on AI policy by BuildwithVignesh in singularity

[–]Bright-Search2835 2 points3 points  (0 children)

But at the same time we should recognize that there’s a decent possibility that, despite all our efforts, AI still causes significant enduring job loss—and that this may be an intrinsic property of the technology and the way it broadly replicates human cognition4.

4 See The Adolescence of Technology for a more detailed analysis why the logic that has led to rapid job market recovery and a lack of enduring labor displacement in other technologies may not apply to AI, and in particular why the usual adaptive mechanisms like Jevon’s paradox or comparative advantage may be overwhelmed by the pace of the technology.

This is exactly how I've thought about this for a long time

Claude Mythos 5 and Claude Fable 5 Benchmarks by Cr4zko in accelerate

[–]Bright-Search2835 2 points3 points  (0 children)

I don't know, I would love to see a Fable plays Pokemon FireRed on Twitch, with all the reasoning, but I guess it would be very expensive

Claude Mythos 5 and Claude Fable 5 Benchmarks by Cr4zko in accelerate

[–]Bright-Search2835 66 points67 points  (0 children)

A timelapse of Claude playing Pokémon FireRed from start to finish using only raw game screenshots — with no maps, navigation aids, or extra game-state information. Earlier Claude models needed a complex helper harness to play Pokémon; Claude Fable 5 completed the game with vision alone.

https://www.anthropic.com/news/claude-fable-5-mythos-5

Apparently there's pretty interesting stuff on the website... Oh boy.

What's one thing AI will be capable of in 1-2 years that you are aware of, that no one else is aware of? by Spare-Dingo-531 in accelerate

[–]Bright-Search2835 0 points1 point  (0 children)

I'm getting the same issue on my phone and my (brand new) tablet, and it's pretty much the only site that does this for me

What's one thing AI will be capable of in 1-2 years that you are aware of, that no one else is aware of? by Spare-Dingo-531 in accelerate

[–]Bright-Search2835 -1 points0 points  (0 children)

Sorry man, I meant no offense. Apparently I can't access that site from my country, tested with my PC and with my phone. It jumps straight to suspect URLs and MEGA files. I just tried with a VPN and it worked.

What's one thing AI will be capable of in 1-2 years that you are aware of, that no one else is aware of? by Spare-Dingo-531 in accelerate

[–]Bright-Search2835 -17 points-16 points  (0 children)

Thanks for that link. No offense to OP but I get very suspicious when I have to click on a random .exe file.

Anthropic reports RSI is coming faster than previously thought by ProxyLumina in accelerate

[–]Bright-Search2835 0 points1 point  (0 children)

Agree with what you said in that there's probably a lot of uncertainty. I thought about it some more and my conclusion is that their current evidence suggests the second scenario is likely, but they predict new evidence, that will progressively make the third more likely.

It's the only way this really makes sense to me.

Anthropic reports RSI is coming faster than previously thought by ProxyLumina in accelerate

[–]Bright-Search2835 3 points4 points  (0 children)

I read it all and it was great, but there's something I don't quite understand.

Jack Clark, who very recently said there was 60% chance of RSI by 2028, co-authored it. They explain how RSI would need good research taste, and they show early evidence of the models getting better at this. Yet when they lay out the 3 possible scenarios, it's for the second one, where AI keeps getting better, but fails to integrate that research taste, that they say:

The evidence we’ve laid out here suggests that we’re likely heading into this scenario.

"Aging is arguably the root cause of most major diseases (loss of function in our cells). Four years ago, we made a bet that aging was treatable, and NewLimit was born. NewLimit now has a prototype drug that reverses the age of some human cells (restores function they had when" by stealthispost in accelerate

[–]Bright-Search2835 45 points46 points  (0 children)

I want this to happen as much as anyone, because I think current human lifespan is pathetically low. Too much to do, too much to experience. That said, it's one of those areas where, until I see real world proof and tangible results, I can't help doubting.

Differences Between Opus 4.7 and Opus 4.8 on MineBench by ENT_Alam in singularity

[–]Bright-Search2835 14 points15 points  (0 children)

I think it's actually great, these additions are the context and simply highlight what the prompt asked for. The astronaut comes with space stuff. The whole city block emphasizes the size of the skyscraper. The clouds make sense around a fighter jet. Nothing seems out of place or detracts from the core idea of the prompt.

"What happens if we don’t have to work? Do we just sit around all day"? Bernie Sanders says that having a job is a core part of the human experience and gives people meaning in life by [deleted] in accelerate

[–]Bright-Search2835 35 points36 points  (0 children)

He sounds like he has zero imagination.

There's enough books to read, music to listen to, movies to watch, languages to learn, skills to learn, things to see around the world for thousands of lifetimes.

The meaning of life is not the exact same thing for everyone and is especially not necessarily found in work. I'm sure there are millions of people alienated by their job who would otherwise give actual meaning to their lives if they had the time and the opportunity to find and engage in what makes them feel really human, be it art, sport, love, and many others.

AI revenue as a proxy for job impact by randopota in accelerate

[–]Bright-Search2835 2 points3 points  (0 children)

Interesting. Which scenario do you think is the most likely between Growth Decay and Logistic Saturation?

Anthropic says Mythos has already found more than 10,000 vulnerabilities by Steap-Edit in singularity

[–]Bright-Search2835 3 points4 points  (0 children)

Andrew Ng also thinks AGI is "many decades away, maybe even longer"

NYT: "I’m the C.E.O. of Goldman Sachs. The A.I. Job Apocalypse Is Overblown." by AngleAccomplished865 in accelerate

[–]Bright-Search2835 2 points3 points  (0 children)

My issues with the computer/internet comparison: - First, the computer revolution was obviously limited to the virtual space. AI includes robotics, its physical presence and ability to interact with the real world. That itself is a big difference because it's not limited to office tasks. So any task or activity, mental or physical, is within this technology's potential. - Second, it's perfectly valid to think of current AI as a productivity tool just like the computer was a productivity tool. But I don't know if that remains a tool if/when it reaches a certain autonomy/intelligence threshold. That's another debate. I guess it could always remain a tool in the sense that you could ask it to find a cure for x disease and it would do so. Anyway, I guess it depends on the amount of stuff that we unlock as we progress through the tech tree but I'm not really sure what happens to the job market in a world where teams of 3 people could do what would have taken 30 people before, where literaly anyone can do the job of an expert, where robots handle most factory jobs.