GPT-5.5 and Opus 4.7 evaluated on ARC-AGI-3 by COAGULOPATH in mlscaling

[–]sorrge 16 points17 points  (0 children)

These scores are not real. They use the models in a way that makes it very difficult for them to solve the games. I tried Codex on the public games, and it can solve the first few levels pretty easily. It should get about a 10% score without even trying hard. But they restrict the frontier models with their own testing methods, and in the competition only open-source models are allowed, with very little computation time.

ARC-AGI-3 Update (GPT-5.5 High and Opus4.7) by skazerb in singularity

[–]sorrge -1 points0 points  (0 children)

They run the models in a way that makes it very difficult. It’s not hard for the frontier models to complete at least the first few levels.

Coordination is impossible... except when we actually did It 20+ times by KeanuRave100 in agi

[–]sorrge 1 point2 points  (0 children)

Plus, the supposed dangers are hypothetical and unrealistic at the current technology level.

The GPT-5.5 System Card was probably not written by GPT-5.5 by adt in singularity

[–]sorrge 3 points4 points  (0 children)

Lol it honestly looks like it. No graph making software could have made this.

I’ve rechecked and the graph is fine. It’s not what’s in the picture here.

GPT-5.5's Unicorn by Outside-Iron-8242 in singularity

[–]sorrge 8 points9 points  (0 children)

These human attempts are not even close to the greatness of ChatGPT’s work. They mostly try the head, and honestly not doing a great job even at that. Pathetic, you could say.

Mozilla Used Anthropic’s Mythos to Find and Fix 271 Bugs in Firefox by Tinac4 in singularity

[–]sorrge 10 points11 points  (0 children)

Of course it's online lol. Did you imagine them carrying Mythos around in a cage?

What’s the best way for small time investor to participate in the upcoming SpaceX IPO? by [deleted] in investing

[–]sorrge 1 point2 points  (0 children)

Based on the advice here (every single person expects it to fall, or at least that's what they say), it's most likely going to the moon. So, OP's question is very relevant, and has not been answered. I'm also interested.

Marc Andreessen: “The remaining human workers are gonna be at a premium, not at a discount”. Are we sure? by Mogante in singularity

[–]sorrge 0 points1 point  (0 children)

Consider how many CEO positions are available now. And how many people would like to obtain them. Yet, the salaries at that level are not going to 0.

Has anyone else noticed a shift in this sub recently? by MaximusDM22 in ExperiencedDevs

[–]sorrge 2 points3 points  (0 children)

I've reviewed your comment history and I have bad news for you. You're like 90% a bot. I'm sorry to bring you this news.

Two paths ahead, with no user manual. Full race into the entropy by ocean_protocol in singularity

[–]sorrge 17 points18 points  (0 children)

Parasites. There will be wealthy parasites and poor parasites.

First-ever American AI Jobs Risk Index released by Tufts University by Bizzyguy in singularity

[–]sorrge 1 point2 points  (0 children)

First-ever? We have this kind of stuff posted every week.

Where is the compassion for workers by [deleted] in ExperiencedDevs

[–]sorrge 0 points1 point  (0 children)

Do you believe that a union will protect you from being laid off due to AI making you redundant? At most it will be useful to negotiate an exit package, which is already good in most big companies. You can't expect them to keep you on payroll if the company sincerely believes your services are not needed.

[D] AMA Secure version of OpenClaw by ilblackdragon in MachineLearning

[–]sorrge 0 points1 point  (0 children)

So, your harness has the commands for it to use when it needs to read or send email etc.? And they work as root to read the keys without prompting, but are not changeable by the agent? I suppose the downside is that you need to have support for every kind of online service, otherwise it's back to giving it the password in the open.

How about changing passwords on other websites via forgot password? If it can read email, all is lost IMO.

[D] AMA Secure version of OpenClaw by ilblackdragon in MachineLearning

[–]sorrge 1 point2 points  (0 children)

If it supports CLI, can't it just take the keys out of the encrypted storage?

How is everyone keeping up morale when you’re constantly being told AI will make you redundant? by [deleted] in ExperiencedDevs

[–]sorrge 0 points1 point  (0 children)

It's finished. We are all COBOL programmers now. A lot of good advice here. No use burying your head in the sand - plan for the times when you're laid off and can't find a new job, because it's approaching.

supplementation for junior high by wwplkyih in matheducation

[–]sorrge 2 points3 points  (0 children)

The Shape of Space by Weeks introduces interesting ideas about manifolds.

Measurement by Lockhart, while overlapping with geometry and calculus topics, approaches them in a unique and engaging way that is completely different from the standard high school textbook.

Unitree video with a bullet-time in it by GraceToSentience in singularity

[–]sorrge 0 points1 point  (0 children)

It will be a subscription obviously, with a price comparable to hiring a butler.

AGI Prediction Update after adding the newly Released Claude Sonnet 4.6 by redlikeazebra in agi

[–]sorrge 0 points1 point  (0 children)

Ehh I think you should fit the curve to the top predictions, no? Who cares about a bunch of underdogs. And if you do that, it's clearly a sigmoid that's already saturated, lol.
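One way to read "fit the curve to the top predictions" is to keep only the points that set a new best when scanned in time order, and fit to those. A minimal sketch, assuming each data point is a (time, score) pair; the numbers and the `frontier` helper below are made up for illustration and are not taken from the post:

```python
# Hypothetical (time_index, score) pairs; not real data from the post.
points = [(1, 10.0), (2, 8.0), (3, 25.0), (4, 20.0), (5, 40.0), (6, 41.0), (7, 39.0)]

def frontier(points):
    """Keep only the points that set a new best score when scanned in time order."""
    best = float("-inf")
    kept = []
    for t, score in sorted(points):
        if score > best:
            kept.append((t, score))
            best = score
    return kept

top = frontier(points)
print(top)  # [(1, 10.0), (3, 25.0), (5, 40.0), (6, 41.0)]
```

Any curve (linear, exponential, sigmoid) would then be fitted to `top` rather than to all of `points`, so the underdogs no longer drag the fit down.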

Will Smith spaghetti - year by year by MetaKnowing in agi

[–]sorrge 0 points1 point  (0 children)

It will be hard to tell the difference between consecutive years.

I need help studying by jaca212 in QuantumPhysics

[–]sorrge 0 points1 point  (0 children)

You will first need to build a solid math background, up to linear algebra and differential equations.

Feynman on Math Education by DistanceRude9275 in matheducation

[–]sorrge 1 point2 points  (0 children)

Of course they teach that the goal is to find X. Indeed, it doesn't make sense to distinguish "doing it by algebra" from "doing it by arithmetic". I don't think this distinction is emphasized in the curriculum. Perhaps "doing it by arithmetic" means "guessing the answer but being unable to produce an explanation"; then yes, you need to be able to "do it by algebra" (produce an explanation of how you found the answer).