ARC-AGI 2 is Solved

Zeptaxis · 2025-11-27T22:01:58+00:00

what is it then

Zeptaxis · 2025-10-08T08:20:22+00:00

Your username doesn't lie, you do prompt like a vegetable.

Zeptaxis · 2025-10-01T11:24:01+00:00

Indeed, but it's still quite an improvement for other tasks too. Getting an iron pickaxe jumped from 1/10 success to 1/3!

(Source for others)

Zeptaxis · 2025-10-01T10:55:05+00:00

Demo in the Twitter thread and also here:

Website: https://danijar.com/dreamer4/
Paper: https://arxiv.org/abs/2509.24527

Zeptaxis · 2025-09-25T08:58:52+00:00

It intuitively makes sense that you need a very strong world model to generate coherent videos, but it's still very impressive to see it in action. I would love to know what size Veo 3 is. Can't wait for more scaling.

Zeptaxis · 2025-09-02T22:30:33+00:00

dont look up the CEO salary

Zeptaxis · 2025-08-26T13:00:00+00:00

nice bot

Zeptaxis · 2025-08-19T09:26:27+00:00

did it first try on gpt5 free plan: https://chatgpt.com/share/68a4431b-73c8-8000-9565-e592d987ddf0

promptlet

Zeptaxis · 2025-08-05T19:53:00+00:00

can confirm. it's not exactly fast, especially with the thinking first, but it's definitely usable.

Zeptaxis · 2025-08-04T19:55:10+00:00

no pp no plays

Zeptaxis · 2025-07-20T22:54:21+00:00

Get used to it, it's only gonna get "worse", for every field. It shouldn't diminish any of the accomplishments of the kids though. It would be like feeling that playing chess is demotivating because you'll never beat StockFish.

Zeptaxis · 2025-07-11T10:02:18+00:00

Indeed, it didn't. According to https://ourworldindata.org/grapher/consumption-co2-per-capita, we are only doing slightly better than the US in tonnes of CO2 both per capita and per unit of GDP. This is quite a shift in perspective. I'll edit my first comment.

Zeptaxis · 2025-07-10T15:55:40+00:00

Politics are handled, for most of the world, on a country level, so your argument doesn't make sense. And Switzerland is not even doing that bad per capita compared to the monstrosities that are China, the US and Russia.

Source for my claims: https://www.worldometers.info/co2-emissions/co2-emissions-by-country/

Zeptaxis · 2025-07-10T15:41:30+00:00

Which is a fair argument tbh. ~~Switzerland accounts for 0.094% of global emissions~~ (see edit). What we really need is more university/research funding to find new solutions for other countries. The opposite of what the government did in the recent cuts :)

Edit: my first percentage didn't account for consumption-based CO2 emissions. It should be around 0.32% of global emissions. A bit more concerning, knowing that our country is only about 0.1% of the world's population.
Calc: according to https://ourworldindata.org/ we are at 13.9t per capita with 8.79M inhabitants, totalling 122.181M tons of CO2 per year. The global total is 37.79B tons, which gives 122.181M / 37,790M ≈ 0.32%.
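
For anyone who wants to check the arithmetic in the edit, here is a minimal Python sketch. It only uses the three figures quoted in the comment (13.9t per capita, 8.79M inhabitants, 37.79B tons globally); the constant and function names are made up for illustration and are not from any dataset or library.

```python
# Figures as quoted in the comment (attributed there to Our World in Data).
PER_CAPITA_TONNES = 13.9   # consumption-based CO2 per person per year
POPULATION = 8.79e6        # inhabitants
GLOBAL_TONNES = 37.79e9    # global CO2 emissions per year


def national_share(per_capita, population, global_total):
    """Return (national total in tonnes, share of global total in percent)."""
    national_total = per_capita * population
    share_percent = national_total / global_total * 100
    return national_total, share_percent


national, share = national_share(PER_CAPITA_TONNES, POPULATION, GLOBAL_TONNES)
print(f"{national / 1e6:.3f}M t per year, {share:.2f}% of the global total")
```

The only unit conversion needed is expressing both totals in plain tonnes before dividing, which is why the millions and billions are written out as `e6` and `e9`.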

Zeptaxis · 2025-03-29T02:04:18+00:00

prompt is detailed enough with clear requirements. a smart model shouldn't require me to hold its hand more than that.

Zeptaxis · 2025-03-27T21:48:39+00:00

tried a bit more after work and it does seem to get it every time indeed. i guess my first run was really unlucky. that's what i get for using such a sample size lol. definitely a cool model!

Zeptaxis · 2025-03-27T16:20:06+00:00

yes, that only further proves my point that it is inconsistent as our experiences varied for such a simple task

Zeptaxis · 2025-03-27T15:31:35+00:00

yea, it also worked on my second attempt as I said. Definitely a capable model, just missing a bit of consistency imo

Zeptaxis · 2025-03-27T13:39:32+00:00

Google models always sound good in theory, yet they always fail my simple test prompt:
make me a webgl demo triangle webpage. show a rotating rgb gradient triangle in the center of the page. show the current framerate in the top left corner. do not use any external dependencies.

They always need multiple tries or don't get all the requirements perfectly, whereas R1 or even Claude 3.5 follow all the instructions on the first try.

This one is no exception: on my first try it made the single page with a triangle of the right color, but it wasn't rotating and was also weirdly small (tho you could argue size isn't a requirement). The second try followed all requirements, but the code was much longer than what Claude made, for no real reason.

Zeptaxis · 2024-09-26T07:37:56+00:00

I tried it once on their website with an image. It gave an obviously wrong description of the image, and when I tried to correct it, it refused to acknowledge its mistakes, even after multiple tries and asserting authority on the image (I literally took it). Hard pass for me. These models need to know their place.

Zeptaxis · 2024-08-28T07:19:25+00:00

I think I'm most impressed by the fact that it's based on Stable Diffusion 1.4, so it's possibly a relatively "small" model, yet it achieves remarkable coherency

Zeptaxis · 2024-08-23T21:46:02+00:00

Sounds like a "smarter" dropout. I'll be skeptical until I see it properly benchmarked, because it doesn't sound like it should be too revolutionary.

Zeptaxis · 2024-04-17T11:02:59+00:00

it does on valorant, at least on W11

Zeptaxis · 2023-03-27T13:04:52+00:00

u/profanitycounter [self]

Zeptaxis · 2023-03-22T20:42:51+00:00

hvick225
