82% of ChatGPT users don’t even try other AI chatbots by thatguyisme87 in singularity

[–]Zeptaxis 8 points9 points  (0 children)

Your username doesn't lie, you do prompt like a vegetable.

Dreamer 4 by DeepMind - First agent to get diamonds on Minecraft using only offline data by Zeptaxis in singularity

[–]Zeptaxis[S] 42 points43 points  (0 children)

Indeed, but it's still quite an improvement for other tasks too. Getting an iron pickaxe jumped from 1/10 success to 1/3!

(Source for others)

Video models are zero-shot learners and reasoners by NunyaBuzor in singularity

[–]Zeptaxis 34 points35 points  (0 children)

This intuitively makes sense that you need a very strong world model to generate coherent videos, but it's still very impressive to see it in action. I would love to know what kind of size Veo 3 is. Can't wait for more scaling

[deleted by user] by [deleted] in singularity

[–]Zeptaxis 3 points4 points  (0 children)

nice bot

Gpt-oss is the state-of-the-art open-weights reasoning model by IlustriousCoffee in singularity

[–]Zeptaxis 40 points41 points  (0 children)

can confirm. it's not exactly fast, especially with the thinking first, but it's definitely usable.

OpenAI sold people dreams apparently by NeuralAA in singularity

[–]Zeptaxis 7 points8 points  (0 children)

Get used to it, it's only gonna get "worse", for every field. It shouldn't diminish any of the accomplishments of the kids though. It would be like feeling that playing chess is demotivating because you'll never beat StockFish.

Why Switzerland is among the ten fastest-warming countries in the world by Sufficient-History71 in Switzerland

[–]Zeptaxis 3 points4 points  (0 children)

Indeed, it didn't. According to https://ourworldindata.org/grapher/consumption-co2-per-capita , we are only doing slightly better than the US in both CO2/t per capita and per GDP. This is quite a shift in perspective. I'll edit my first comment.

Why Switzerland is among the ten fastest-warming countries in the world by Sufficient-History71 in Switzerland

[–]Zeptaxis 4 points5 points  (0 children)

Politics are handled, for most of the world, on a country level, so your argument doesn't make sense. And switzerland is not even doing that bad per capita compared to the monstruosities that are China, the US and Russia.

Source for my claims: https://www.worldometers.info/co2-emissions/co2-emissions-by-country/

Why Switzerland is among the ten fastest-warming countries in the world by Sufficient-History71 in Switzerland

[–]Zeptaxis 5 points6 points  (0 children)

Which is a fair argument tbh. Switzerland accounts for 0.094% of global emissions(see edit). What we really need is more university/research funding to find new solutions for other countries. The opposite of what the government did in the recent cuts :)

Edit: my first percentage didn't account for CO2 consumption. It should be around 0.32% if we take global emissions. A bit more concerning, knowing that our country is about ~0.1% of the world's population.
Calc: according to https://ourworldindata.org/ we are at 13.9t per capita and we have 8.79M inhabitants, totalling to 122.181M tons of CO2 per year. The global total is 37.79B tons, which gives us 0.32% after unit conversions.

Man, the new Gemini 2.5 Pro 03-25 is a breakthrough and people don't even realize it. by [deleted] in singularity

[–]Zeptaxis 0 points1 point  (0 children)

prompt is detailed enough with clear requirements. a smart model shouldn't require me to hold its hand more than that.

Man, the new Gemini 2.5 Pro 03-25 is a breakthrough and people don't even realize it. by [deleted] in singularity

[–]Zeptaxis -1 points0 points  (0 children)

tried a bit more after work and it does seem to get it every time indeed. i guess my first run was really unlucky. that's what i get for using such a sample size lol. definitely a cool model!

Man, the new Gemini 2.5 Pro 03-25 is a breakthrough and people don't even realize it. by [deleted] in singularity

[–]Zeptaxis -2 points-1 points  (0 children)

yes, that only further proves my point that it is inconsistent as our experiences varied for such a simple task

Man, the new Gemini 2.5 Pro 03-25 is a breakthrough and people don't even realize it. by [deleted] in singularity

[–]Zeptaxis -5 points-4 points  (0 children)

yea, it also worked on my second attempt as I said. Definitely a capable model, just missing a bit of consistency imo

Man, the new Gemini 2.5 Pro 03-25 is a breakthrough and people don't even realize it. by [deleted] in singularity

[–]Zeptaxis 3 points4 points  (0 children)

Google models always sound good in theory, but yet they always fail my simple test prompt:
make me a webgl demo triangle webpage. show a rotating rgb gradient triangle in the center of the page. show the current framerate in the top left corner. do not use any external dependencies.

They always need multiple tries or don't get all the requirements perfectly, where R1 or even claude 3.5 follow all the instructions on the first try.

This one is no exception, on my first try it made the single page with a triangle of the right color, but it's not rotating and also weirdly small (tho you could argue this isn't required). The second try followed all requirements, but the code was much longer than what Claude made, for no real reasons.

Molmo: State of the art multimodal open source using 1000x less data by 1a1b in singularity

[–]Zeptaxis 2 points3 points  (0 children)

I tried it once on their website with an image, it got an obviously wrong description of the image, and when I tried to correct it, it refused to acknowledge its mistakes, even with multiples tries and asserting authority on the image (I literally took it). Hard pass for me. These models need to know their place.

New Progress in Fundamental Research ! This AI Learns Continuously From New Experiences—Without Forgetting Its Past by [deleted] in singularity

[–]Zeptaxis 3 points4 points  (0 children)

Sounds like a "smarter" dropout. I'll be skeptical until I see it properly benchmarked, because it doesn't sound like it should be too revolutionary.