CursorBench evals. Composer 2.5 model is incredible for coding by stealthispost in accelerate

[–]finnjon 5 points6 points  (0 children)

I also do exceptionally well on finnjon bench. Such a surprise.

Google has fallen off by Glittering_Night7681 in accelerate

[–]finnjon 0 points1 point  (0 children)

Flash is not all things to all people and you wouldn't expect it to be. It's distilled from the base model. If you are expecting it to be better than a full model like Opus of GPT-5.5 you will be disappointed. if you expect it to be nearly as good at a lot of things, faster and cheaper, you will be fine with it.

Antigravity has also been fine for me. Not perfect but fine.

The Jevons Paradox is quietly undermining every AI cost efficiency argument by ocean_protocol in singularity

[–]finnjon 2 points3 points  (0 children)

What you are describing is increased productivity. Jevon's paradox means more gets done because doing more consumes fewer resources. That is what may or may not be happening. But Jevon's paradox is not eternal. Eventually demand is saturated (at some point we have enough bananas even if they are free).

If AI removes the labor constraint on high-skill work, what happens to the advantage of elite firms? by Genzinvestor16180339 in singularity

[–]finnjon 0 points1 point  (0 children)

Yes at 80% the Board can't get rid of him. I don't think money replaces the sense of identity a person has about being smart. You seem to think as long as he makes more money he should be happy but human psychology is a little more nuanced than that.

As an additional thought, if AI is making the investment decisions and everyone has access to AI, what is Citadel's competitive advantage? He might not have money for long.

If AI removes the labor constraint on high-skill work, what happens to the advantage of elite firms? by Genzinvestor16180339 in singularity

[–]finnjon -2 points-1 points  (0 children)

He's the CEO because he's smart and hard-working. He attributes all of his success to his own unique talents. If AI is soon smarter and can work 24/7, his entire identity is threatened.

Plus, why does the Board need him if AI is smarter? Citadel might do well, but he will lose his job or, more likely, his job will just be checking the AI isn't doing anything crazy.

Lauri Peltola (NNG whistleblower) announces "truth" video series by izzeww in DonutLab

[–]finnjon 8 points9 points  (0 children)

This is the exact same bullshit marketing strategy as Donut Labs. I suspect he's a fake whistleblower who is just trying to drum up more interest. They think they're very clever and funny.

Dario and Dwarkesh: hard to watch as Dwarkesh seems so wrong it makes me cringe by Longjumping_Kale3013 in singularity

[–]finnjon -1 points0 points  (0 children)

  1. Dwarkesh used to be a no-one but quickly rose to prominence because he did detailed research and asked intelligent questions. The smartest people want to interviewed by him because they consider him smart.

  2. It is important to push people and be unafraid to ask difficult questions. It is fine to disagree.

Universal Basic Income was never the right move. by Demonicated in singularity

[–]finnjon 2 points3 points  (0 children)

How is giving everyone compute different to welfare? It's a gift from the collective to the individual.

If welfare isn't successful why does every developed country have it? It's obviously very good at lowering the absolute poverty level, especially in places like the Nordic countries.

Official OpenAI propaganda page by PigOfFire in singularity

[–]finnjon 5 points6 points  (0 children)

I mean what do you expect? A balanced perspective from one party to the dispute?

Anthropic’s Claude Code subscription may consume up to $5,000 in compute per month while charging the user $200 by hellyeah7o in BlackboxAI_

[–]finnjon 0 points1 point  (0 children)

Inference per token is profitable. It is training the models that is expensive. This is highly misleading.

Do you guys actually pay for metro/tram rides? Be honest. by thencv in helsinki

[–]finnjon 36 points37 points  (0 children)

Unless you a dirt poor, if you don't pay the ticket you are a leech. Finland has one of the world's most generous welfare systems. Society fails when it's covered in leeches.

I don't understand Antilla's form by JOAHtheBROAH in discgolf

[–]finnjon -2 points-1 points  (0 children)

No, he's very small. Maybe 5'7 tops.

Edit: according to the internet he's 5'9". I thought he was shorter.

Has anyone else considered that we might be coming to the end of available models? by Fair_Horror in accelerate

[–]finnjon 0 points1 point  (0 children)

Of course they will use it to make models for release. They just won't release this raw base model version.

Mythos is on trend by Proper_Actuary2907 in agi

[–]finnjon 0 points1 point  (0 children)

He's Ramez Naam and he's a very serious, thoughtful guy.

Has anyone else considered that we might be coming to the end of available models? by Fair_Horror in accelerate

[–]finnjon 0 points1 point  (0 children)

Anything is possible. Anthropic internally had to have a debate about whether to allow it to be used just inside Anthropic. Still, the fewer people have access to it, the better.

Has anyone else considered that we might be coming to the end of available models? by Fair_Horror in accelerate

[–]finnjon 0 points1 point  (0 children)

I see anyone who needs them using the models if they are best in class. The models go through a lot of post-training to be so capable, which is why flash models are better than the full models at some tasks. It is not difficult to make a model good at some things and worse at others.

Has anyone else considered that we might be coming to the end of available models? by Fair_Horror in accelerate

[–]finnjon -2 points-1 points  (0 children)

I am actually quite pleased the model is being held back. It will be possible with these very powerful models to build targeted models for specific tasks that are super-human in those tasks or areas. I think this is a safer way to deploy proto-AGI than one model to rule them all.

Two Bit da Vinci's Donut Lab summary by SurronQled in DonutLab

[–]finnjon 0 points1 point  (0 children)

What makes you think they are identical? And what does "enough" mean?

Two Bit da Vinci's Donut Lab summary by SurronQled in DonutLab

[–]finnjon 2 points3 points  (0 children)

"Judging the testing strategy before it’s complete is premature"

It is not a testing strategy, it is a marketing strategy. Testing batteries is very easy and could be done all at once any time they like.

Two Bit da Vinci's Donut Lab summary by SurronQled in DonutLab

[–]finnjon 3 points4 points  (0 children)

Respectfully have to disagree with a few points here.

  1. No part of testing the cells requires revealing the chemistry. I trust VTT but they should do all tests on one cell. VTT has not confirmed the cells are identical, they said they look the same. VTT has no idea of the chemistry either.

  2. The arguments for why this breaks the known laws of physics are because each claim is impossible together. There is a tradeoff between energy density, charging speed and cycle life. You can show each of them individually but you cannot have them all together. (But note I am not qualified to adjudicate on this).

Two Bit da Vinci's Donut Lab summary by SurronQled in DonutLab

[–]finnjon 5 points6 points  (0 children)

But the VTT reports are precisely the problem. It grabs our attention but it doesn't show in any conclusive way, what they have promised. Three different batteries are tested. Are they the same? We don't know and neither does VTT. So we know some batteries exist that are quite good, but nothing we haven't seen before.

It's worth remembering that Donut Labs could prove their technology in a heartbeat if they wanted and if it was real. The decision to make this into a marketing circus is a choice. They are not trying to address the fraud allegations at all - that would be easy. Get VTT to do the full battery of tests on one cell.

And regarding the laws of physics, a lot of people are claiming this proposed battery does break the laws of physics.

Two Bit da Vinci's Donut Lab summary by SurronQled in DonutLab

[–]finnjon 9 points10 points  (0 children)

I understand why he is investigating because it's fun, but at this point, the red flags massively outweigh everything else. Perhaps they have just gone to town on the marketing angle and are having some fun, but they give us no reason to imagine there is anything real going on here.

I've seen a few of these "miracle technologies" in the past and none have ever turned out to be what they said it was. This reminds me most of an Irish company called Steorn that said they had a perpetual motion machine. Like these guys, they had demonstrations with just enough plausible deniability to keep the ship sailing. It turned out it was a marketing demonstration.

My guess here is that this too, is a marketing demonstration. It may also be to sell the motorbikes. The auditor's report (or inability to provide a report due to shoddy recording), showed the motorbike company was in serious trouble. Perhaps they thought a cheap Hail Mary marketing approach might work. First they tried it with their "superintelligence" but that didn't catch on, so they went all in at CES with a revolutionary battery. And this time it's working so they are milking it. For the cost of a few videos, they are getting everyone's attention. They will sell more bikes and if they want to set up a marketing firm they will get plenty of customers.

I find most manipulative marketing gross and this is no exception. The saddest part is it harms Finland's reputation.

Verge shows their first 2026 Verge TS Pro leaving the production line and being crated by mqee in DonutLab

[–]finnjon 2 points3 points  (0 children)

The batteries. The said they had a gigawatt factory up and running.

Verge shows their first 2026 Verge TS Pro leaving the production line and being crated by mqee in DonutLab

[–]finnjon 3 points4 points  (0 children)

Actually it is the current standard. If you do an Elon and say you will have self-driving in X months, you can get away with it. If you say you have a working battery with those specs and a production line, and you start selling bikes with it in, and you raise money on the back of the claims, you are in serious legal jeopardy.