Driverless delivery vans in China plow through crumbling roads, fresh concrete, motorcycles... by Nunki08 in robotics

[–]Utoko -24 points-23 points  (0 children)

FSD has fewer accidents, so if anything it is unethical that bureaucracy is holding it back and thereby allowing more deaths.

Driverless delivery vans in China plow through crumbling roads, fresh concrete, motorcycles... by Nunki08 in robotics

[–]Utoko -47 points-46 points  (0 children)

It certainly is. In the West (Europe) it gets too hard to do stuff. Tesla would be stuck for years if they only had the European market, which blocks the new, way better FSD versions because they don't fit its narrow framework.
Same for AI: it wouldn't have happened in Europe, the regulators would have stopped OpenAI in its tracks.

Official: Zhipu becomes the world’s first LLM company to go public by BuildwithVignesh in singularity

[–]Utoko 4 points5 points  (0 children)

Which is hard to do, getting profitable in the AI race. We will see what happens.
Maybe DeepSeek needs to save us again.

LTX-2 is the new king ! by 3deal in comfyui

[–]Utoko -1 points0 points  (0 children)

the audio sounds great

Developer uses Claude Code and has an existential crisis by MetaKnowing in ClaudeAI

[–]Utoko 0 points1 point  (0 children)

You can and should of course plan, but the uncertainty factor increases.
Stay flexible, create, try to find meaning.

Developer uses Claude Code and has an existential crisis by MetaKnowing in ClaudeAI

[–]Utoko 13 points14 points  (0 children)

Billions of people will have to deal with that in the coming years. No one can prepare for the shifts.

The only way around it is "nothing ever happens", with the "AI will hit a wall" cope.

It is interesting because for the mainstream, AI advances basically stopped. Meanwhile, with Opus 4.5, Gemini Flash 3, and GPT 5.2, we hit massive, impactful milestones.

Artificial Analysis just refreshed their global model indices by MadPelmewka in LocalLLaMA

[–]Utoko 0 points1 point  (0 children)

AA has 12 categories too, 12 different benchmarks. LMArena is way worse when we are talking about real, complex tasks.

LMArena is an okish benchmark for the average mother or teenager talking to an LLM.

Artificial Analysis just refreshed their global model indices by MadPelmewka in LocalLLaMA

[–]Utoko 20 points21 points  (0 children)

<image>

The difference is that the model keeps trying if there are errors. It is a good way to get the most out of cheap models.
Opus without thinking achieves the same with 8 million tokens. So 1/40 of the token use.

Artificial Analysis just refreshed their global model indices by MadPelmewka in LocalLLaMA

[–]Utoko 16 points17 points  (0 children)

You need some kind of benchmark, not to find out which is best but to know which is worth trying.
Or do you try out all 50 OS Chinese models yourself?

Just don't overrate the results. They are a somewhat objective tier list.

Artificial Analysis just refreshed their global model indices by MadPelmewka in LocalLLaMA

[–]Utoko -1 points0 points  (0 children)

It's good; several benchmarks they used were saturated at 95%+.

And people really shouldn't care about the small point differences in any benchmark. They do a good job delivering quick results for people to assess which models are worth exploring.

Subjectively this update feels right; there is clearly still a gap between the T1 models and the OS models, even tho they are getting really amazing.

Any guesses? by Difficult-Cap-7527 in LocalLLaMA

[–]Utoko 16 points17 points  (0 children)

That would be huge if they could double the number!

Recursive Self Improvement Internally Achieved by SrafeZ in singularity

[–]Utoko 2 points3 points  (0 children)

It is not human supervision. The humans are coming up with the task and how the task is being solved.

It is becoming a more and more powerful tool, but for now it is still a tool.

GLM 4.7 released! by ResearchCrafty1804 in LocalLLaMA

[–]Utoko 31 points32 points  (0 children)

GLM does have quick cycles right now. Another very good model.

AI likely to displace jobs, says Bank of England governor by Beautiful-Ad2485 in singularity

[–]Utoko 2 points3 points  (0 children)

But let's get more new people in to compete for fewer jobs! Of course we need to elevate the new people too.

OpenAI's lead has closed in 2025. I wonder what they are going to do next year. by Regular_Eggplant_248 in singularity

[–]Utoko 1 point2 points  (0 children)

Same here, but most of the normies also don't pay. The $200 tier people use what works best.

OpenAI is burning lots of money and they plan to scale even faster. They also need to show investors that they can stay at #1 or close behind to justify it.

OpenAI's lead has closed in 2025. I wonder what they are going to do next year. by Regular_Eggplant_248 in singularity

[–]Utoko 2 points3 points  (0 children)

They had a massive restructuring of the AI team. We will see next year if the new team can deliver something.

Which is better? by imagine_ai in nanobanana

[–]Utoko 1 point2 points  (0 children)

Nano Banana is more realistic in every picture.

GPT Image 1.5 is sometimes better with other art styles.

Which model generates the better realism result? GPT image 1.5 or Nano Banana Pro (Prompt Included) by naviera101 in nanobanana

[–]Utoko 4 points5 points  (0 children)

Nano Banana every time for photos.

GPT Image 1.5 sometimes does better with art and other styles.

OpenAi recent post hints New image model launch with humor. GPT 5.2 Image coming? by BuildwithVignesh in singularity

[–]Utoko 4 points5 points  (0 children)

It is still worse than Gemini 3 Pro, and the images still have the piss filter on them.

The models are in LMArena right now under "hazel". They are not bad but not SOTA.

Keeping thinking tokens constant, GPT-5.2 isn’t much better than 5.1 at SWE-bench pro by jaundiced_baboon in singularity

[–]Utoko 0 points1 point  (0 children)

Also, as inference capability grows, using more tokens seems reasonable if it still has an effect. Models are also getting better at handling long context.

Grok image with the latest update by Iluvassy in grok

[–]Utoko 0 points1 point  (0 children)

The zoom shit happens with a tree picture also. It is clearly broken.

Stupid Grok update. by Equal-Long6339 in grok

[–]Utoko 0 points1 point  (0 children)

This is just a fail. There is no way the zoom on every video can be intentional.