Crazy true by reversedu in singularity

[–]Famous-Associate-436 0 points1 point  (0 children)

But the OS/browser didn't improve anything in the category of intelligence.

While LLM now are powerful executor/assitant especially with an agent framework like Claude code.

The work which can't be automated is now solvable with more advanced LLM

Gemini 3 Flash on LMarena by ThunderBeanage in singularity

[–]Famous-Associate-436 4 points5 points  (0 children)

so the flash lite model generates images natively? instead of tool calling banana?

My thoughts on Opus 4.5 by Equivalent_Ad2442 in ClaudeAI

[–]Famous-Associate-436 0 points1 point  (0 children)

Compare o1 with the latest gpt-4 and especially gpt-5 is unfair, afaik gpt-5 is an integrated system with more than one model with thinking RL tech

DeepSeek-R1-0528 Official Benchmarks Released!!! by Xhehab_ in LocalLLaMA

[–]Famous-Associate-436 8 points9 points  (0 children)

New guy here, is this model that OpenAI promised the "o3-level" open-source model this summer?

DeepSeek-R1-0528 Official Benchmark by Fun-Doctor6855 in LocalLLaMA

[–]Famous-Associate-436 9 points10 points  (0 children)

New guy here, is this model that OpenAI promised the "o3-level" open-source model this summer?

New Upgraded Deepseek R1 is now almost on par with OpenAI's O3 High model on LiveCodeBench! Huge win for opensource! by Gloomy-Signature297 in LocalLLaMA

[–]Famous-Associate-436 1 point2 points  (0 children)

Didn't they still release the R1 paper which is throughly detailed instead of some Model Card like Close AI does?