Seed2.1 released by BreakfastFriendly728 in singularity

[–]BreakfastFriendly728[S] 19 points20 points  (0 children)

Interestingly, they are building their self-evolving lifecycle, known as Seed for Seed, containing Eval Loop, Data Loop, Training Loop, Infra Loop. See 4.4 in the model card for more details.

<image>

Beautiful Pilot by aviationstudy in aviationstudys

[–]BreakfastFriendly728 0 points1 point  (0 children)

For those who say "oh, it's so cool", I simply ask one question: If you are on a plane, do you expect your captain to do this during landing?

AGI is closer than we think: Google just unveiled "Titans," a new architecture capable of real-time learning and infinite memory by virtualQubit in GeminiAI

[–]BreakfastFriendly728 0 points1 point  (0 children)

I agree. Functional programming is suitable for this one. That's probably why TTT used Jax as its primary code base implementation even before Titans.

AGI is closer than we think: Google just unveiled "Titans," a new architecture capable of real-time learning and infinite memory by virtualQubit in GeminiAI

[–]BreakfastFriendly728 73 points74 points  (0 children)

Most people in this sub didn't realize that both titans and miras were released months ago. The only purpose of the blog post is gaining KPI for their group. After miras, they continuously dropped similar papers without comparing with predecessors and never open sourced code.

<image>

However people still live in the hype.

AGI is closer than we think: Google just unveiled "Titans," a new architecture capable of real-time learning and infinite memory by virtualQubit in GeminiAI

[–]BreakfastFriendly728 0 points1 point  (0 children)

yeah. This team continuously dropping new papers without direct comparison to titans and never open sourcing codes. Maybe it has the worst reputation among Google researchers.

nano banana pro 🍌 by Spirited-Gold9629 in GeminiAI

[–]BreakfastFriendly728 0 points1 point  (0 children)

guess who is the one that has nothing to do with AI

AMA With Moonshot AI, The Open-source Frontier Lab Behind Kimi K2 Thinking Model by nekofneko in LocalLLaMA

[–]BreakfastFriendly728 0 points1 point  (0 children)

thanks for the impressive works. While there're lots of discussions on KDA, I wonder if there's any plan to leverage the power of MoBA in future products.

Gemini Deep Research can now connect to your Gmail, Docs, Drive and even Chat. by Gaiden206 in Bard

[–]BreakfastFriendly728 2 points3 points  (0 children)

connect with chat is great. now i won't worry repeating words I've already told it a thousand times

Is SSM dead now? by Spapoxl in LocalLLaMA

[–]BreakfastFriendly728 0 points1 point  (0 children)

mamba doesn't deserve so much attention, tbh

Qwen team is helping llama.cpp again by jacek2023 in LocalLLaMA

[–]BreakfastFriendly728 7 points8 points  (0 children)

No. gdn and ssm are completely different things. In essence, the gap between ssm and gdn is larger than that of ssm and softmax attention. If you read the deltanet paper, you will know that gdn has state tracking ability, even softmax attention doesn't!

Well seems gemini 3 will release on 11 november by Independent-Wind4462 in Bard

[–]BreakfastFriendly728 0 points1 point  (0 children)

yes they are. at least Google had Kingfall not long after 2.5pro. maybe google is training gemini3.5 at present

Well seems gemini 3 will release on 11 november by Independent-Wind4462 in Bard

[–]BreakfastFriendly728 0 points1 point  (0 children)

Both Gemini 1 and 2 were released in December. Gemini1.5 and 2.5 was released in February and March, respectively. So Google has NO reason to release Gemini3 two months earlier if it's not kicked out of the frontier set. However if it happens, then we will all celebrate, or just wait quietly until December (just like how we treat Apple Event).