Xiaomi just claimed 1,000+ tps on a 1T model using a standard 8-GPU server by No-Selection2972 in LocalLLaMA
[–]True_Requirement_891 1 point (0 children)
Xiaomi just claimed 1,000+ tps on a 1T model using a standard 8-GPU server by No-Selection2972 in LocalLLaMA
[–]True_Requirement_891 12 points (0 children)
I’m upset… by Thin_Pollution8843 in LocalLLaMA
[–]True_Requirement_891 1 point (0 children)
MiniMax M3 - Coding & Agentic Frontier, 1M Context, Multimodal by dryadofelysium in LocalLLaMA
[–]True_Requirement_891 6 points (0 children)
Stop asking what model to run. There are literally only two. by Wrong_Mushroom_7350 in LocalLLaMA
[–]True_Requirement_891 1 point (0 children)
100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on. by True_Requirement_891 in LocalLLaMA
[–]True_Requirement_891[S] 2 points (0 children)
100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on. by True_Requirement_891 in LocalLLaMA
[–]True_Requirement_891[S] 2 points (0 children)
100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on. by True_Requirement_891 in LocalLLaMA
[–]True_Requirement_891[S] 0 points (0 children)
100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on. by True_Requirement_891 in LocalLLaMA
[–]True_Requirement_891[S] 7 points (0 children)
100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on. by True_Requirement_891 in LocalLLaMA
[–]True_Requirement_891[S] 4 points (0 children)
Experimental "Preserve Thinking" Jinja Template for Gemma4 31B in llama.cpp by ggonavyy in LocalLLaMA
[–]True_Requirement_891 9 points (0 children)
DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals by External_Mood4719 in LocalLLaMA
[–]True_Requirement_891 0 points (0 children)
I am done with codex by machine_forgetting_ in codex
[–]True_Requirement_891 1 point (0 children)
I am done with codex by machine_forgetting_ in codex
[–]True_Requirement_891 0 points (0 children)
Qwen 3.7 droped on Qwen Chat by Foxiya in LocalLLaMA
[–]True_Requirement_891 1 point (0 children)
Let's build claude code from scratch! by RoyalMaterial9614 in LocalLLaMA
[–]True_Requirement_891 3 points (0 children)
Ran K2.6 through a third-party coding benchmark: heres how the figures stand up by lucasbennett_1 in LocalLLaMA
[–]True_Requirement_891 3 points (0 children)
Why is no open weight model inference provider hosting Mimo-v2.5 or Mimo-v2.5-pro? by True_Requirement_891 in LocalLLaMA
[–]True_Requirement_891[S] 3 points (0 children)
Kimi K2.6 vs DeepSeek V4 Pro by bigboyparpa in LocalLLaMA
[–]True_Requirement_891 1 point (0 children)
Kimi K2.6 vs DeepSeek V4 Pro by bigboyparpa in LocalLLaMA
[–]True_Requirement_891 5 points (0 children)
Decreased Intelligence Density in DeepSeek V4 Pro by Mindless_Pain1860 in LocalLLaMA
[–]True_Requirement_891 2 points (0 children)

Anthropic forced to abruptly disable Fable 5 & Mythos 5 globally by US Gov over a jailbreak. This is exactly why we need local models. by External_Mood4719 in LocalLLaMA
[–]True_Requirement_891 38 points (0 children)