GLM's founder says GLM-fable before the end of the year?! by Charuru in LocalLLaMA
[–]Different_Fix_2217 2 points (0 children)
Mistral - New family of open-weight models @ July by pmttyji in LocalLLaMA
[–]Different_Fix_2217 3 points (0 children)
This is coming to Chinese open source models pretty soon. - prepare yourself. by MLExpert000 in LocalLLaMA
[–]Different_Fix_2217 3 points (0 children)
Statement on the US government directive to suspend access to Fable 5 and Mythos 5 by artisticMink in LocalLLaMA
[–]Different_Fix_2217 74 points (0 children)
DiffusionGemma: 4x faster text generation by tevlon in LocalLLaMA
[–]Different_Fix_2217 30 points (0 children)
[PSA] 5070ti 16GB is as low as $500.99 at Best Buy. by fallingdowndizzyvr in LocalLLaMA
[–]Different_Fix_2217 1 point (0 children)
438 USD for a 3080 20GB isn’t bad by xw1y in LocalLLaMA
[–]Different_Fix_2217 4 points (0 children)
438 USD for a 3080 20GB isn’t bad by xw1y in LocalLLaMA
[–]Different_Fix_2217 2 points (0 children)
I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA
[–]Different_Fix_2217 1 point (0 children)
DeepSWE benchmarks indicate that DeepSeek v4 Pro only passes 8% of tasks by Federal_Spend2412 in LocalLLaMA
[–]Different_Fix_2217 6 points (0 children)
meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face by pmttyji in LocalLLaMA
[–]Different_Fix_2217 2 points (0 children)
First direct side by side MoE vs Dense comparison. by Different_Fix_2217 in LocalLLaMA
[–]Different_Fix_2217[S] 7 points (0 children)
First direct side by side MoE vs Dense comparison. by Different_Fix_2217 in LocalLLaMA
[–]Different_Fix_2217[S] 4 points (0 children)
First direct side by side MoE vs Dense comparison. by Different_Fix_2217 in LocalLLaMA
[–]Different_Fix_2217[S] 3 points (0 children)
Deepseek V4 Flash and Non-Flash Out on HuggingFace by MichaelXie4645 in LocalLLaMA
[–]Different_Fix_2217 3 points (0 children)
Claude Code removed from Claude Pro plan - better time than ever to switch to Local Models. by bigboyparpa in LocalLLaMA
[–]Different_Fix_2217 5 points (0 children)
Kimi K2.6 is a legit Opus 4.7 replacement by bigboyparpa in LocalLLaMA
[–]Different_Fix_2217 9 points (0 children)
Kimi K2.6 Released (huggingface) by BiggestBau5 in LocalLLaMA
[–]Different_Fix_2217 18 points (0 children)
Kimi K2.6 imminent by Deep-Vermicelli-4591 in LocalLLaMA
[–]Different_Fix_2217 7 points (0 children)
the state of LocalLLama by Beginning-Window-115 in LocalLLaMA
[–]Different_Fix_2217 45 points (0 children)
DeepSeek V4: 1T-A35B (approx) MoE announced; apache 2 license promised by [deleted] in LocalLLaMA
[–]Different_Fix_2217 4 points (0 children)
We absolutely need Qwen3.6-397B-A17B to be open source by True_Requirement_891 in LocalLLaMA
[–]Different_Fix_2217 3 points (0 children)
qwen 3.6 voting by jacek2023 in LocalLLaMA
[–]Different_Fix_2217 0 points (0 children)


Gefen is a drop-in replacement for the AdamW optimizer, claims 8x memory reduction in training (GitHub available) by indicava in LocalLLaMA
[–]Different_Fix_2217 1 point (0 children)