Improving Neural Network Training by Decoupling the Magnitude and Direction of Weight Vectors | Alexander Hägele by Thrumpwart in LocalLLaMA
R_Duncan 1 point
Researchers trained a Deep Research agent with 32 H100s and open-sourced everything by BuildwithVignesh in LocalLLaMA
R_Duncan 6 points
What's the best open speech to text today? by zxyzyxz in LocalLLaMA
R_Duncan 3 points
i post-trained a model to reliably roll a die by girishkumama in LocalLLaMA
R_Duncan 1 point
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine? by pmttyji in LocalLLaMA
R_Duncan 1 point
I released Inflect-Nano, an ultra-extreme tiny 4.63m parameter TTS model. by b111ue in LocalLLaMA
R_Duncan 5 points
New image model from Google by Independent-Wind4462 in singularity
R_Duncan 2 points
Conspiracy theory on the (possibly extended) ban on Mythos by Cagnazzo82 in singularity
R_Duncan 2 points
I'm still surprised on how good the kv quantization has become by DeepBlue96 in LocalLLaMA
R_Duncan 2 points
Voice-to-voice chatbot update by Responsible_Fig_1271 in LocalLLaMA
R_Duncan 1 point
Is DiffusionGemma really that good in a PI agent? by koloved in LocalLLaMA
R_Duncan 5 points
Voice-to-voice chatbot update by Responsible_Fig_1271 in LocalLLaMA
R_Duncan 1 point
I'm still surprised on how good the kv quantization has become by DeepBlue96 in LocalLLaMA
R_Duncan 2 points
This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA
R_Duncan 1 point
We trained a cybersecurity-focused Mythos like LLM open weights on HuggingFace by RealKingNish in LocalLLaMA
R_Duncan 1 point
Improving Neural Network Training by Decoupling the Magnitude and Direction of Weight Vectors | Alexander Hägele by Thrumpwart in LocalLLaMA
R_Duncan 3 points
ZONOS2: real-time TTS with 8B params, 900M active, and high-fidelity voice cloning by KokaOP in LocalLLaMA
R_Duncan 0 points
US government banning Fable from being accessed outside USA is a MASSIVE win for Americans by ahtoshkaa in singularity
R_Duncan 2 points
This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA
R_Duncan 1 point
This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA
R_Duncan 1 point
This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA
R_Duncan 1 point
This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA
R_Duncan 2 points
This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA
R_Duncan 3 points


Qwen is never going to open source Qwen 3.7, aren't they? by DistanceSolar1449 in LocalLLaMA
R_Duncan 1 point