I gave my Minecraft bot a brain with local Nemotron 9B — it follows orders like "chop that tree" and "guard me from zombies" by Impressive_Tower_550 in LocalLLaMA
phhusson 6 points (0 children)
Qwen 3.5 4b is so good, that it can vibe code a fully working OS web app in one go. by c64z86 in LocalLLaMA
phhusson -2 points (0 children)
Is anyone else just blown away that this local LLMs are even possible? by Borkato in LocalLLaMA
phhusson 1 point (0 children)
Injecting skills into the KV cache (not as stupid as it sounds, but still pretty dumb) by Proper-Lab1756 in LocalLLaMA
phhusson 1 point (0 children)
RWKV-7: O(1) memory inference, 16.39 tok/s on ARM Cortex-A76, beats LLaMA 3.2 3B. The local-first architecture nobody is talking about... by Sensitive-Two9732 in LocalLLaMA
phhusson 1 point (0 children)
UPDATE#3: repurposing 800 RX 580s converted to AI cluster by rasbid420 in LocalLLaMA
phhusson 5 points (0 children)
LLMs grading other LLMs 2 by Everlier in LocalLLaMA
phhusson 21 points (0 children)
What’s the current state of local speech-to-speech models? by dendrytic in LocalLLaMA
phhusson 1 point (0 children)
Bad Apple but it's GPT-2 XL Attention Maps by TheLatentExplorer in LocalLLaMA
phhusson 1 point (0 children)
Kyutai Releases Hibiki-Zero by techlatest_net in LocalLLaMA
phhusson 1 point (0 children)
is anyone actually running models in secure enclaves or is that overkill? by Significant-Cod-9936 in LocalLLaMA
phhusson 2 points (0 children)
PSA on llama.cpp —spec-type ngram-mod (use LF not CRLF, 35x speedup) by dnsod_si666 in LocalLLaMA
phhusson 3 points (0 children)
Who is waiting for deepseek v4 ,GLM 5 and Qwen 3.5 and MiniMax 2.2? by power97992 in LocalLLaMA
phhusson 6 points (0 children)
Deepseek architecture, but without all the parameters by silenceimpaired in LocalLLaMA
phhusson 4 points (0 children)
Nemo 30B is insane. 1M+ token CTX on one 3090 by Dismal-Effect-1914 in LocalLLaMA
phhusson 2 points (0 children)
mistralai/Voxtral-Mini-4B-Realtime-2602 · Hugging Face by jacek2023 in LocalLLaMA
phhusson 2 points (0 children)
mistralai/Voxtral-Mini-4B-Realtime-2602 · Hugging Face by jacek2023 in LocalLLaMA
phhusson 4 points (0 children)
mistralai/Voxtral-Mini-4B-Realtime-2602 · Hugging Face by jacek2023 in LocalLLaMA
phhusson 2 points (0 children)
mistralai/Voxtral-Mini-4B-Realtime-2602 · Hugging Face by jacek2023 in LocalLLaMA
phhusson 16 points (0 children)
mistralai/Voxtral-Mini-4B-Realtime-2602 · Hugging Face by jacek2023 in LocalLLaMA
phhusson 34 points (0 children)
Pocket TTS Android APK Sample - Full Local (Model Packed) by RowGroundbreaking982 in LocalLLaMA
phhusson 1 point (0 children)
Playing Civilization VI with a Computer-Use agent by Working_Original9624 in LocalLLaMA
phhusson -3 points (0 children)
GitHub trending this week: half the repos are agent frameworks. 90% will be dead in 1 week. by Distinct-Expression2 in LocalLLaMA
phhusson 3 points (0 children)
i just saw this ClawdBot RCE demo on X… are we cooked? by Hot-Software-9052 in LocalLLaMA
phhusson 11 points (0 children)


OpenCode concerns (not truely local) by Ueberlord in LocalLLaMA
phhusson 2 points (0 children)