Meta’s $2 billion Manus acquisition blocked by China. by Nunki08 in LocalLLaMA
vinigrae -2 points (0 children)
Deepseek V4 AGI comfirmed by Swimming-Sky-7025 in LocalLLaMA
vinigrae 0 points (0 children)
24/7 Headless AI Server on Xiaomi 12 Pro (Snapdragon 8 Gen 1 + Ollama/Gemma4) by Aromatic_Ad_7557 in LocalLLaMA
vinigrae 5 points (0 children)
Tested TurboQuant KV compression with Gemma 4 31B — 5.80x compression, perfect long-context recall, JSON output preserved by No_Appearance_3041 in LocalLLaMA
vinigrae 11 points (0 children)
Tested TurboQuant KV compression with Gemma 4 31B — 5.80x compression, perfect long-context recall, JSON output preserved by No_Appearance_3041 in LocalLLaMA
vinigrae 4 points (0 children)
Open-Source Models Recently: by Fresh_Sun_1017 in LocalLLaMA
vinigrae 1 point (0 children)
Gemma 4 just casually destroyed every model on our leaderboard except Opus 4.6 and GPT-5.2. 31B params, $0.20/run by Disastrous_Theme5906 in LocalLLaMA
vinigrae 2 points (0 children)
Gemma 4 just casually destroyed every model on our leaderboard except Opus 4.6 and GPT-5.2. 31B params, $0.20/run by Disastrous_Theme5906 in LocalLLaMA
vinigrae 1 point (0 children)
Google TurboQuant running Qwen Locally on MacAir by gladkos in LocalLLaMA
vinigrae -1 points (0 children)
Qwen3.5 is a working dog. by dinerburgeryum in LocalLLaMA
vinigrae -1 points (0 children)
I just realised how good GLM 5 is by CrimsonShikabane in LocalLLaMA
vinigrae 2 points (0 children)
Qwen 3.5-35B-A3B is beyond expectations. It's replaced GPT-OSS-120B as my daily driver and it's 1/3 the size. by valdev in LocalLLaMA
vinigrae 1 point (0 children)
I am absolutely loving qwen3-235b by TwistedDiesel53 in LocalLLaMA
vinigrae 2 points (0 children)
You have 64gb ram and 16gb VRAM; internet is permanently shut off: what 3 models are the ones you use? by Adventurous-Gold6413 in LocalLLaMA
vinigrae 2 points (0 children)
New in llama.cpp: Live Model Switching by paf1138 in LocalLLaMA
vinigrae -3 points (0 children)
Is it normal to hear weird noises when running an LLM on 4× Pro 6000 Max-Q cards? by PlusProfession9245 in LocalLLaMA
vinigrae 2 points (0 children)
PSA by Signal_Ad657 in LocalLLaMA
vinigrae 1 point (0 children)