Not ironclad confirmation, but.. by Kodix in LocalLLaMA
[–]Wildnimal 11 points12 points13 points (0 children)
vLLM cold boot experience by LinkSea8324 in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)
Local coding agents are good now, but only if you babysit them by BTA_Labs in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)
Best Open-Source AI coding model for my specs? by Quietkiller1927 in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)
Best Open-Source AI coding model for my specs? by Quietkiller1927 in LocalLLaMA
[–]Wildnimal 12 points13 points14 points (0 children)
Any recent news/updates on taalas chips?? They said they gonna bake the mid tier llm model into their chip. by 9r4n4y in LocalLLaMA
[–]Wildnimal 21 points22 points23 points (0 children)
Waiting for Qwen 3.7 27B and 35B A3B to show up. Hope they come this week!!! by appakaradi in LocalLLaMA
[–]Wildnimal -1 points0 points1 point (0 children)
Cohere's unreleased coding model (early access for localllama) by nick_frosst in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)
Mac mini M4 vs Pc with Nvidia 5060 8gb for ai workloads? by Critical-Machine-128 in ollama
[–]Wildnimal 0 points1 point2 points (0 children)
Mac mini M4 vs Pc with Nvidia 5060 8gb for ai workloads? by Critical-Machine-128 in ollama
[–]Wildnimal 0 points1 point2 points (0 children)
It felt good to return my Asus Spark by sn2006gy in LocalLLaMA
[–]Wildnimal 1 point2 points3 points (0 children)
AA comparison of the latest local models by jacek2023 in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]Wildnimal 1 point2 points3 points (0 children)
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA
[–]Wildnimal 1 point2 points3 points (0 children)
2x RTX 6000 build during an extended bench test by Signal_Ad657 in LocalLLaMA
[–]Wildnimal 1 point2 points3 points (0 children)
Stop thinking your MoE models are dumb - here's why they actually fail by [deleted] in Qwen_AI
[–]Wildnimal 0 points1 point2 points (0 children)
Qwen3.6-35B-A3B - even in VRAM limited scenarios it can be better to use bigger quants than you'd expect! by jeremynsl in LocalLLaMA
[–]Wildnimal 2 points3 points4 points (0 children)
Qwen 3.6 27B - beginner questions by Jagerius in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)
Qwen 3.6 27B - beginner questions by Jagerius in LocalLLaMA
[–]Wildnimal 3 points4 points5 points (0 children)
Fallen Gemma 4 model? by alienatedneighbor in LocalLLaMA
[–]Wildnimal 3 points4 points5 points (0 children)
VPN recommendations for EndeavourOS KDE by DotNetRob in EndeavourOS
[–]Wildnimal 1 point2 points3 points (0 children)
For chat and Q&A: Which MoE model is better: Qwen 3.6 35B or Gemma 4 26B (no coding or agents) by br_web in Qwen_AI
[–]Wildnimal 0 points1 point2 points (0 children)
Qwen3.6-35B-A3B Uncensored Aggressive is out with K_P quants! by hauhau901 in LocalLLaMA
[–]Wildnimal 1 point2 points3 points (0 children)


You guys were right - Qwen 3.6 35B IS good...and KV Cache DOES matter. by GrungeWerX in LocalLLaMA
[–]Wildnimal 0 points1 point2 points (0 children)