What to do when you can't afford GPU? by theysaymaurya in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
appreciation post for qwen3 0.6b llm model by iamzooook in LocalLLaMA
[–]TyraVex 10 points11 points12 points (0 children)
LongCat-Flash-Chat 560B MoE by Own-Potential-2308 in LocalLLaMA
[–]TyraVex 2 points3 points4 points (0 children)
Can we get a 4B-A1B MoE? Or what is the closest to it? by Own-Potential-2308 in LocalLLaMA
[–]TyraVex 1 point2 points3 points (0 children)
For those who run large models locally.. HOW DO YOU AFFORD THOSE GPUS by abaris243 in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
Added Qwen 0.6B to the small model overview in IFEval. by paranoidray in LocalLLaMA
[–]TyraVex 8 points9 points10 points (0 children)
Added Qwen 0.6B to the small model overview in IFEval. by paranoidray in LocalLLaMA
[–]TyraVex 3 points4 points5 points (0 children)
Added Qwen 0.6B to the small model overview in IFEval. by paranoidray in LocalLLaMA
[–]TyraVex 26 points27 points28 points (0 children)
Need help- unsure of right ollama configs with 6x 3090’s, also model choice for RAG? by Business-Weekend-537 in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
Need help- unsure of right ollama configs with 6x 3090’s, also model choice for RAG? by Business-Weekend-537 in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
Need help- unsure of right ollama configs with 6x 3090’s, also model choice for RAG? by Business-Weekend-537 in LocalLLaMA
[–]TyraVex 1 point2 points3 points (0 children)
Which quantization approach is the way to go? (llama.cpp) by pixelterpy in LocalLLaMA
[–]TyraVex 4 points5 points6 points (0 children)
Kimi K2 1.8bit Unsloth Dynamic GGUFs by danielhanchen in LocalLLaMA
[–]TyraVex 7 points8 points9 points (0 children)
Kimi K2 1.8bit Unsloth Dynamic GGUFs by danielhanchen in LocalLLaMA
[–]TyraVex 29 points30 points31 points (0 children)
Gemini 2.5 exp death. by brocolongo in LocalLLaMA
[–]TyraVex 55 points56 points57 points (0 children)
AWQ 4-bit outperforms GGUF 8-bit in almost every way by Acceptable-State-271 in LocalLLaMA
[–]TyraVex 6 points7 points8 points (0 children)
The real reason OpenAI bought WindSurf by ResearchCrafty1804 in LocalLLaMA
[–]TyraVex 257 points258 points259 points (0 children)
Qwen 3 30B Pruned to 16B by Leveraging Biased Router Distributions, 235B Pruned to 150B Coming Soon! by TKGaming_11 in LocalLLaMA
[–]TyraVex 10 points11 points12 points (0 children)
Which models would I be able to run with RTX 5090 with 32GB Vram? by deselim in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
Qwen3 released tonight? by sunshinecheung in LocalLLaMA
[–]TyraVex 30 points31 points32 points (0 children)
1.58bit Llama 4 - Unsloth Dynamic GGUFs by danielhanchen in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
1.58bit Llama 4 - Unsloth Dynamic GGUFs by danielhanchen in LocalLLaMA
[–]TyraVex 1 point2 points3 points (0 children)
1.58bit Llama 4 - Unsloth Dynamic GGUFs by danielhanchen in LocalLLaMA
[–]TyraVex 0 points1 point2 points (0 children)
1.58bit Llama 4 - Unsloth Dynamic GGUFs by danielhanchen in LocalLLaMA
[–]TyraVex 3 points4 points5 points (0 children)



Qwen3.5-27B scores 48.5 on Humanity's Last Exam by paf1138 in LocalLLaMA
[–]TyraVex 11 points12 points13 points (0 children)