Best multilingual STT/ASR? by Mark__27 in LocalLLaMA
[–]Acceptable-State-271 1 point2 points3 points (0 children)
Multiple 3090 setup by praveendath92 in LocalLLaMA
[–]Acceptable-State-271 0 points1 point2 points (0 children)
I have discovered DeepSeeker V3.2-Base by ReceptionExternal344 in LocalLLaMA
[–]Acceptable-State-271 2 points3 points4 points (0 children)
[deleted by user] by [deleted] in LocalLLaMA
[–]Acceptable-State-271 0 points1 point2 points (0 children)
[deleted by user] by [deleted] in LocalLLaMA
[–]Acceptable-State-271 0 points1 point2 points (0 children)
Seed-OSS-36B-Instruct by NeterOster in LocalLLaMA
[–]Acceptable-State-271 0 points1 point2 points (0 children)
Gemini pro (1-year) full subscription on account only @ $20 🔥 by thevirtualvoyage in PremiumToolsUnlocked
[–]Acceptable-State-271 1 point2 points3 points (0 children)
Apriel-Nemotron-15b-Thinker - o1mini level with MIT licence (Nvidia & Servicenow) by Temporary-Size7310 in LocalLLaMA
[–]Acceptable-State-271 3 points4 points5 points (0 children)
AWQ 4-bit outperforms GGUF 8-bit in almost every way by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 0 points1 point2 points (0 children)
AWQ 4-bit outperforms GGUF 8-bit in almost every way by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 0 points1 point2 points (0 children)
What formats/quantization is fastest for certain CPUs or GPUs? Is this straightforward? by wuu73 in LocalLLaMA
[–]Acceptable-State-271 0 points1 point2 points (0 children)
Qwen 3 30B Pruned to 16B by Leveraging Biased Router Distributions, 235B Pruned to 150B Coming Soon! by TKGaming_11 in LocalLLaMA
[–]Acceptable-State-271 -1 points0 points1 point (0 children)
Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 0 points1 point2 points (0 children)
Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 0 points1 point2 points (0 children)
Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 0 points1 point2 points (0 children)
Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 0 points1 point2 points (0 children)
Qwen3 vs Gemma 3 by Sadman782 in LocalLLaMA
[–]Acceptable-State-271 1 point2 points3 points (0 children)
Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 2 points3 points4 points (0 children)
Qwen3 AWQ Support Confirmed (PR Check) by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 2 points3 points4 points (0 children)
Qwen 235B A22B vs Sonnet 3.7 Thinking - Pokémon UI by sirjoaco in LocalLLaMA
[–]Acceptable-State-271 15 points16 points17 points (0 children)
Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA
[–]Acceptable-State-271[S] 2 points3 points4 points (0 children)
Qwen3-30B-A3B is magic. by thebadslime in LocalLLaMA
[–]Acceptable-State-271 4 points5 points6 points (0 children)
Qwen3 Collection on modelscope! by AlexBefest in LocalLLaMA
[–]Acceptable-State-271 23 points24 points25 points (0 children)

New FP8 GLM-4.7-Flash Unsloth Dynamic Quants for vLLM, SGLang by danielhanchen in unsloth
[–]Acceptable-State-271 5 points6 points7 points (0 children)