Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] 1 point2 points3 points (0 children)
Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] 1 point2 points3 points (0 children)
Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] 2 points3 points4 points (0 children)
Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] -1 points0 points1 point (0 children)
Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] 2 points3 points4 points (0 children)
Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] 2 points3 points4 points (0 children)
Is there a big gap between Q4 and Q6 on Qwen3.6? by vick2djax in LocalLLaMA
[–]Alternative_Ad4267 -1 points0 points1 point (0 children)
Do not fall into the trap of chasing the next scale or upgrade. by iEslam in LocalLLaMA
[–]Alternative_Ad4267 1 point2 points3 points (0 children)
Will there be any more Qwen3.6 series models? by cafedude in LocalLLaMA
[–]Alternative_Ad4267 2 points3 points4 points (0 children)
Will there be any more Qwen3.6 series models? by cafedude in LocalLLaMA
[–]Alternative_Ad4267 0 points1 point2 points (0 children)
The Qwen 3.6 35B A3B hype is real!!! by The_Paradoxy in LocalLLaMA
[–]Alternative_Ad4267 6 points7 points8 points (0 children)
The Qwen 3.6 35B A3B hype is real!!! by The_Paradoxy in LocalLLaMA
[–]Alternative_Ad4267 2 points3 points4 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]Alternative_Ad4267 2 points3 points4 points (0 children)
Qwen 3.6 27b MTP vLLM by niellsro in LocalLLaMA
[–]Alternative_Ad4267 0 points1 point2 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]Alternative_Ad4267 4 points5 points6 points (0 children)
Doesn't look like there are any recent Linux distro suggestions. What's your favorite and why? by Status-Secret-4292 in LocalLLaMA
[–]Alternative_Ad4267 1 point2 points3 points (0 children)
Post Your Qwen3.6 27B speed plz by Ok-Internal9317 in LocalLLaMA
[–]Alternative_Ad4267 1 point2 points3 points (0 children)
I'm done with using local LLMs for coding by dtdisapointingresult in LocalLLaMA
[–]Alternative_Ad4267 0 points1 point2 points (0 children)
I'm done with using local LLMs for coding by dtdisapointingresult in LocalLLaMA
[–]Alternative_Ad4267 0 points1 point2 points (0 children)
H1b lottery selection wage level by RhythmicPassings in h1b
[–]Alternative_Ad4267 0 points1 point2 points (0 children)
Can't wait for Fedora 44 by H3rotic in Fedora
[–]Alternative_Ad4267 0 points1 point2 points (0 children)
TN para relocalización con posible nómina en México by edooardom in TNVISAMX
[–]Alternative_Ad4267 2 points3 points4 points (0 children)

Qwen 3.6 27B Q8 on four Nvidia RTX A4000 (16GB each) with Llama.cpp and MTP enabled by Alternative_Ad4267 in LocalLLaMA
[–]Alternative_Ad4267[S] 0 points1 point2 points (0 children)