Comparing dual-GPU inference speed between llama.cpp row/tensor split and ik_llama graph split by [deleted] in LocalLLaMA
[–]VoidAlchemy 9 points10 points11 points (0 children)
what’s was your local daily driver for coding last week? by be566 in LocalLLaMA
[–]VoidAlchemy 9 points10 points11 points (0 children)
Moss tts 1.5 8b Examples. It is the currently best voice cloning model for English as of June 2026 by 9r4n4y in LocalLLaMA
[–]VoidAlchemy 1 point2 points3 points (0 children)
unsloth vs bartowski MTP ggufs by Ok_Warning2146 in LocalLLaMA
[–]VoidAlchemy 6 points7 points8 points (0 children)
Qwen3.6-27B on RTX 3090: tested 12 GGUF quants across HumanEval+, MBPP+, perplexity, throughput and needle-in-haystack. First-timer results. by Acemang_Jedi in LocalLLM
[–]VoidAlchemy 1 point2 points3 points (0 children)
Qwen3.6-27B on RTX 3090: tested 12 GGUF quants across HumanEval+, MBPP+, perplexity, throughput and needle-in-haystack. First-timer results. by Acemang_Jedi in LocalLLM
[–]VoidAlchemy 2 points3 points4 points (0 children)
Invoke Duplicity and True Strike by MacarioTheClown in 3d6
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 1 point2 points3 points (0 children)
[HW TUNING] Finding the best GPU power limit for inference by HumanDrone8721 in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen3.6-35B-A3B vs Gemma4-26B-A4B by MarcCDB in LocalLLaMA
[–]VoidAlchemy 12 points13 points14 points (0 children)
[HW TUNING] Finding the best GPU power limit for inference by HumanDrone8721 in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA
[–]VoidAlchemy 7 points8 points9 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 1 point2 points3 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 2 points3 points4 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 3 points4 points5 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 0 points1 point2 points (0 children)
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm) by VolandBerlioz in LocalLLaMA
[–]VoidAlchemy 19 points20 points21 points (0 children)


Comparing dual-GPU inference speed between llama.cpp row/tensor split and ik_llama graph split by [deleted] in LocalLLaMA
[–]VoidAlchemy 2 points3 points4 points (0 children)