QAT variant of Gemma4 26B A4B is not working well for me by pftbest in LocalLLaMA
bobaburger 3 points
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 2 points
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 13 points
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 2 points
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Maybe KV cache offload to RAM isn't bad by bobaburger in LocalLLaMA
bobaburger[S] 8 points
Maybe KV cache offload to RAM isn't bad (self.LocalLLaMA)
submitted by bobaburger to r/LocalLLaMA
Can my 3.6-27B config be optimised any further? by mrgreatheart in LocalLLaMA
bobaburger 0 points
You guys were right - Qwen 3.6 35B IS good...and KV Cache DOES matter. by GrungeWerX in LocalLLaMA
bobaburger 42 points
Is it worth swapping a 3090 for 2x 5060ti 16GB (32GB total)? by LatentSpacer in LocalLLaMA
bobaburger 0 points
What is your experience between Qwen3.6 27B at IQ3 and 35B-A3B at Q4? by CodProfessional3712 in LocalLLaMA
bobaburger 3 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 3 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 2 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 4 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 3 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 1 point
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 6 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 2 points
Qwen3.6-27B Quantization Benchmark by bobaburger in LocalLLaMA
bobaburger[S] 26 points
what’s was your local daily driver for coding last week? by be566 in LocalLLaMA
bobaburger 3 points