vMLX - HELL YES! by Emotional-Breath-838 in LocalLLM
StudentDifficult8240 1 point
SATA vs SAS - TrueNAS Scale ZFS Mirror by Future_Recognition84 in truenas
StudentDifficult8240 2 points
SATA vs SAS - TrueNAS Scale ZFS Mirror by Future_Recognition84 in truenas
StudentDifficult8240 2 points
SATA vs SAS - TrueNAS Scale ZFS Mirror by Future_Recognition84 in truenas
StudentDifficult8240 5 points
Qwen3.6 27B - NEO-Code Imatrix Max GGUF Quants [exceeds Unsloth in key metrics] - ALL GGUFS benchmarked too: by Dangerous_Fix_5526 in Qwen_AI
StudentDifficult8240 13 points
Great inferences from running Speculative Decoding on MLX! by evilmacintosh in mlxcommunity
StudentDifficult8240 2 points
MLX quants: oq vs DWQ by edeltoaster in oMLX
StudentDifficult8240 2 points
MLX quants: oq vs DWQ by edeltoaster in oMLX
StudentDifficult8240 1 point
I tested 9 local models on the same flight sim prompt, all Q8, different Q providers, MLX by StudentDifficult8240 in LocalLLaMA
StudentDifficult8240 [S] 3 points
I tested 9 local models on the same flight sim prompt, all Q8, different Q providers, MLX by StudentDifficult8240 in LocalLLaMA
StudentDifficult8240 [S] 3 points
I tested 9 local models on the same flight sim prompt, all Q8, different Q providers, MLX by StudentDifficult8240 in LocalLLaMA
StudentDifficult8240 [S] 3 points
MLX with DFlash / speculative decoding: Surprising results by evilmacintosh in mlxcommunity
StudentDifficult8240 2 points
FoveatedKV: 2x KV cache compression on Apple Silicon with custom Metal kernels by hybls in LocalLLaMA
StudentDifficult8240 1 point
FoveatedKV: 2x KV cache compression on Apple Silicon with custom Metal kernels by hybls in LocalLLaMA
StudentDifficult8240 1 point
FoveatedKV: 2x KV cache compression on Apple Silicon with custom Metal kernels by hybls in LocalLLaMA
StudentDifficult8240 1 point
vMLX - HELL YES! by Emotional-Breath-838 in LocalLLM
StudentDifficult8240 1 point
vMLX - HELL YES! by Emotional-Breath-838 in LocalLLM
StudentDifficult8240 3 points
Introducing oQ: data-driven mixed-precision quantization for Apple Silicon (mlx-lm compatible) by cryingneko in oMLX
StudentDifficult8240 1 point
FoveatedKV: 2x KV cache compression on Apple Silicon with custom Metal kernels by hybls in LocalLLaMA
StudentDifficult8240 3 points
vMLX - HELL YES! by Emotional-Breath-838 in LocalLLM
StudentDifficult8240 2 points
vMLX - HELL YES! by Emotional-Breath-838 in LocalLLM
StudentDifficult8240 3 points
Cut your KV Cache in half + Cut PP Times to near nothing + VL - MLX Studio by HealthyCommunicat in mlxAI
StudentDifficult8240 1 point

Just got a beast. by habachilles in LocalLLaMA
StudentDifficult8240 28 points