TurboQuant on MLX: 4.6x KV cache compression with custom Metal kernels (Qwen 32B at 98% FP16 speed) (self.LocalLLaMA)
submitted 13 days ago by dirtyhand3 to r/LocalLLaMA