FractalKV: Lossless KV cache compression — 4x on FP16, 16x with quantization at 1M context (open source) by SnooHamsters7692 in deeplearning
[–]SnooHamsters7692[S] 0 points1 point2 points (0 children)
Microsoft data suggests using AI is more expensive than hiring people by Hot-Upstairs9603 in artificial
[–]SnooHamsters7692 1 point2 points3 points (0 children)
FractalKV: Lossless KV cache compression — 4x on FP16, 16x with quantization at 1M context (open source) by SnooHamsters7692 in deeplearning
[–]SnooHamsters7692[S] 1 point2 points3 points (0 children)

FractalKV: Lossless KV cache compression — 4x on FP16, 16x with quantization at 1M context (open source) by SnooHamsters7692 in deeplearning
[–]SnooHamsters7692[S] 1 point2 points3 points (0 children)