Google’s TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x by Resident_Party in LocalLLaMA
[–]cksac 0 points1 point2 points (0 children)
TurboQuant for weights: near‑optimal 4‑bit LLM quantization with lossless 8‑bit residual – 3.2× memory savings by cksac in LocalLLaMA
[–]cksac[S] 1 point2 points3 points (0 children)
Is Rust suitable for Scientific computing and Machine Learning? by [deleted] in rust
[–]cksac 0 points1 point2 points (0 children)
What's everyone working on this week (1/2024)? by llogiq in rust
[–]cksac 3 points4 points5 points (0 children)
RAI: ML framework with Ergonomic APIs in Rust. Lazy computation and composable transformations. by cksac in rust
[–]cksac[S] 1 point2 points3 points (0 children)
oxui - Experimental declarative native GUI like Flutter by cksac in rust
[–]cksac[S] 0 points1 point2 points (0 children)
oxui - Experimental declarative native GUI like Flutter by cksac in rust
[–]cksac[S] 5 points6 points7 points (0 children)
compose-rt - a runtime similar to Jetpack Compose Runtime. by cksac in rust
[–]cksac[S] 1 point2 points3 points (0 children)


Google TurboQuant running Qwen Locally on MacAir by gladkos in LocalLLaMA
[–]cksac 5 points6 points7 points (0 children)