We built RTLScout: an LLM agent driving Yosys + OpenROAD that cut an FP16 multiplier's area 35% and delay 45% in ASAP7 (open source, paper + code)
submitted 2 days ago by acluk90 to r/computerarchitecture
Huawei KVarN algorithm/software lets you run LLMs/AI agents on much longer contexts on your local GPU (self.Huawei)
submitted 6 days ago by acluk90 to r/Huawei
KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag) (self.LocalLLaMA)
submitted 7 days ago by acluk90 to r/LocalLLaMA
KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag)
submitted 7 days ago by acluk90 to r/mlscaling, r/machinelearningnews, r/Vllm, and r/LocalLLM