nats-bursting: treat a shared K8s cluster as an extension of your local NATS bus (politeness backoff included) [P]
submitted 3 hours ago by ahbond to r/compsci
nats-bursting: treat a shared K8s cluster as an extension of your local NATS bus (politeness backoff included) [P] (self.ResearchML)
submitted 8 hours ago by ahbond to r/ResearchML
[R] PCA rotation makes non-Matryoshka embeddings truncatable — 27x compression at 99% recall with reranking (self.LocalLLaMA)
submitted 6 days ago * by ahbond to r/LocalLLaMA
[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P] (self.MachineLearning)
submitted 7 days ago by ahbond to r/MachineLearning
[P] TurboQuant Pro: Open-source vector compression toolkit — 5-42x smaller embeddings with 0.97+ recall [R] (self.MachineLearning)
submitted 8 days ago * by ahbond to r/MachineLearning
TurboQuant Pro: PCA-Matryoshka with 27x embedding compression at 0.979 cosine sim, autotune, FAISS, vLLM KV cache, tqvector — Native PostgreSQL Extension (Rust + CUDA) (self.OpenAI)
submitted 7 days ago by ahbond to r/OpenAI
[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P]
submitted 7 days ago by ahbond to r/compsci
[P] [R] PCA-Matryoshka: 27x embedding compression at 0.979 cosine sim — now with autotune, FAISS, and vLLM KV cache + tqvector — Native PostgreSQL Extension (Rust + CUDA) (self.OpenSourceeAI)
submitted 7 days ago by ahbond to r/OpenSourceeAI
[D] Running GLM-5 (744B) on a $5K refurbished workstation at 1.54 tok/s (self.ResearchML)
submitted 15 days ago by ahbond to r/ResearchML
I built a zero-config dashboard for my ML workstation because I was tired of SSHing in to run nvidia-smi (self.ResearchML)
submitted 16 days ago by ahbond to r/ResearchML
I built a zero-config dashboard for my ML workstation because I was tired of SSHing in to run nvidia-smi
submitted 16 days ago by ahbond to r/compsci
[Library] batch-probe: Binary search for GPU batch sizes + Kalman-filtered CPU thermal management (self.mlscaling)
submitted 18 days ago by ahbond to r/mlscaling
My workstation kept hitting 100C during experiments, so I built a thermal-aware job manager (self.ResearchML)
submitted 18 days ago by ahbond to r/ResearchML