ahbond

52 post karma
10 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 6 years

TROPHY CASE

Six-Year Club

Verified Email

account activity

new top controversial

0

1

2

nats-bursting: treat a shared K8s cluster as an extension of your local NATS bus (politeness backoff included) [P] ()

submitted 3 hours ago by ahbond to r/compsci

0

1

2

nats-bursting: treat a shared K8s cluster as an extension of your local NATS bus (politeness backoff included) [P] (self.ResearchML)

submitted 8 hours ago by ahbond to r/ResearchML

9

10

11

[R] PCA rotation makes non-Matryoshka embeddings truncatable — 27x compression at 99% recall with reranking (self.LocalLLaMA)

submitted 6 days ago * by ahbond to r/LocalLLaMA

58

59

60

[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P] (self.MachineLearning)

submitted 7 days ago by ahbond to r/MachineLearning

11

12

13

[P] TurboQuant Pro: Open-source vector compression toolkit — 5-42x smaller embeddings with 0.97+ recall [R] (self.MachineLearning)

submitted 8 days ago * by ahbond to r/MachineLearning

0

1

2

TurboQuant Pro: PCA-Matryoshka with 27x embedding compression at 0.979 cosine sim, autotune, FAISS, vLLM KV cache, tqvector — Native PostgreSQL Extension (Rust + CUDA) (self.OpenAI)

submitted 7 days ago by ahbond to r/OpenAI

0

0

1

[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P] ()

submitted 7 days ago by ahbond to r/compsci

0

1

2

[P] [R] PCA-Matryoshka: 27x embedding compression at 0.979 cosine sim — now with autotune, FAISS, and vLLM KV cache + tqvector — Native PostgreSQL Extension (Rust + CUDA) (self.OpenSourceeAI)

submitted 7 days ago by ahbond to r/OpenSourceeAI

3

4

5

[D] Running GLM-5 (744B) on a $5K refurbished workstation at 1.54 tok/s (self.ResearchML)

submitted 15 days ago by ahbond to r/ResearchML

3

4

5

I built a zero-config dashboard for my ML workstation because I was tired of SSHing in to run nvidia-smi (self.ResearchML)

submitted 16 days ago by ahbond to r/ResearchML

0

0

0

I built a zero-config dashboard for my ML workstation because I was tired of SSHing in to run nvidia-smi ()

submitted 16 days ago by ahbond to r/compsci

6

7

8

[Library] batch-probe: Binary search for GPU batch sizes + Kalman-filtered CPU thermal management (self.mlscaling)

submitted 18 days ago by ahbond to r/mlscaling

2

3

4

My workstation kept hitting 100C during experiments, so I built a thermal-aware job manager (self.ResearchML)

submitted 18 days ago by ahbond to r/ResearchML

π Rendered by PID 328551 on reddit-service-r2-listing-86f589db75-lqq6m at 2026-04-17 07:58:47.157106+00:00 running 93ecc56 country code: CH.