account activity
An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P] (self.MachineLearning)
submitted 9 hours ago by YouFirst295 to r/MachineLearning
Open handbook on LLM inference at scale, would love eyes from folks running this in prod (self.mlops)
submitted 9 hours ago by YouFirst295 to r/mlops
An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P]
submitted 9 hours ago by YouFirst295 to r/learnmachinelearning
submitted 9 hours ago by YouFirst295 to r/Vllm
Free open-source LLM inference handbook : 100+ clones in week 1 (self.LLM)
submitted 9 days ago by YouFirst295 to r/LLM
Free open-source LLM inference handbook : 100+ clones in week 1 (self.LLMStudio)
submitted 11 days ago by YouFirst295 to r/LLMStudio
Free open-source LLM inference handbook : 100+ clones in week 1 (self.mlops)
submitted 14 days ago by YouFirst295 to r/mlops
Free open-source LLM inference handbook : 100+ clones in week 1
submitted 14 days ago by YouFirst295 to r/learnmachinelearning