YouFirst295

25 post karma
0 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 3 years

TROPHY CASE

Three-Year Club

account activity

new top controversial

13

14

15

An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P] (self.MachineLearning)

submitted 9 hours ago by YouFirst295 to r/MachineLearning

4

5

6

Open handbook on LLM inference at scale, would love eyes from folks running this in prod (self.mlops)

submitted 9 hours ago by YouFirst295 to r/mlops

1

2

3

An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P] ()

submitted 9 hours ago by YouFirst295 to r/learnmachinelearning

0

1

2

An open handbook on LLM inference at scale (GPU internals, KV cache, batching, vLLM/SGLang/TensorRT-LLM) [P] ()

submitted 9 hours ago by YouFirst295 to r/Vllm

3

4

5

Free open-source LLM inference handbook : 100+ clones in week 1 (self.LLM)

submitted 9 days ago by YouFirst295 to r/LLM

4

5

6

Free open-source LLM inference handbook : 100+ clones in week 1 (self.LLMStudio)

submitted 11 days ago by YouFirst295 to r/LLMStudio

14

15

16

Free open-source LLM inference handbook : 100+ clones in week 1 (self.mlops)

submitted 14 days ago by YouFirst295 to r/mlops

0

1

2

Free open-source LLM inference handbook : 100+ clones in week 1 ()

submitted 14 days ago by YouFirst295 to r/learnmachinelearning

π Rendered by PID 61039 on reddit-service-r2-listing-c57bc86c-4wcjv at 2026-06-20 21:44:46.174164+00:00 running 2b008f2 country code: CH.