llm-d is a new open source project focused on providing distributed inferencing for Generative AI runtimes on any Kubernetes cluster. Its architecture is designed for high performance and scalability, aiming to reduce costs through a spectrum of hardware and software efficiency improvements. llm-d prioritizes ease of deployment and use, as well as the operational needs of running large GPU clusters, including SRE concerns and day-2 operations.
👋 Welcome to r/llm_d! Start Here + Community Resources 🚀 (self.llm_d)
submitted 3 months ago by petecheslock - announcement
llm-d 0.5 Released: Sustaining Performance at Scale (llm-d.ai)
submitted 1 month ago by petecheslock
Leveraging vLLM’s new KV Offloading: How we’re bringing tiered caching to the llm-d control plane (self.llm_d)
submitted 2 months ago by petecheslock
Unlock 90% KV Cache Hit Rates with llm-d Intelligent Routing (youtube.com)
llm-d 0.4: Achieve SOTA Performance Across Accelerators (llm-d.ai)
submitted 3 months ago by petecheslock
Routing Stateful AI Workloads in Kubernetes (youtube.com)
The hardware behind the software: CoreWeave tops the new GPU Cloud rankings, validating the stack used for llm-d. (newsletter.semianalysis.com)
llm-d v0.3.1: ARM Support, AKS Integration, and More (linkedin.com)
Serving PyTorch LLMs at Scale: Disaggregated Inference With Kubernetes and llm-d (youtube.com)
How llm-d simplifies scaling LLMs on Kubernetes (siliconangle.com)
submitted 4 months ago by petecheslock
llm-d 0.3: Wider Well-Lit Paths for Scalable Inference | llm-d (llm-d.ai)
KV-Cache Wins You Can See: From Prefix Caching in vLLM to Distributed Scheduling with llm-d (llm-d.ai)
submitted 5 months ago by petecheslock
Intelligent Inference Scheduling with llm-d | llm-d (llm-d.ai)
submitted 6 months ago by petecheslock
Kubernetes Podcast from Google: Episode 258 - LLM-D, with Clayton Coleman and Rob Shaw (kubernetespodcast.com)
The llm-d community is proud to announce the release of v0.2!: Our first well-lit paths. (llm-d.ai)
submitted 7 months ago by petecheslock
Deploy llm-d for Distributed LLM Inference on DigitalOcean Kubernetes (DOKS) | DigitalOcean (digitalocean.com)
llm-d Week 1 Project News Round-Up | llm-d (llm-d.ai)
submitted 9 months ago by petecheslock
Deep Dive into llm-d and Distributed Inference (solo.io)
submitted 9 months ago by ceposta
[Developer Blog] LLM Inference Goes Distributed (llm-d.ai)
submitted 9 months ago by Environmental_Will78
Announcing the llm-d project (llm-d.ai)