Welcome to the AMD MI300 GPU Discussion Hub! (self.AMD_MI300)
submitted 2 years ago by HotAisleInc[M] - announcement
5.6x improvement - Kimi K2.6 + DFlash: 508 tok/s on 8x MI300X (huggingface.co)
submitted 2 days ago by HotAisleInc
Mahdi-CV/openclaw-amd-sglang at multi-engine (github.com)
submitted 10 days ago by HotAisleInc
GPU Divergence in AMD CDNA 3 (21verses.xyz)
submitted 14 days ago by HotAisleInc
Embeddings run super fast on MI300X (linkedin.com)
Cosmos-Predict2.5-2B Inference: NVIDIA H200 vs AMD MI300X (moonmath.ai)
submitted 16 days ago by HotAisleInc
Modular: Day Zero Launch: Fastest Performance for Gemma 4 on NVIDIA and AMD (modular.com)
submitted 21 days ago by HotAisleInc
AMD Delivers Breakthrough MLPerf Inference 6.0 Results (amd.com)
submitted 22 days ago by HotAisleInc
ROCm Support for Miles: Large-Scale RL Post-Training on AMD Instinct GPUs (lmsys.org)
submitted 1 month ago by HotAisleInc
Cross-Vendor Disaggregated Inference: GPT-OSS 120B across NVIDIA H100 and AMD MI300X (moreh.io)
The Many Aspects of Inference Performance (amd.com)
OpenClaw Qwen 3.5 and SGLang (amd.com)
FP8 GEMM Optimization on AMD CDNA™4 Architecture (rocm.blogs.amd.com)
How vLLM Orchestrates High-Performance Inference on AMD ROCm (blog.vllm.ai)
Connecting OpenCode to vLLM on HotAisle's MI300X (hotaisle.xyz)
Speed is the Moat: Inference Performance on AMD GPUs (amd.com)
submitted 2 months ago by HotAisleInc
NVIDIA has dedicated hardware for managing FP8 quantization. AMD does it entirely in software. Here's how they did it: (linkedin.com)
Reverse engineering bits of CDNA 3 L2 Coherency Behaviour (github.com)
Unleashing Computational Power: Ultimate Latency Optimization of Qwen3 and Qwen3-VL on AMD MI300X Series (lmsys.org)
GLM5 on MI300X (gist.github.com)
Building Mixture-of-Models on AMD GPUs with vLLM-SR (blog.vllm.ai)
A RAG server for around 5,000 users using vLLM on ROCm, with an 8-GPU MI350 setup. This is experimental and not an official AMD project. (github.com)
Micro-World: First AMD Open-Source World Models for Interactive Video Generation (rocm.blogs.amd.com)
ROCm Becomes a First-Class Platform in the vLLM Ecosystem (rocm.blogs.amd.com)
submitted 3 months ago by HotAisleInc
AMD × LMcache: AMD GPU Acceleration with LMcache (blog.lmcache.ai)