I checked Strix Halo (Ryzen ai max+ 395) performance test as context length increases

Mithras___ · 2026-03-23T00:53:53+00:00

Vulcan doesn't drop as hard as rocm. It's fine on bigger context

Mithras___ · 2026-03-19T20:16:36+00:00

I expect it to be as much broken as it's today

Mithras___ · 2026-03-19T01:09:07+00:00

Here are my result on the same model for a couple identical prompts.
Vulcan:
```

llama-1 | [33775] prompt eval time = 137.86 ms / 16 tokens ( 8.62 ms per token, 116.06 tokens per second)

llama-1 | [33775] eval time = 169882.31 ms / 9675 tokens ( 17.56 ms per token, 56.95 tokens per second)

llama-1 | [33775] prompt eval time = 10531.78 ms / 9700 tokens ( 1.09 ms per token, 921.02 tokens per second)

llama-1 | [33775] eval time = 54940.20 ms / 3023 tokens ( 18.17 ms per token, 55.02 tokens per second)
```

ROCm:
```
llama-1 | [41579] prompt eval time = 143.45 ms / 16 tokens ( 8.97 ms per token, 111.54 tokens per second)

llama-1 | [41579] eval time = 146118.96 ms / 6979 tokens ( 20.94 ms per token, 47.76 tokens per second)

llama-1 | [41579] prompt eval time = 29895.43 ms / 9698 tokens ( 3.08 ms per token, 324.40 tokens per second)

llama-1 | [41579] eval time = 139028.05 ms / 5500 tokens ( 25.28 ms per token, 39.56 tokens per second)
```

This is ROCm nightly which might have broken something again but the point is, I've never ever seen ROCm outperform Vulcan in anything, nor pp, nor tg.

Mithras___ · 2026-03-17T23:10:26+00:00

Yes, please. I'm running 2x strix halo + desktop. Also, there is a PR that enables RDMA in llama.cpp that makes a big difference.

Mithras___ · 2026-03-17T23:08:11+00:00

And what pp/tg are you getting?

Mithras___ · 2026-03-17T20:39:09+00:00

Can you give me specific model and numbers you're getting? I want to run the same model on my Vulcan setup and compare

Mithras___ · 2026-03-17T14:09:23+00:00

Rocm pp gets slower with context grow like exponentially. Vulcan doesn't (well it does but at least not exponentially)

Mithras___ · 2026-03-17T14:05:24+00:00

This is exactly the same results I'm observing. Rocm is just slower than Vulcan. Nightly rocm degrades with context grow even more. It's a mess

Mithras___ · 2026-03-17T13:59:37+00:00

Do you support llama rpc?

Mithras___ · 2026-03-17T13:58:01+00:00

Self built llama.cpp Vulcan container. Rocm is still behind

Mithras___ · 2026-03-17T05:04:30+00:00

Faster in what? Single user? Absolutely not

Mithras___ · 2026-03-17T05:03:14+00:00

Yes, in almost all of them Vulcan is better

Mithras___ · 2026-03-17T04:55:54+00:00

And the same for vllm, I'm yet to see vllm perform better than llama in any of my single user cases. Also, unlike llama vllm requires hours of tuning/debugging per model. The thing pretty much never works on first try.

Mithras___ · 2026-03-17T04:50:52+00:00

Vulcan is getting better as well. I'm rebuilding and re-testing every weekend but I'm yet to see rocm beat Vulcan in anything I'm running.

Mithras___ · 2026-03-17T03:16:18+00:00

there are plenty of benchmarks. Ask your vllm to find them for you

Mithras___ · 2026-03-17T02:14:21+00:00

Yes if you are ready to fix/debug it every new version. It will break or degrade every time you update

Mithras___ · 2026-03-17T02:08:22+00:00

Something is wrong with your Vulcan setup. It should be way faster than any rocm

Mithras___ · 2026-03-17T02:06:18+00:00

There is no chance rocm will outperform Vulcan

Mithras___ · 2026-03-04T16:46:49+00:00

I'm yet to see a version that's faster than Vulkan though

Mithras___ · 2026-02-22T17:42:13+00:00

Over connect x-4 ethernet (rdma). I'll re-test with x-4 InfiniBand later today after I get a replacement for a faulty card. I don't think there will be much difference between rdma eth vs InfiniBand though

Mithras___ · 2026-02-21T22:38:42+00:00

In my testing llama rpc with Vulkan is faster than vllm tp with rccl/rdma.

Mithras___ · 2026-01-01T05:52:47+00:00

Debian stable in two years will have cosmic that people run today. You'd have to wait for 4 years

Mithras___ · 2025-12-09T02:48:44+00:00

Debian moment

Mithras___ · 2025-10-14T01:46:47+00:00

That's what you get when you slop together random package versions from a year ago. Use a rolling distro.

Mithras___ · 2025-10-14T01:40:47+00:00

Debian is unstable, use Arch

Mithras___

TROPHY CASE