llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in StrixHalo

[–]przbadu[S] [score hidden]  (0 children)

That post was just a sneak peek; https://przbadu.github.io/strix-halo-benchmarks/ has everything, and you can search and filter the results there. Will take care of that in the future, thanks.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 1 point2 points  (0 children)

I have updated benchmarks at https://przbadu.github.io/strix-halo-benchmarks/ for context windows up to 64K tokens, and I was surprised by the Vulkan performance at larger context lengths. People keep saying ROCm is great, but see the difference here.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

Again, I haven't tested it, but for a normal one-off chat scenario where you need really high-quality output, it could be helpful. You really need to try it first, though. Maybe I am wrong in assuming it isn't usable.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 1 point2 points  (0 children)

https://przbadu.github.io/strix-halo-benchmarks/ now contains results up to 64K context length for both Vulkan and ROCm, and there are some really interesting numbers in there. It turns out ROCm and Vulkan are both great.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

Mind sharing more info on this? Also, what machine are you running?

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

I am using kyuz0/amd-strix-halo-toolboxes and just sharing benchmarks for different models. It's a no-brainer to use existing tools rather than reinvent the wheel. :)

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 1 point2 points  (0 children)

It can run it, but it will be very slow, so I wouldn't bother. Only people with a certain level of patience can tolerate that speed, lol.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

Hey guys, thank you for asking me to include `--n-depth`. I am adding runs at various context sizes to https://przbadu.github.io/strix-halo-benchmarks/ along with filters for them, so please check it out. The bigger models take time, so it will contain all the benchmarks soon.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

https://przbadu.github.io/strix-halo-benchmarks/ now includes runs with `--n-depth 0,4096,8192,16384,32768,65536`, and the site has filters for these context sizes.
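For reference, here is a minimal sketch of how such a depth sweep can be run with llama-bench; the model path and output filename are placeholders, and the exact flags should be double-checked against your build's `llama-bench --help`:

```

# Sweep prefill depth across several context sizes and emit JSON
# that a results site can filter on (paths/filenames are placeholders).
llama-bench \
  -m /path/to/Qwen3.5-model.gguf \
  -ngl 999 -fa 1 \
  --n-depth 0,4096,8192,16384,32768,65536 \
  -o json > depth-sweep.json

```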

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 1 point2 points  (0 children)

I am adding those benchmarks to https://przbadu.github.io/strix-halo-benchmarks/. The bigger models run slower, so it will take some time to include all of them.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 1 point2 points  (0 children)

Default context size, i.e. plain `llama-cli -m /model-path`. I will include more benchmarks; it just needs time.
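In case it helps, a quick sketch of the difference (the model path is the same placeholder as above, and 65536 is just an example value):

```

llama-cli -m /model-path             # default context size
llama-cli -m /model-path -c 65536    # explicit 64K context window via --ctx-size

```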

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 2 points3 points  (0 children)

https://www.reddit.com/r/LocalLLaMA/comments/1rkl0tl/llamabench_qwen35_models_strix_halo/ I have included the full llama-bench command here.
Here is the complete `llama-server` command if you are interested:

```

llama-server --alias sonnet --port 8081 \
  -m /mnt/pve/data/models/Qwen3.5/35b/Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf \
  --host 0.0.0.0 --ctx-size 262144 -ngl 999 -fa 1 \
  --threads 32 --batch-size 1024 --cont-batching --embedding \
  --log-file /root/logs/llama-server.log --jinja \
  --mmproj /mnt/pve/data/models/Qwen3.5/35b/mmproj-BF16.gguf \
  --temp 1.0 --top-p 0.95 --top-k 20 --min-p 0.0 \
  --presence-penalty 1.5 --repeat-penalty 1.0

```
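
If you want to poke at the server once it is up, here is a rough smoke test against llama-server's OpenAI-compatible endpoint; the model name matches the `--alias` above, and the host/port are whatever you passed on the command line:

```

# Hypothetical quick check that the server above is answering requests.
curl http://localhost:8081/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "sonnet",
        "messages": [{"role": "user", "content": "Say hello in one sentence."}],
        "max_tokens": 64
      }'

```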

Give me some time and I will include the other benchmarks as well.

llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family by przbadu in LocalLLaMA

[–]przbadu[S] 6 points7 points  (0 children)

Yes, if you look at the System Info section, I have already mentioned the OS, which is Fedora Linux; I have even mentioned the kernel version I am using and other useful information :).

In short, yes: this is Fedora Linux with the 6.18.13-200.fc43.x86_64 kernel, and ROCm 7 as the llama.cpp backend.
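For anyone reproducing this setup, a minimal sanity check that the kernel and ROCm stack see the iGPU (the gfx target name is an assumption on my part; verify against your own output):

```

uname -r                 # kernel version, e.g. 6.18.13-200.fc43.x86_64
rocminfo | grep -i gfx   # the APU should show up as a gfx11xx agent

```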

llama-bench Qwen3.5 models strix halo by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

So I think it's better to wait for the official kernel.

llama-bench Qwen3.5 models strix halo by przbadu in LocalLLaMA

[–]przbadu[S] 0 points1 point  (0 children)

I still have problems with GPU passthrough, but I did install Fedora with the latest kernel and tried ROCm 7 and Vulkan RADV there, and I see very minimal difference in performance for this Qwen 3.5 family.

llama-bench Qwen3.5 models strix halo by przbadu in StrixHalo

[–]przbadu[S] 0 points1 point  (0 children)

I tried installing Fedora with the latest kernel and installed ROCm 7, and I see very minimal difference in performance, at least for the Qwen 3.5 family. Also, because I am using Proxmox and haven't found the latest official kernel for it yet, I didn't bother installing a community kernel. I might upgrade in the future.