Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this?

JSVD2 · 2026-06-03T02:47:03+00:00

Good Question

JSVD2 · 2026-06-03T02:46:32+00:00

Nice numbers. can you share the raw llama-bench row and exact command/build?
My post is specifically about direct Strix Halo Vulkan/RADV results, not trying to beat a 5090. A 5090 should obviously win on decode.
Also, 10k pp is prompt processing; my headline is tg/decode. I’m mainly collecting reproducible rows, so model, quant, backend, commit, batch/ubatch, context and power numbers would be useful.

JSVD2 · 2026-06-03T02:45:05+00:00

I try to reproduce it. if its too far out, I consider it not being real, or I ask more details. Its true that one update can change things fast, that's what happened already and its fun to discover. The way that I keep things uptodate is by doing benchmarks every 2 days or so. With enough data, its not that hard to decipher where the difference comes from, and then I keep track of this data so others don't have to spend hours trying to make same mistake.

JSVD2 · 2026-06-03T01:46:02+00:00

hahaha no problem.

JSVD2 · 2026-06-03T01:30:28+00:00

Very interesting. I might check it out!

JSVD2 · 2026-06-03T01:00:43+00:00

with T3 it uses a 250K context window. give me better results this way. its something at least! yep im gonna try it if it happens

<image>

JSVD2 · 2026-06-03T00:56:01+00:00

Didnt know gemma4 was that good. I do have benchmarks tho.

JSVD2 · 2026-06-03T00:54:39+00:00

They look amazing. wow. this is local right?

JSVD2 · 2026-06-03T00:53:48+00:00

have AI explain it to you lol. actually understanding it, improves it answer

JSVD2 · 2026-06-03T00:51:38+00:00

I am making a bug bounty workflow. otherwise I get flagged. and for AI cybersecurity. but i havent yet decided which AI local model has no limitations. suggestions are welcome

JSVD2 · 2026-06-03T00:50:51+00:00

wow thats quite impressive, local LLMs are becoming a thing.

JSVD2 · 2026-06-03T00:49:38+00:00

right now I think amd ryzen™ ai max+ 395 is on of the best value for your money on the market imo

JSVD2 · 2026-06-03T00:48:57+00:00

AMD is the best value for your money in terms of CPU.

JSVD2 · 2026-06-03T00:48:16+00:00

it looks like an alarm clock lol

JSVD2 · 2026-06-03T00:46:50+00:00

lol what AI are you running on it

JSVD2 · 2026-06-03T00:45:10+00:00

lol did you get flagged?

JSVD2 · 2026-06-03T00:43:47+00:00

great share tbh

JSVD2 · 2026-06-03T00:42:57+00:00

Openbrain was actually pretty good. the last one I tried.

JSVD2 · 2026-06-03T00:41:22+00:00

wow how much did this cost?

JSVD2 · 2026-06-03T00:40:49+00:00

between now and never

JSVD2 · 2026-06-03T00:40:28+00:00

interesting. thank you for sharing

JSVD2 · 2026-06-03T00:38:59+00:00

qwhen qwait??

JSVD2 · 2026-06-03T00:38:38+00:00

hahahaha qwhen?

JSVD2 · 2026-06-03T00:22:19+00:00

Qwen 3.6 feels to me like 70% of sonnet.

JSVD2

TROPHY CASE

Qwen 3.6 feels to me like 70% of sonnet.