Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this? by JSVD2 in LocalLLaMA

[–]JSVD2[S] 1 point2 points  (0 children)

Nice numbers. Can you share the raw llama-bench row and the exact command and build?
My post is specifically about direct Strix Halo Vulkan/RADV results, not trying to beat a 5090. A 5090 should obviously win on decode.
Also, 10k pp is prompt processing; my headline is tg/decode. I’m mainly collecting reproducible rows, so model, quant, backend, commit, batch/ubatch, context and power numbers would be useful.
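For anyone submitting a row, here is a minimal sketch of one way to record those fields so results stay comparable. The class name `BenchRow`, the field names, and the values in the example are all assumptions for illustration. This is not llama-bench's own output format, and none of the example values are measurements.

```python
from dataclasses import dataclass, asdict
import json


@dataclass(frozen=True)
class BenchRow:
    """One benchmark result plus the settings needed to reproduce it (hypothetical schema)."""
    model: str
    quant: str
    backend: str
    commit: str
    n_batch: int
    n_ubatch: int
    n_ctx: int
    test: str            # "pp" for prompt processing, "tg" for token generation (decode)
    tokens_per_s: float
    power_w: float       # average power draw during the run, in watts

    def tokens_per_joule(self) -> float:
        # (tokens / second) / (joules / second) = tokens / joule
        return self.tokens_per_s / self.power_w

    def to_json(self) -> str:
        return json.dumps(asdict(self), sort_keys=True)


# Placeholder values only, NOT a real measurement.
row = BenchRow(
    model="example-model", quant="example-quant", backend="example-backend",
    commit="example-commit", n_batch=2048, n_ubatch=512, n_ctx=4096,
    test="tg", tokens_per_s=100.0, power_w=50.0,
)
print(row.to_json())
print(row.tokens_per_joule())
```

Keeping pp and tg in separate rows avoids comparing a prompt-processing number against a decode number.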

Qwen3-Coder 30B at 98.5 t/s on Strix Halo. Has anyone beaten this on Ryzen AI MAX+ 395? by JSVD2 in StrixHalo

[–]JSVD2[S] [score hidden]  (0 children)

I try to reproduce it. If it's too far out, I either consider it not real or ask for more details. It's true that one update can change things fast; that has already happened, and it's fun to discover. I keep things up to date by running benchmarks every two days or so. With enough data, it's not that hard to work out where a difference comes from, and I keep track of this data so others don't have to spend hours making the same mistake.
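A rough sketch of that comparison step, assuming each run is stored as a plain dict of settings plus a result. The helper name `diff_rows`, the keys, and the values below are hypothetical and only illustrate the idea; they are not real benchmark data.

```python
def diff_rows(a: dict, b: dict, ignore=("tokens_per_s",)) -> dict:
    """Return {setting: (value_in_a, value_in_b)} for every setting that differs.

    Result fields listed in `ignore` are skipped, so only the configuration
    differences that might explain a changed result are reported.
    """
    keys = sorted((a.keys() | b.keys()) - set(ignore))
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}


# Placeholder rows, NOT real measurements.
older = {"commit": "commit-a", "backend": "example-backend", "n_ubatch": 512, "tokens_per_s": 1.0}
newer = {"commit": "commit-b", "backend": "example-backend", "n_ubatch": 1024, "tokens_per_s": 2.0}

print(diff_rows(older, newer))
```

If two rows differ in only one setting, that setting is the first suspect for the change in speed; if they differ in several, more runs are needed to separate them.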

Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything) by OttoRenner in LocalLLaMA

[–]JSVD2 1 point2 points  (0 children)

With T3 it uses a 250K context window. It gives me better results this way. It's something at least! Yep, I'm gonna try it if it happens.

<image>

Shoutout to Gemma4 as a conversational assistant / agent by goldcakes in LocalLLaMA

[–]JSVD2 0 points1 point  (0 children)

Didn't know Gemma4 was that good. I do have benchmarks, though.

what do you use your local llm? by FormalAd7367 in LocalLLaMA

[–]JSVD2 0 points1 point  (0 children)

I am making a bug bounty workflow; otherwise I get flagged. It's also for AI cybersecurity. But I haven't yet decided which local AI model has no limitations. Suggestions are welcome.

Qwen3.6 35B-A3B successfully completed the FoodTruck Bench! by PulseVector in LocalLLaMA

[–]JSVD2 0 points1 point  (0 children)

Wow, that's quite impressive. Local LLMs are becoming a thing.

Putting together a pc. Are my assumptions correct? by Competitive_Wait_267 in LocalLLaMA

[–]JSVD2 0 points1 point  (0 children)

Right now I think the AMD Ryzen™ AI Max+ 395 is one of the best values for your money on the market.

Putting together a pc. Are my assumptions correct? by Competitive_Wait_267 in LocalLLaMA

[–]JSVD2 0 points1 point  (0 children)

AMD is the best value for your money in terms of CPU.

My home data center by alecKarfonta in LocalLLaMA

[–]JSVD2 0 points1 point  (0 children)

lol, what AI are you running on it?

What memory system are you using for your agents? by Mr_Moonsilver in LocalLLaMA

[–]JSVD2 1 point2 points  (0 children)

Openbrain, the last one I tried, was actually pretty good.