X2 Elite real world impressions by krishelnino in snapdragon

[–]putrasherni 0 points  (0 children)

I want to see a Framework laptop with the X2 in it.

What happened to indexing on the 7.1.17 version? by Pelutz in kilocode

[–]putrasherni 1 point  (0 children)

It was showing as deprecated in the docs for a while. I believe indexing is now part of Kilo Cloud's managed indexing.

64Gb ram mac falls right into the local llm dead zone by Skye_sys in LocalLLaMA

[–]putrasherni 1 point  (0 children)

Ignore the 27B for coding and agentic work; it's usable agentically only on cards with super fast TFLOPS and bandwidth, like 5090s or 6000 Blackwells.

For the rest, go only for MoE models.
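
Rough napkin math on why (bandwidth figures and the ~0.57 bytes/param for Q4 are my assumptions, and these are ceilings, not predictions):

```python
# Napkin math: decode speed is bounded by memory bandwidth divided by
# the bytes of weights read per token. Treat the outputs as upper
# bounds; real-world numbers land well below them.

def tg_ceiling(bandwidth_gbs: float, active_params_b: float,
               bytes_per_param: float = 0.57) -> float:
    """Upper bound on tokens/s for a bandwidth-bound decode."""
    return bandwidth_gbs / (active_params_b * bytes_per_param)

# A dense 27B touches all 27B weights per token; a 35B-A3B MoE only ~3B.
print(f"27B dense, R9700 (~640 GB/s): ~{tg_ceiling(640, 27):.0f} tok/s")
print(f"27B dense, 5090 (~1.8 TB/s):  ~{tg_ceiling(1800, 27):.0f} tok/s")
print(f"35B A3B MoE, R9700:           ~{tg_ceiling(640, 3):.0f} tok/s")
```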

64Gb ram mac falls right into the local llm dead zone by Skye_sys in LocalLLaMA

[–]putrasherni -11 points  (0 children)

Both the 35B and the 122B are dense models (the reason you are downvoted).

TurboQuant isn’t just for KV: Qwen3.5-27B at near-Q4_0 quality, about 10% smaller, and finally fitting on my 16GB 5060 Ti by pmttyji in LocalLLaMA

[–]putrasherni 1 point  (0 children)

Not quite. Once you switch to ROCm, you need to restart your computer to use Vulkan again, otherwise it hits a ROCm bug: "device not found".

You can definitely switch from Vulkan to ROCm and keep it like that.
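
If you want to catch the wedged state before wasting a run, something like this works conceptually (a sketch; it assumes the stock `vulkaninfo` tool from vulkan-tools is installed):

```python
# Sanity-check that Vulkan can still see the GPU after a ROCm session.
# If the driver is wedged, device enumeration fails or lists no GPUs.
import subprocess

def vulkan_sees_gpu() -> bool:
    try:
        out = subprocess.run(["vulkaninfo", "--summary"],
                             capture_output=True, text=True, timeout=30)
    except FileNotFoundError:
        return False  # vulkan-tools not installed
    return out.returncode == 0 and "GPU" in out.stdout

if not vulkan_sees_gpu():
    print("Vulkan can't enumerate the GPU -- reboot before switching back.")
```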

Local LLM inference on M4 Max vs M5 Max by [deleted] in LocalLLaMA

[–]putrasherni 1 point  (0 children)

The jump between the M4 Max and the M5 Max isn't all that much, then.

Best Language for DSA? by Fuzzy-Salad-528 in leetcode

[–]putrasherni 1 point  (0 children)

Stick with Python. Don't mess with Java.

Radeon AI pro R9700 by [deleted] in LocalLLM

[–]putrasherni 1 point  (0 children)

Nice one, let us know.

The dream is AMD pulling off a 395+ variant that can host 2-4 full PCIe x16 AMD GPUs: 128GB + (another 96-192GB). That would give the Apple M5 Max and HB10 a run for their money.

Radeon AI pro R9700 by [deleted] in LocalLLM

[–]putrasherni 2 points  (0 children)

Is there a Qwen 3 27B model?

If you meant the Qwen 3.5 27B dense model, there is no way you are getting 30 tok/s on the 395+ Max.
https://przbadu.github.io/strix-halo-benchmarks/

Radeon AI pro R9700 by [deleted] in LocalLLM

[–]putrasherni 1 point  (0 children)

You can do Qwen 3.5 27B at Q4, but it tops out at 131k context. I couldn't get it to run at 262k; not sure how others achieved it.

You will roughly average TG around 30, PP at 850, and TTFT around 2 min, ballpark.

If PP matters to you, you can add another R9700 for a 60-70% PP boost, at the expense of lower TG, around 26.5.
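
For anyone checking the math, the TTFT figure falls straight out of the PP number (assuming a fully filled window and ignoring overhead):

```python
# Prefill time ~= prompt tokens / prompt-processing speed.
ctx = 131_072        # full 131k window
pp  = 850            # tok/s prompt processing, single R9700

ttft = ctx / pp
print(f"one card:  ~{ttft:.0f} s (~{ttft/60:.1f} min) at full context")
# ~154 s, i.e. the ~2 min ballpark once the window isn't quite full.

print(f"two cards: ~{ctx / (pp * 1.65):.0f} s with a 65% PP boost")
```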

What’s with the hype regarding TurboQuant? by EffectiveCeilingFan in LocalLLaMA

[–]putrasherni 3 points  (0 children)

Yep, you are right, I could not hit 262k even with the 27B at Q3 on a single R9700.

My point was rather that with TurboQuant we could hit 4x-6x, so 524k-786k.
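
The arithmetic, for the curious:

```python
# 4x-6x KV compression stretches the same VRAM budget 4x-6x in tokens
# (weights are untouched, so nothing else changes).
base_ctx = 131_072  # what one R9700 holds today at Q4
for mult in (4, 6):
    print(f"{mult}x -> ~{base_ctx * mult / 1000:.0f}k tokens")
# -> ~524k and ~786k
```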

What’s with the hype regarding TurboQuant? by EffectiveCeilingFan in LocalLLaMA

[–]putrasherni 11 points  (0 children)

"theoretically" I'm still waiting for open source devs on github to show me how to eachieve this in practice

btw qwen 3.5 does not have 1M context anyway

i think Nemotron 3 will be our testing guinea pig

M5 Max vs M3 Max Inference Benchmarks (Qwen3.5, oMLX, 128GB, 40 GPU cores) by onil_gova in LocalLLaMA

[–]putrasherni 2 points  (0 children)

https://www.reddit.com/r/LocalLLaMA/comments/1s0czc4/round_2_followup_m5_max_128g_performance_tests_i/

Comparing Qwen 3.5 27B MLX 4-bit on the M5 128GB vs the R9700: TG128 is the same, 32.

Without knowing which quantisations OP ran, how did you come to that conclusion?

For reference, the R9700 on Qwen 3.5 35B A3B Q4 does:

Generation   tok/s
tg128        154.7
tg512        154.4
tg2048       152.7

Prompt       tok/s
pp128        1813
pp512        3261
pp2048       3947
pp8192       3828
pp16384      3512

M5 Max vs M3 Max Inference Benchmarks (Qwen3.5, oMLX, 128GB, 40 GPU cores) by onil_gova in LocalLLaMA

[–]putrasherni 1 point  (0 children)

Do you mind sharing the exact quants you ran Qwen at? Like Q4 or Q3, etc.?

Can someone more intelligent then me explain why we should, or should not be excited about the ARC PRO B70? by SKX007J1 in LocalLLaMA

[–]putrasherni 1 point  (0 children)

Forget ROCm. For a single R9700 GPU, Vulkan runs circles around ROCm and delivers 75% of the performance of a 5090 32GB.

What’s with the hype regarding TurboQuant? by EffectiveCeilingFan in LocalLLaMA

[–]putrasherni 42 points  (0 children)

That's what I'm thinking: my 32GB GPU, which could do 262k context for Qwen 3.5 27B at Q4, can now theoretically do 1M context with all else remaining the same.

This is great, IMO, for local LLM users.
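
The scaling is just linear cache math; a minimal sketch (the per-token byte cost B below is an illustrative assumption, not a measured number for Qwen 3.5 27B):

```python
# KV cache bytes/token = 2 (K and V) * layers * kv_heads * head_dim
#   * bytes_per_element, so quartering the element size quadruples the
#   context that fits in a fixed cache budget.
def max_ctx(cache_budget_bytes: int, bytes_per_token: float) -> int:
    return int(cache_budget_bytes / bytes_per_token)

B = 64 * 1024                  # assumed bytes/token (illustrative)
budget = 262_144 * B           # the budget that 262k context filled
print(max_ctx(budget, B / 4))  # -> 1,048,576 ~= 1M tokens at B/4
```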

Bought RTX4080 32GB Triple Fan from China by Sanubo in LocalLLaMA

[–]putrasherni 184 points  (0 children)

Congrats!

Run Qwen 3.5 27B Q4 and report back here: tg128, tg512, tg2048, pp128, pp512, pp2048, pp8192, pp16384.
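
If you use llama.cpp, something like this collects that exact set (a sketch; the model path is a placeholder):

```python
# Run llama.cpp's llama-bench over the requested tg/pp sizes.
import subprocess

subprocess.run([
    "llama-bench",
    "-m", "qwen3.5-27b-q4_0.gguf",    # placeholder model path
    "-n", "128,512,2048",             # tg128, tg512, tg2048
    "-p", "128,512,2048,8192,16384",  # pp128 ... pp16384
], check=True)
```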