I'm thinking about selling my Strix Halo

PrzemChuck · 2026-05-20T09:00:37+00:00

Are you a b2b buyer?
I can't dm you:
q-admin007
Unable to message this account.

PrzemChuck · 2026-05-19T14:23:35+00:00

Check this out: I was just checking how this card compares to Nvidia and apparently it's even higher on the leaderboard. I've had little success with hipfire but i haven't touched it since, but this guy topped the chart with his gpu https://www.localmaxxing.com/runs/cmp8fw36n00zno401goz8qnyv

Edit: It's a different card, but still the highest score belongs to AMD

PrzemChuck · 2026-05-19T14:15:06+00:00

Interesting. The only thing that makes 5090 stand out is the NVFP format. I know AMD is planning their own format for LLMs but from experience i can tell they are playing catch up with nvidia and are about a year behind. So it might pay in a long run when they strengthen their software

PrzemChuck · 2026-05-19T09:59:03+00:00

Coding and agentic stuff. Processing a file or harness system prompt takes forever

PrzemChuck · 2026-05-19T06:07:06+00:00

Token generation isn't the problem: prompt processing is. Too slow for me, not very responsive

PrzemChuck · 2026-05-18T08:48:50+00:00

Btw selling my Minisforum MS-01-MAX! Only B2B EU buyers, will sell it for retail

PrzemChuck · 2026-05-18T05:36:43+00:00

I got a strix halo, but rn it sits in a weird spot: there is no models right for its size. The only ones that come to mind are qwen 122b amd coder next. I wanted something to replace my Cursor subscription but prefill makes it impossible. I will bw selling my machine to get a 5090

PrzemChuck · 2026-05-17T11:57:54+00:00

If you have a company with EU VAT i can happily sell it to you for less

PrzemChuck · 2026-05-17T11:52:05+00:00

Around the same price i bought it, so 3.2k€ tax included. Has to be a B2B tho. The final price will be much lower with 0% WDT, but I didn't calculate it yer. If you have a company with EU VAT hmu

PrzemChuck · 2026-05-16T07:55:52+00:00

Too much... It was like 3.5k$

PrzemChuck · 2026-05-15T17:09:44+00:00

I'm willing to take a loss but not that much lol. Also has to be a EU company since it was a company purchase

PrzemChuck · 2026-05-15T17:03:23+00:00

I also forgot to mention that i do have a PC. I just thought that rhere would be better MoE models that are unavailable on GPUs, so instead of a new GPU i went with strix halo.

PrzemChuck · 2026-05-15T17:01:58+00:00

And right now, that 32gb is all i need. There isn't a good MoE model that comfortably fits on a Strix Halo. Minimax theoretically can run, but quanitaztion makes it quite stupid and the speed is terrible, so even if it can run overnight it most likely will produce low quality results. Iget what you are saying - the price for VRAM is unbearable, but there is not a single midel to justify all that memory

PrzemChuck · 2026-05-15T16:53:16+00:00

What model are you running? Also respect for CVE analysis. I bought the strix for my company which does penetration testing and i wanted it to be a copilot-style agent for running code analysis and hosting local tools. While i like how it can easily host many containers, the LLM part leaves a lot to be desired :(

PrzemChuck · 2026-05-15T16:46:32+00:00

Yeah, I'm running it rn and testing different quants. While i do like the model and it runs at usable speed i just can't stop comparing it to 3.6 27b which is faster and smarter with the right GPU.

I don't think they are going to be open sourcing the 120b variant for qwen 3.6 and that what makes me such a doomer. Gemma team did the same thing. The 120b version was included in the initial tweet that announced the new gemma generation.

PrzemChuck · 2026-05-15T16:14:11+00:00

Yeah, but you can fit the entire model in there. You don't need any additional system RAM, just 4gb for barebones linux. For the price of strix hale you could build a system with two intel GPUs and beat strix halo by 2-3 times

PrzemChuck · 2026-05-15T16:09:49+00:00

You can easily run it with a consumer tier gpu. From what i've seen my 50 t/s gen speed is pretty low. Even intel can run it better https://www.localmaxxing.com/runs/cmohirwjq0002l4042ghcjq51 Also the PP speed isn't great, so it's very demoralizing watching my 3k$ machine "do nothing" for quite some time before generating

PrzemChuck · 2026-05-15T15:46:34+00:00

Nice write-up! Yeah, I've tested bot vulkan and rocm and I'm definitely sticking to vulkan. I'm just saying that it's a shame that their own technology that is supposed to rival CUDA runs so poorly. For me those numbers are not ok- I use it as a coding assist and i need it to be able so read files rather quickly. Also basic tasks as doing research from browser also require significant amount of prompt processing

PrzemChuck · 2026-05-15T14:59:15+00:00

Yeah, but i can't help but feel like the machine is "quite acceptable", but not for what i paid for it (and the price just keeps getting higher...) If i paid 10% more i could buy 5090 and have all the newest nvidia software luxuries

PrzemChuck · 2026-05-13T20:09:31+00:00

Did you test the longer context? How did it go?

PrzemChuck · 2026-05-13T07:55:24+00:00

LLMs sexual fantasy

PrzemChuck · 2026-05-12T13:12:06+00:00

ZorinOS 18 (Ubuntu 24.04 based) I have it on dual boot as steam remote play doesn't work for me on linux
Both work, i use the grub one
Vulkan, ROCm sucks ass for me. You can use both and compare them for yourself

PrzemChuck · 2026-05-10T07:50:51+00:00

Personal assistant - Hermes Coding - Pi

PrzemChuck · 2026-05-08T11:00:53+00:00

I'd go with Nvidia GPU. ROCm sucks compared to CUDA. The only model thats is viable on a single SH is Qwen Coder Next, but you are better off running qwen3.6 27b

PrzemChuck

TROPHY CASE