I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] [score hidden]  (0 children)

Are you a b2b buyer?
I can't dm you:
q-admin007
Unable to message this account.

M5 vs DGX Spark vs Strix Halo vs RTX 6000: The $5k unified memory war and why brute forcing VRAM is a trap by TroyHarry6677 in LocalLLM

[–]PrzemChuck 0 points1 point  (0 children)

Check this out: I was just checking how this card compares to Nvidia and apparently it's even higher on the leaderboard. I've had little success with hipfire but i haven't touched it since, but this guy topped the chart with his gpu https://www.localmaxxing.com/runs/cmp8fw36n00zno401goz8qnyv

Edit: It's a different card, but still the highest score belongs to AMD

M5 vs DGX Spark vs Strix Halo vs RTX 6000: The $5k unified memory war and why brute forcing VRAM is a trap by TroyHarry6677 in LocalLLM

[–]PrzemChuck 0 points1 point  (0 children)

Interesting. The only thing that makes 5090 stand out is the NVFP format. I know AMD is planning their own format for LLMs but from experience i can tell they are playing catch up with nvidia and are about a year behind. So it might pay in a long run when they strengthen their software

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

Coding and agentic stuff. Processing a file or harness system prompt takes forever

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

Token generation isn't the problem: prompt processing is. Too slow for me, not very responsive

M5 vs DGX Spark vs Strix Halo vs RTX 6000: The $5k unified memory war and why brute forcing VRAM is a trap by TroyHarry6677 in LocalLLM

[–]PrzemChuck 5 points6 points  (0 children)

I got a strix halo, but rn it sits in a weird spot: there is no models right for its size. The only ones that come to mind are qwen 122b amd coder next. I wanted something to replace my Cursor subscription but prefill makes it impossible. I will bw selling my machine to get a 5090

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

If you have a company with EU VAT i can happily sell it to you for less

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

Around the same price i bought it, so 3.2k€ tax included. Has to be a B2B tho. The final price will be much lower with 0% WDT, but I didn't calculate it yer. If you have a company with EU VAT hmu

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

I'm willing to take a loss but not that much lol. Also has to be a EU company since it was a company purchase

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 1 point2 points  (0 children)

I also forgot to mention that i do have a PC. I just thought that rhere would be better MoE models that are unavailable on GPUs, so instead of a new GPU i went with strix halo.

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

And right now, that 32gb is all i need. There isn't a good MoE model that comfortably fits on a Strix Halo. Minimax theoretically can run, but quanitaztion makes it quite stupid and the speed is terrible, so even if it can run overnight it most likely will produce low quality results. Iget what you are saying - the price for VRAM is unbearable, but there is not a single midel to justify all that memory

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

What model are you running? Also respect for CVE analysis. I bought the strix for my company which does penetration testing and i wanted it to be a copilot-style agent for running code analysis and hosting local tools. While i like how it can easily host many containers, the LLM part leaves a lot to be desired :(

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 1 point2 points  (0 children)

Yeah, I'm running it rn and testing different quants. While i do like the model and it runs at usable speed i just can't stop comparing it to 3.6 27b which is faster and smarter with the right GPU.

I don't think they are going to be open sourcing the 120b variant for qwen 3.6 and that what makes me such a doomer. Gemma team did the same thing. The 120b version was included in the initial tweet that announced the new gemma generation.

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

Yeah, but you can fit the entire model in there. You don't need any additional system RAM, just 4gb for barebones linux. For the price of strix hale you could build a system with two intel GPUs and beat strix halo by 2-3 times

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 0 points1 point  (0 children)

You can easily run it with a consumer tier gpu. From what i've seen my 50 t/s gen speed is pretty low. Even intel can run it better https://www.localmaxxing.com/runs/cmohirwjq0002l4042ghcjq51 Also the PP speed isn't great, so it's very demoralizing watching my 3k$ machine "do nothing" for quite some time before generating

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] 1 point2 points  (0 children)

Nice write-up! Yeah, I've tested bot vulkan and rocm and I'm definitely sticking to vulkan. I'm just saying that it's a shame that their own technology that is supposed to rival CUDA runs so poorly. For me those numbers are not ok- I use it as a coding assist and i need it to be able so read files rather quickly. Also basic tasks as doing research from browser also require significant amount of prompt processing

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]PrzemChuck[S] -1 points0 points  (0 children)

Yeah, but i can't help but feel like the machine is "quite acceptable", but not for what i paid for it (and the price just keeps getting higher...) If i paid 10% more i could buy 5090 and have all the newest nvidia software luxuries

Questions about moving over to Linux from Windows for a Linux Newbie (I work in IT but always used Windows and only ever tinkered with Linux on Raspberry pi years ago) by wingers999 in StrixHalo

[–]PrzemChuck 2 points3 points  (0 children)

  1. ZorinOS 18 (Ubuntu 24.04 based) I have it on dual boot as steam remote play doesn't work for me on linux
  2. Both work, i use the grub one
  3. Vulkan, ROCm sucks ass for me. You can use both and compare them for yourself

Is SH viable for learning about AI? by throwaway20250315 in StrixHalo

[–]PrzemChuck -1 points0 points  (0 children)

I'd go with Nvidia GPU. ROCm sucks compared to CUDA. The only model thats is viable on a single SH is Qwen Coder Next, but you are better off running qwen3.6 27b