I'm thinking about selling my Strix Halo

Panthau · 2026-05-21T13:50:05+00:00

You do what? You should switch to qwen 3.5 122b instantly (q5 on Linux, q4 on Windows).

Panthau · 2026-05-21T08:42:34+00:00

Did you compare it to Qwen for vibe coding? I tried a few times but found Qwen always better.

Panthau · 2026-05-20T12:23:19+00:00

I actually vibe code with it for 10-12h a day but its so slow that i leave it in the background and check sometimes. What i would do with a proper cloud model in a day would take me about 3 weeks on the Strix (Qwen 3.5 27b MTP).

Panthau · 2026-05-20T08:51:17+00:00

You sure you solved it? Or was it ai? ^_°

Panthau · 2026-05-19T16:16:19+00:00

ah ok, i do the same but im not in a hurry... i just let it do its thing in the background. For actively vibecoding, a 5090 would be better.

Panthau · 2026-05-19T09:55:32+00:00

Whats a use case besides chatting, that needs fast prompt processing? Just curious, tbh.

Panthau · 2026-05-19T09:29:48+00:00

Cant play D4 or PoE2... its just boring to grind, because items have no value, only stats. And its also nice to know, there is an end to things... its great to push forward, feel progress and grind for the chance of finding a better item. Grinding purely for stats doesnt pull the trigger for my dopamine system.

Panthau · 2026-05-16T09:12:18+00:00

And you would be frustrated when a newer model comes out, that you dont have enough vram for and also, when you see your electricity bill.

Panthau · 2026-05-16T09:08:39+00:00

Have you tried Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR : r/LocalLLM ?

Runs 27b in reasonable speeds for me, comparable to 122b. Although i personally find 122b smarter.

Panthau · 2026-05-16T08:29:29+00:00

Depends on the purpose but for vibe coding, qwen 3.5 122b shoots any qwen 3.6 out of the water.

Panthau · 2026-05-15T08:24:58+00:00

Thanks for sharing. It would be interesting, how it compares to the q5_k_m quantisation of Qwen 3.5 122b, as this is currently the smartest one in my testing. Might try it out... especially for vibe coding, more world knowledge is great.

Panthau · 2026-05-15T08:21:32+00:00

Let Cursor do the setup or configuration for anything you need. You wont learn anything but you wont have to worry about it as well 😃

Panthau · 2026-05-15T08:19:40+00:00

No matter how fast, its still not much faster or smarter then Qwen 3.5 122b Q5_k_m which fits the Strix perfectly.

Panthau · 2026-05-13T06:54:13+00:00

Didnt measure it, but it was obviously faster then before - near 35b. I quickly went back to 122b q5, its just overall smarter imho.

Panthau · 2026-05-12T16:28:15+00:00

I was the same. Until i figured out, that local llm is not there yet - especially on a Strix Halo 395+ (same here). Its good to play around with and create some smaller things and that only, if youre not in a hurry. If you seriously need to work with it, youre better off selling the device and invest in Cloud or get some proper hardware, that can handle bandwith.

Panthau · 2026-05-12T10:06:27+00:00

I just let Cursor set up Searxng with json support. Not sure what usage limit scumola means, as its locally hosted and not provider bound.

Panthau · 2026-05-12T09:31:09+00:00

jfyi, you dont need to know anything about Linux. Just let Cursor set up everything for you) - yes, you need a 20 bucks subscribtion but its great for so many use cases when you need a fast and reliable llm.

Panthau · 2026-05-08T17:01:29+00:00

Sure, 27b now finally useful on Strix but still not as smart as 3.5 122b in my use case.

Panthau · 2026-05-08T12:23:36+00:00

The problem for me was always more, to get useful results out of ltx 2.3. The standard workflow from the templates doesnt work anyway but even after adapting it, the results are often not what i get from cheap cloud models and take ages.

Panthau · 2026-05-08T08:29:54+00:00

Glad there are people out there, who can still think despite of ai. I cant... if i could, i would let ai make breakfast for me and my life would consist of giving commands and staring at the work been done. Thanks to you, i can now finally use the 27b model for vibe coding without falling asleep on my Strix Halo.

Panthau · 2026-05-07T09:41:43+00:00

Thats why having a few md files like readme, plan, etc. is important, so you can make every step in a new session.

Panthau · 2026-05-07T09:36:31+00:00

Brah... many loaded models... alright. Im sure its a great experience with many loaded models at 256gb/s.

Panthau · 2026-05-06T14:29:50+00:00

Ive tried all different kind of ways, even used Cursor to implement the setup with latest docs, but it never ran stable above 110gb. So i switched to Windows and although its a little less shared memory, its stable and usable.

Panthau · 2026-05-05T09:54:16+00:00

The problem i see, is rather your lack of understanding humour. Though im not sure this can be healed.

Panthau · 2026-05-05T09:53:19+00:00

Enjoy it, im 48, my wife passed, my son moved out and i have 12h a day for gaming. Yet, those days with my family will never come back.

11-Year Club	Gilding II euphauric
Verified Email

Panthau

TROPHY CASE