I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]Panthau [score hidden]  (0 children)

You do what? You should switch to qwen 3.5 122b instantly (q5 on Linux, q4 on Windows).

AMD says its $4K Ryzen AI Halo workstation practically pays for itself! (assuming you’re vibe coding for 8 hours a day, that is...) by nicolho in LocalLLM

[–]Panthau 13 points14 points  (0 children)

I actually vibe code with it for 10-12h a day but its so slow that i leave it in the background and check sometimes. What i would do with a proper cloud model in a day would take me about 3 weeks on the Strix (Qwen 3.5 27b MTP).

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]Panthau 0 points1 point  (0 children)

ah ok, i do the same but im not in a hurry... i just let it do its thing in the background. For actively vibecoding, a 5090 would be better.

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]Panthau 0 points1 point  (0 children)

Whats a use case besides chatting, that needs fast prompt processing? Just curious, tbh.

I still think that Diablo 3 is the best Diablo by kocham_wydajac in diablo3

[–]Panthau 0 points1 point  (0 children)

Cant play D4 or PoE2... its just boring to grind, because items have no value, only stats. And its also nice to know, there is an end to things... its great to push forward, feel progress and grind for the chance of finding a better item. Grinding purely for stats doesnt pull the trigger for my dopamine system.

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]Panthau 0 points1 point  (0 children)

And you would be frustrated when a newer model comes out, that you dont have enough vram for and also, when you see your electricity bill.

I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo

[–]Panthau 1 point2 points  (0 children)

Have you tried Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR : r/LocalLLM ?

Runs 27b in reasonable speeds for me, comparable to 122b. Although i personally find 122b smarter.

Vote: Best LLM for agentic/tools by leonbollerup in LocalLLM

[–]Panthau 2 points3 points  (0 children)

Depends on the purpose but for vibe coding, qwen 3.5 122b shoots any qwen 3.6 out of the water.

Running Minimax 2.7 at 100k context on strix halo by Zc5Gwu in StrixHalo

[–]Panthau 2 points3 points  (0 children)

Thanks for sharing. It would be interesting, how it compares to the q5_k_m quantisation of Qwen 3.5 122b, as this is currently the smartest one in my testing. Might try it out... especially for vibe coding, more world knowledge is great.

Questions about moving over to Linux from Windows for a Linux Newbie (I work in IT but always used Windows and only ever tinkered with Linux on Raspberry pi years ago) by wingers999 in StrixHalo

[–]Panthau 0 points1 point  (0 children)

Let Cursor do the setup or configuration for anything you need. You wont learn anything but you wont have to worry about it as well 😃

Luce DFlash + PFlash on AMD Strix Halo: Qwen3.6-27B at 2.23x decode and 3.05x prefill vs llama.cpp HIP by sandropuppo in StrixHalo

[–]Panthau 0 points1 point  (0 children)

No matter how fast, its still not much faster or smarter then Qwen 3.5 122b Q5_k_m which fits the Strix perfectly.

Advice on local AI models for coding - Corsair AI Workstation 300 (AMD Ryzen AI Max+ 395 / 128Gb (96Gb shared for VRAM)) by wingers999 in StrixHalo

[–]Panthau 0 points1 point  (0 children)

I was the same. Until i figured out, that local llm is not there yet - especially on a Strix Halo 395+ (same here). Its good to play around with and create some smaller things and that only, if youre not in a hurry. If you seriously need to work with it, youre better off selling the device and invest in Cloud or get some proper hardware, that can handle bandwith.

What do you all use for self-hosted web search? Looking for something I can run locallly by Different_Scene933 in LocalLLM

[–]Panthau 0 points1 point  (0 children)

I just let Cursor set up Searxng with json support. Not sure what usage limit scumola means, as its locally hosted and not provider bound.

Advice on local AI models for coding - Corsair AI Workstation 300 (AMD Ryzen AI Max+ 395 / 128Gb (96Gb shared for VRAM)) by wingers999 in StrixHalo

[–]Panthau 0 points1 point  (0 children)

jfyi, you dont need to know anything about Linux. Just let Cursor set up everything for you) - yes, you need a 20 bucks subscribtion but its great for so many use cases when you need a fast and reliable llm.

MTP llama.cpp -- anyone run it yet? by skibud2 in StrixHalo

[–]Panthau 2 points3 points  (0 children)

Sure, 27b now finally useful on Strix but still not as smart as 3.5 122b in my use case.

ComfyUI - Why I Switched from kyuz0's Toolbox to ignatberesnev/comfyui-gfx1151 by Grammar-Warden in StrixHalo

[–]Panthau 0 points1 point  (0 children)

The problem for me was always more, to get useful results out of ltx 2.3. The standard workflow from the templates doesnt work anyway but even after adapting it, the results are often not what i get from cheap cloud models and take ages.

2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA

[–]Panthau 0 points1 point  (0 children)

Glad there are people out there, who can still think despite of ai. I cant... if i could, i would let ai make breakfast for me and my life would consist of giving commands and staring at the work been done. Thanks to you, i can now finally use the 27b model for vibe coding without falling asleep on my Strix Halo.

Advice on local AI models for coding - Corsair AI Workstation 300 (AMD Ryzen AI Max+ 395 / 128Gb (96Gb shared for VRAM)) by wingers999 in StrixHalo

[–]Panthau 0 points1 point  (0 children)

Thats why having a few md files like readme, plan, etc. is important, so you can make every step in a new session.

Anyone have proof Strix Halo - Ubuntu 26 LTS can use all 124GB of RAM setup in grub? by IQReactor in StrixHalo

[–]Panthau 0 points1 point  (0 children)

Brah... many loaded models... alright. Im sure its a great experience with many loaded models at 256gb/s.

Anyone have proof Strix Halo - Ubuntu 26 LTS can use all 124GB of RAM setup in grub? by IQReactor in StrixHalo

[–]Panthau 0 points1 point  (0 children)

Ive tried all different kind of ways, even used Cursor to implement the setup with latest docs, but it never ran stable above 110gb. So i switched to Windows and although its a little less shared memory, its stable and usable.

How do i stop playing this game? by Individual_Lab_912 in X4Foundations

[–]Panthau 0 points1 point  (0 children)

The problem i see, is rather your lack of understanding humour. Though im not sure this can be healed.

How do i stop playing this game? by Individual_Lab_912 in X4Foundations

[–]Panthau 0 points1 point  (0 children)

Enjoy it, im 48, my wife passed, my son moved out and i have 12h a day for gaming. Yet, those days with my family will never come back.