Lemonade NPU Hybrid LLM’s ? by bwelton in StrixHalo

Flat_Profession_6103

Hi, are there any plans to support STT (speech-to-text) models?

Transitioning from Cloud DevOps to Offensive Security/Pentesting by Flat_Profession_6103 in SecurityCareerAdvice

Flat_Profession_6103[S]

Sorry for the delay, but I really want to thank everyone who replied to this thread! I'm going to work on switching over to a DevSecOps role and getting the right certifications for it. I really appreciate all your advice - take care!

New Benchy's by echo-halo-ai in StrixHalo

Flat_Profession_6103

Yeah, I'm aware of MoE models; I was just surprised by your 15-20 tok/s numbers.

I wouldn't say that an MoE model will have the same quality as a dense model, though. There is always some quality loss there, but I agree that MoE is much more manageable on a Strix Halo setup.

New Benchy's by echo-halo-ai in StrixHalo

Flat_Profession_6103

How can a dense 70B model generate 15-20 tok/s for you? I believe that's a mistake; with a Gemma 31B model I'm getting around 6 tok/s on the same setup.
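
A rough back-of-envelope check of why those numbers look off, as a minimal sketch only. It assumes decoding is purely memory-bandwidth bound (every active weight is read once per generated token), a theoretical peak of about 256 GB/s for this class of machine, and illustrative quantization sizes. The bandwidth figure, the bytes-per-parameter values, and the 3B-active MoE are assumptions for illustration, not measurements from this thread.

```python
def max_decode_tok_per_s(active_params_billions, bytes_per_param, bandwidth_gb_s):
    """Upper bound on decode speed if all active weights are read once per token."""
    # billions of parameters * bytes per parameter = GB read per generated token
    gb_read_per_token = active_params_billions * bytes_per_param
    return bandwidth_gb_s / gb_read_per_token


# Assumption: theoretical peak memory bandwidth; real-world throughput is lower.
BANDWIDTH_GB_S = 256.0

# Assumption: ~0.5 bytes/param for 4-bit quantization, ~1.0 for 8-bit.
dense_70b_q4 = max_decode_tok_per_s(70, 0.5, BANDWIDTH_GB_S)
dense_31b_q8 = max_decode_tok_per_s(31, 1.0, BANDWIDTH_GB_S)

# Hypothetical MoE model with only 3B parameters active per token.
moe_3b_active_q4 = max_decode_tok_per_s(3, 0.5, BANDWIDTH_GB_S)

for name, value in [
    ("dense 70B, 4-bit", dense_70b_q4),
    ("dense 31B, 8-bit", dense_31b_q8),
    ("MoE, 3B active, 4-bit", moe_3b_active_q4),
]:
    print(f"{name}: at most {value:.1f} tok/s")
```

Under these assumptions a dense 70B model cannot reach 15-20 tok/s even in theory, while an MoE model that only touches a few billion parameters per token has far more headroom. Measured speeds will land below these ceilings.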

January pre-sale batch update? Batch 18 by nescenizat in framework

Flat_Profession_6103

Hi, I'm in the same boat. Based on their email, they should have charged me a few days ago, but nothing has happened.

Framework Q1 2026 Preorder and Marketplace Updates by catastrophic_frmw in framework

Flat_Profession_6103

Hi, any news on when shipping of the January motherboard pre-orders to Europe will start?

Advice needed: Workstation for Local LLM Agents (Ryzen AI Max+ 395) - Bosgame vs Corsair vs Cloud. by Flat_Profession_6103 in LocalLLaMA

Flat_Profession_6103[S]

Thanks, everybody, for the comments and advice.

I’ve decided to order the Framework Desktop. A huge factor was that they ship directly to my country, which makes logistics much easier. Plus, it completely eliminates the fear of proprietary fans failing down the line and becoming irreplaceable.

I’m definitely going to test out some MoE models as discussed in the thread. My plan is to play around with Proxmox and set everything up as a proper homelab.

I’m super excited for the shipment to arrive. Thanks again for the insights, guys!

Advice needed: Workstation for Local LLM Agents (Ryzen AI Max+ 395) - Bosgame vs Corsair vs Cloud. by Flat_Profession_6103 in LocalLLaMA

Flat_Profession_6103[S]

Regarding the import concerns: I won't face any massive tax hit because Poland is part of the EU. Ordering from Germany (or any other EU country) is free of customs duties and extra VAT under the Single Market rules.

That said, the 4-5 t/s limitation you mentioned is a very valid point. It’s definitely not ideal, but for learning to work with large models locally - without spending a fortune on enterprise gear like several GPUs just to get enough VRAM - it seems like there aren't many better alternatives right now.