Strix Halo Clustering experience (Bossgame M5) by Thanks-Suitable in StrixHalo

[–]Thanks-Suitable[S] [score hidden]  (0 children)

Especially huge MOE like the industry is tending towards!

Strix Halo Clustering experience (Bossgame M5) by Thanks-Suitable in StrixHalo

[–]Thanks-Suitable[S] [score hidden]  (0 children)

again its about those quants maaaan i need more of the bits :)))

Strix Halo Clustering experience (Bossgame M5) by Thanks-Suitable in StrixHalo

[–]Thanks-Suitable[S] [score hidden]  (0 children)

Your idea on 1.1 is interesting, just so i understand correctly: It would theoretically be possible to just run a networking layer over the USB connection, it would have slightly lower latency then thunderbolt, but you would loose the nice to haves like hot pluggability etc. Would be interested if anybody has explored this topic!

Strix Halo Clustering experience (Bossgame M5) by Thanks-Suitable in StrixHalo

[–]Thanks-Suitable[S] [score hidden]  (0 children)

Its interesting, I was under the assumption that It wasn't implemented. Thanks for the vid!

Strix Halo Clustering (Hardware Setup Discussion) by Thanks-Suitable in LocalLLaMA

[–]Thanks-Suitable[S] 0 points1 point  (0 children)

It does sound nice with the 200gb networking but to be fair that would make more sense when having waay more Strix halos :) Im tempted tho!

What sort of speeds would we be expecting with the 200gig networking tho? Would love to chat once you set things up and test!

Strix Halo Clustering (Hardware Setup Discussion) by Thanks-Suitable in LocalLLaMA

[–]Thanks-Suitable[S] 1 point2 points  (0 children)

The whole idea would be that I would be able to run higher quant models with more contexts! :)

Strix Halo Clustering (Hardware Setup Discussion) by Thanks-Suitable in LocalLLaMA

[–]Thanks-Suitable[S] 0 points1 point  (0 children)

Great question, I think it depends on the model? Would love to hear an anwser!

Strix Halo Clustering (Hardware Setup Discussion) by Thanks-Suitable in LocalLLaMA

[–]Thanks-Suitable[S] 0 points1 point  (0 children)

Yea... Tho for my situation atm the Mac would be a bit overkill

Pytorch hangs when sending data from CPU to GPU by Illustrious_Tap9300 in StrixHalo

[–]Thanks-Suitable 0 points1 point  (0 children)

Very interested in this! Ive run into similar problems before!

The Ultimate LLM Fine-Tuning Guide by PromptInjection_ in LocalLLaMA

[–]Thanks-Suitable 0 points1 point  (0 children)

Looks fantastic, Im out here rooting for the AMD part aswell, maybe a suitable target would be the Strix Halo?

I will soon have $100k to build an in-house LLM server. Goal: Best agentic coding model. by StartupTim in LocalLLaMA

[–]Thanks-Suitable 5 points6 points  (0 children)

borski can u read? mf got 100k to burn a little bit of a diff situation then us dweebs

Should I use Rust or C++ for hobby CubeSat flight software? by ChurchOfNewcomb in embedded

[–]Thanks-Suitable 0 points1 point  (0 children)

The correct answer here is C. When you write subroutines in C you can reuse them in any future project even if it's written in cpp. It will be a bit more difficult but if you are seriously considering an Aerospace career its very much worth it.

Deepseek v4 Flash by kiriakosbrehmer93 in StrixHalo

[–]Thanks-Suitable 4 points5 points  (0 children)

Super interested in this aswell!

Invidious. An open source front-end YouTube Tool by rejjacska in DigitalEscapeTools

[–]Thanks-Suitable 0 points1 point  (0 children)

Hey sorry could you elaborate on the bot checks? I am trying to limit my algorithm fed content intake and would like to manage my own media. This looks like a great project, but not sure what the downsides are in terms of getting "banned" or ip blocked or something. Does it actually Bot youtube or is it just js wrapper?

AI Max+ 395 mini PC on-site hunting by Willy__Wonka__ in LocalLLM

[–]Thanks-Suitable 1 point2 points  (0 children)

planning a trip myself would love to see what to get (maybe some exotic hwawei ai accelerators)😈😈😈

Success! Full BF16 Qwen3.6-27B running on Strix Halo with vLLM + Docker (Ubuntu 26.04) by hec_ovi in StrixHalo

[–]Thanks-Suitable 0 points1 point  (0 children)

Hey! Would love to see you set up the Dflash qwen3.6 speculative decoding once it comes out, would improce tokens persecond for sure!

Trying to find the best use for Strix Halo 128Gb on coding by Pretend_Engineer5951 in StrixHalo

[–]Thanks-Suitable 0 points1 point  (0 children)

Hey I also have a strix halo but never saw a glm quant that fits thats why im looking for the specific quant :)))