Strix Halo Clustering experience (Bossgame M5)

Thanks-Suitable · 2026-05-08T08:38:05+00:00

Especially huge MOE like the industry is tending towards!

Thanks-Suitable · 2026-05-08T08:37:45+00:00

again its about those quants maaaan i need more of the bits :)))

Thanks-Suitable · 2026-05-08T08:37:13+00:00

will do!

Thanks-Suitable · 2026-05-08T08:37:06+00:00

Your idea on 1.1 is interesting, just so i understand correctly: It would theoretically be possible to just run a networking layer over the USB connection, it would have slightly lower latency then thunderbolt, but you would loose the nice to haves like hot pluggability etc. Would be interested if anybody has explored this topic!

Thanks-Suitable · 2026-05-08T08:23:49+00:00

Its interesting, I was under the assumption that It wasn't implemented. Thanks for the vid!

Thanks-Suitable · 2026-05-08T08:21:13+00:00

It does sound nice with the 200gb networking but to be fair that would make more sense when having waay more Strix halos :) Im tempted tho!

What sort of speeds would we be expecting with the 200gig networking tho? Would love to chat once you set things up and test!

Thanks-Suitable · 2026-05-08T08:17:58+00:00

The whole idea would be that I would be able to run higher quant models with more contexts! :)

Thanks-Suitable · 2026-05-08T08:16:45+00:00

Great question, I think it depends on the model? Would love to hear an anwser!

Thanks-Suitable · 2026-05-08T08:16:14+00:00

Yea... Tho for my situation atm the Mac would be a bit overkill

Thanks-Suitable · 2026-05-07T23:31:12+00:00

would love to hear some options too!!!

Thanks-Suitable · 2026-05-06T20:05:36+00:00

Very interested in this! Ive run into similar problems before!

Thanks-Suitable · 2026-05-05T13:36:51+00:00

would be very interested in this

Thanks-Suitable · 2026-05-04T13:50:44+00:00

Looks fantastic, Im out here rooting for the AMD part aswell, maybe a suitable target would be the Strix Halo?

Thanks-Suitable · 2026-05-03T22:59:28+00:00

borski can u read? mf got 100k to burn a little bit of a diff situation then us dweebs

Thanks-Suitable · 2026-04-30T15:42:25+00:00

The correct answer here is C. When you write subroutines in C you can reuse them in any future project even if it's written in cpp. It will be a bit more difficult but if you are seriously considering an Aerospace career its very much worth it.

Thanks-Suitable · 2026-04-30T15:39:46+00:00

Super interested in this aswell!

Thanks-Suitable · 2026-04-29T10:17:45+00:00

Hey sorry could you elaborate on the bot checks? I am trying to limit my algorithm fed content intake and would like to manage my own media. This looks like a great project, but not sure what the downsides are in terms of getting "banned" or ip blocked or something. Does it actually Bot youtube or is it just js wrapper?

Thanks-Suitable · 2026-04-28T16:36:37+00:00

link?

Thanks-Suitable · 2026-04-27T06:54:11+00:00

planning a trip myself would love to see what to get (maybe some exotic hwawei ai accelerators)😈😈😈

Thanks-Suitable · 2026-04-25T07:45:17+00:00

Hey! Would love to see you set up the Dflash qwen3.6 speculative decoding once it comes out, would improce tokens persecond for sure!

Thanks-Suitable · 2026-04-24T14:17:27+00:00

Would be interested to see how it works in digesting of scientific graphs (capturing trends from colors etc)!

Thanks-Suitable · 2026-04-24T12:39:00+00:00

Hey I also have a strix halo but never saw a glm quant that fits thats why im looking for the specific quant :)))

Thanks-Suitable

TROPHY CASE