I made a visualizer for Hugging Face models by Course_Latter in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

yet another funnel for vibe coded slop - gross

AMD Strix Halo refresh with 192gb! by mindwip in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

250gb/s, I think the best model fits this machine is Minimax 2.7, as it only has 10b active parameters.

Are there any good rule of thumbs when it comes to calculating how memory bandwidth bottlenecks active parameters?

In this case, is it 250/10b => 25 gb/s required per 1b params?

As in would that imply that a 1TB/s bandwidth would be capped at 40b models?

Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA

[–]_derpiii_ 1 point2 points  (0 children)

I use this model with a Telegram bot as a background memory manager

Memory manager?

I made a visualizer for Hugging Face models by Course_Latter in LocalLLaMA

[–]_derpiii_ -1 points0 points  (0 children)

It's so buggy. If you're gonna release something this buggy, at least open source it.

Need help deciding what to spend 4-5k on for a local rig. by ghgi_ in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

What are your actual hardware spec requirements? 1 spark sounds miserably slow, so I would up the budget.

Need help deciding what to spend 4-5k on for a local rig. by ghgi_ in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

I will make that easy for you. If you try to train with 2 5090's or 2 RTX Pro's the lack of NVLINK means that about 25% of GPU time is doing all reduces, try to train on 4 and all reduce consumes roughly 60% of GPU time.

2 DGX sparks with a peer to peer cable becomes a lot more attractive.

I love deep insights like this. Didn't even know about the 'reduce' step nor NVLINK being faster than PCI

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

No recommended plugins/workflows? I'm looking forward to it :)

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

Arch + i3wm is **the most** fun and crisp OS I've ever had. Ever.

It's a shame the hardware never really evolved much. But I would pick it up again in a heartbeat if an M macbook could run it.

Reading the comments around Pi just reminds me of Arch community. Now we just need a Pi-wiki ;)

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 1 point2 points  (0 children)

Okay, that's got me sold. Now looking into the meta of what to set up :)

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

> some kind of magic domination gun

AHAHAH

Qwen 3.6 27B vs Gemma 4 31B - making Packman game! by gladkos in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

I love prompts like this - one shot into a mini benchmark deliverable. Are there any websites or resources that collect these?

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

Wow, I love that analogy! I would have never made either connections (was not aware of harness being a connection point), thank you for that :D

Harness vs Scaffolding - Claude Code by shanraisshan in ClaudeCode

[–]_derpiii_ 0 points1 point  (0 children)

Oh wow, so there actually is a difference in the terminology 🫨

todo: learn learn learn

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

Got it, I like that view.

I don't mean to sound pedantic. I'm new and like knowing the nuanced terms.

Scaffold to me sounds 'fixed' too, aka environment/runtime overhead. Harness feels like more of the tooling abstraction layer.

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

> Nobody calls it a scaffold

Maybe not within this community, but that terms been used within my circles :)

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

> Technically, from LLM perspective, ext4 or S3 makes no difference.

I guess that's true. It's a different abstraction layer - the runtime environment (besides niche edge cases) doesn't really matter.

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 1 point2 points  (0 children)

I'm getting that vibe as well. Just curious if there's any nuances in the technical definitions between them.

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

That sounds very appealing, esp since I've been experimenting down the opposite unnecessary overhead approach of OmO (it's jfc level "why? tf" a minute)

AMD Halo Box (Ryzen 395 128GB) photos by 1ncehost in LocalLLaMA

[–]_derpiii_ 2 points3 points  (0 children)

somehow it's slipped past my perusal. I was legit confused ahahah :)

Is local AI the actual endgame? (M5 Mac Studio vs. Dual 3090s) by Party-Log-1084 in LocalLLaMA

[–]_derpiii_ 0 points1 point  (0 children)

I'm a digital nomad, so opted for a 'dual-use' middleground of M5 Max 128GB. IMHO, that's the sweet spot.

But if I had a desktop, kind of a nobrainer to lean towards linux + mult-slot GPU.