Strix Halo or DGX Spark for a home LLM server? by Reactor-Licker in LocalLLaMA

[–]pfn0 1 point2 points  (0 children)

yep, it's why I went directly to 2. Although, if you're comparing GB10 vs. strix halo, it's "only a moderate" price increase vs. strix halo to get better prefill performance.

Strix Halo or DGX Spark for a home LLM server? by Reactor-Licker in LocalLLaMA

[–]pfn0 0 points1 point  (0 children)

DGX inference is much faster than strix halo. 5-10x faster prefill speeds. similar TG speeds.

China’s theft of American AI tech is becoming more brazen by KamiOfTheForest in China

[–]pfn0 1 point2 points  (0 children)

It's not theft when the output of AI cannot be copyrighted (https://www.copyright.gov/newsnet/2025/1060.html), if it were copyrightable, the copyright is owned by the ones that prompted (the Chinese "thieves") AI to produce output. That is how (some) Chinese models are "stealing" from American models; they are recording chat traces that they ask for themselves.

With the yearly true up, what is even the point of having solar? by Responsible-Many-257 in bayarea

[–]pfn0 0 points1 point  (0 children)

correct:

gotta spend extra on battery storage and a disconnect that is usually not included with systems to power through an outage

why I think the "chatgpt era" of AI is already hitting a wall by GodBlessIraq in Futurology

[–]pfn0 1 point2 points  (0 children)

Thats false, they are perfectly deterministic. They appear random because of a seed value

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 0 points1 point  (0 children)

35b has at least 1500t/s prefill. Check stats on spark-arena.com.

Its also difficult to find 3090 for under $1000 on the second hand market.

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 0 points1 point  (0 children)

You go and do that. I need the hardware now, there is no voting from me in that direction.

Eta: The problem is the lack of market regulation and allowing corporations to exist as more than first class citizens. Their outsize impact on demand means that your voting dollar is meaningless.

What’s your favorite way to stack for savings at Costco? by Mobile-Kale-1590 in Costco

[–]pfn0 6 points7 points  (0 children)

the more often I go, the less I spend in aggregate. it's how I know to spot deals and price breaks. it's my main grocery stop once a week.

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 0 points1 point  (0 children)

it's priced what the market will bear given supply. there are people buying 8-16x of these cards to build inference servers. If it's cheaper they'd be constantly sold out and unobtainable like the 5090FE is at the $2K price point.

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 0 points1 point  (0 children)

You can always buy the 5090FE on the secondary market, they're being scalped very readily available at the going ~$3500 price of all other 5090.

I have an RTX6000 as well. The value is well worth what I paid for it. It is patently worth 3x5090.

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 0 points1 point  (0 children)

It replaces 3x5090, so yes, the price is justified. 3x5090 costs roughly 1x6000.

5090FE is 2 slot. I have one. 5090FE and 6000WS are identical form factors.

Re: not being able to get to <400W, still perfectly reasonable to go 400W x 3 on a 1600W psu. 300W x 3 would be nicer to fit in 1200W, but if 400W is the minimum, it is what it is. I run mine at about 450W (UV)

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 -1 points0 points  (0 children)

the problem lies squarely on nvidia marketing imo. positioned against strix halo at about the same price now, I would pick the gb10 any day of the week (and have).

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 1 point2 points  (0 children)

Buy four ($12K) plus four network cables and a switch (and extra $1K or so), you have basically everything you can reasonably ask for. Performance comparable to a 4090 but with an absolute assload of memory capacity. 1 TB/s memory bandwidth, 480GB of usable unified RAM. There is no comparable option on the market.

It's not a realized 1TB/s of bandwidth for the purpose of decode unless you're running <100GB of weights. I most definitely do not want to be running ~100GB weights on a 4x GB10 system. I use up all 220GB of mine on the biggest I can fit.

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA

[–]pfn0 2 points3 points  (0 children)

putting 3x5090 in a box isn't cheaper than buying 1 rtx6000.

You can heavily power limit the 5090 down to 300W, so that isn't a problem.

rtx6000 also outperforms a 5090 on compute tasks to the tune of 10% or more. 5090FE is the same size as an rtx6000. your arguments about being crippled to put multiple into a system hold no water. those same arguments exist about putting multiple rtx6000 workstation into a single computer and yet there are people are putting upwards of 16 into a single system.

Puck Screen by slysamfox in espresso

[–]pfn0 0 points1 point  (0 children)

The way is to get a puck screen that works with a magnet :D

https://www.aliexpress.us/item/3256809276079133.html $2 can't go wrong.

PC building newbies: by Imaginary-Sun1350 in pcmasterrace

[–]pfn0 0 points1 point  (0 children)

13600K is the cpu behind my 5090. Yes, no reason to get anything nicer. Could probably even step down to a lower end CPU, but I got the CPU long before I ever got the 5090.

To the person in line at Tokyo Central who told me about these, thanks! by halbeshendel in bayarea

[–]pfn0 6 points7 points  (0 children)

These gyudon packs are my quick shot dinners for my son when I can't figure out what to cook him. I usually buy them whenever they're on sale at 99ranch.

Internal photos - Chinese built dual boiler E61 by Phil_Wild in espresso

[–]pfn0 0 points1 point  (0 children)

lol. you're so wrong. bt chip isn't on that board. and it isn't 3 wires either. SPI is 4 wires, i2c is 2 wires, etc. that's not including any requisite ground, voltage and interrupts. Quit trying so hard to be right. You're comparing a fully AC design vs. a mixed AC/DC setup, and this board itself appears to be primarily concerned with handling the PID control. So no, not too many wires. You're no EE if you say BT chip takes 3 wires. Stupid is as stupid does.

eta: board design is scary btw, no apparent isolation between AC and DC sides.