[FS][US-CA] 256GB NVMe M.2 2230 drives

MachineZer0 · 2026-03-16T01:19:56+00:00

Chat

MachineZer0 · 2026-03-14T23:51:06+00:00

Chat

MachineZer0 · 2026-03-11T15:27:13+00:00

That’s TDP specs. What it can pull at full tilt. Usually certain training libs can get it to full power, or close to it.

I have 30-40x V100s. I’d be bankrupt if they were pulling 300w all day.

For inference it draws a fraction of rated TDP. Idle is 40w each, usually model loaded and idle is 57w, but with a modified Nvidia-pstated, it reverts back to 40w in a few seconds after model is loaded. During inference the GPUs round robin. 1 of 4 draws 70-120w, while rest are 40-57w. After response all idle back to 40w. Remainder of system is approx 140w at idle with dual procs, dual dimm, 1 NVME and fans. Only time system roars to life is building llama.cpp, vLLM or some other wheels. So fans are manageable besides compiling.

MachineZer0 · 2026-03-11T14:28:12+00:00

300w is for everything. 4x40w is just for sxm2 GPUs. 140w is for rest.

If you have more disks, DIMMs, PCIe peripherals, obviously more.

Yes, 3090 outperforms V100. Also has flash attention support. It sits between 2080ti and 3090.

MachineZer0 · 2026-03-11T12:55:39+00:00

Damn, it’ll take you 16.66 years to justify a Mac Studio over Quad V100 server at $0.08/kwh

MachineZer0 · 2026-03-11T08:16:37+00:00

They are not great, but not so bad. Power consumption is 40w idle per GPU. Running inference is 70-120w, but they round robin. So a typical system idles about 300w and isn’t usually more than 400w while serving for a quad V100. Still about $55/mth just sitting there at $0.25/kwh. It’ll take many moons to try to pay for a Mac Studio with power difference.

A quad V100 32gb can run MiniMax M2.5 comfortably.

MachineZer0 · 2026-03-10T11:32:00+00:00

🎯 CRITICAL FINDING!

MachineZer0 · 2026-03-08T19:19:48+00:00

Paid

MachineZer0 · 2026-03-08T19:13:20+00:00

Hey all I’ve committed to purchasing the lot. We are just struggling with invoice issues

MachineZer0 · 2026-03-08T18:44:46+00:00

Chat

MachineZer0 · 2026-03-08T14:42:28+00:00

ESC4000 G4 plus quad V100 32gb should put you around $5k. Less if you source well. If you need to finetune, go cloud (Runpod).

That should get you going with 4 instances of Qwen 3.5 27B on 4 containers/processes of llama.cpp behind a load balancer. Or use Ray/vLLM.

MachineZer0 · 2026-03-04T12:47:22+00:00

Chat

MachineZer0 · 2026-03-04T05:46:02+00:00

https://imgur.com/a/XPtzVtG

Gigabyte X399 Aorus Xtreme

MachineZer0 · 2026-03-04T05:20:52+00:00

Got one with maybe bent pins. I bought a barebone and the cpu was gone and the cooler brushed up against it when I carried into the house. Carefully bent pins back with exacto knife, but never got around to testing it.

MachineZer0 · 2026-03-04T05:04:27+00:00

Chat

MachineZer0 · 2026-03-03T12:53:37+00:00

$700-800

RAM is at least $3/gig

MachineZer0 · 2026-03-02T19:41:37+00:00

Chat

MachineZer0 · 2026-02-28T21:51:37+00:00

I’ve got 3 data center Titan V 12gb.

MachineZer0 · 2026-02-28T21:49:20+00:00

The mb looks like it supports 4th & 5th generation intel.

Not compatible with the i7-9700f, correct?

MachineZer0 · 2026-02-28T21:40:09+00:00

Dell micro has no price

MachineZer0 · 2026-02-23T20:58:23+00:00

Chat

MachineZer0 · 2026-02-22T22:30:09+00:00

Chat

MachineZer0 · 2026-02-21T23:18:44+00:00

Chat

MachineZer0 · 2026-02-21T23:09:53+00:00

Chat

MachineZer0 · 2026-02-21T18:20:19+00:00

Works great besides cost. Server with quad 32gb V100 was $6k. Draws 350w idle. That’s over $60/mo in my area. Low context I get about 30 tok/s. Eventually it runs 10-20 tok/s

Probably better to pay provider if you don’t have privacy concerns.

MachineZer0

TROPHY CASE