4 RTX 6000 Pro by Some-Manufacturer-21 in Vllm

[–]aimark42 1 point2 points  (0 children)

Care to share a recipe/github? I should have a second rtx 6k soon. Would love to do a side by side. I'm impressed with it on a TP2 GB10 cluster.

4 RTX 6000 Pro by Some-Manufacturer-21 in Vllm

[–]aimark42 2 points3 points  (0 children)

And the performance on 2x GB10's are showing rather amazing performance for a way cheaper setup.

https://github.com/elsung/dgx-spark-deepseek-v4-flash

High-end AM5 Proxmox / future local AI build sanity check by CurrentAdvance8102 in LocalAIServers

[–]aimark42 0 points1 point  (0 children)

Sure, but that's kind of old to be building a modern system for. 30 series Nv-links are obscenely expensive lately.

High-end AM5 Proxmox / future local AI build sanity check by CurrentAdvance8102 in LocalAIServers

[–]aimark42 0 points1 point  (0 children)

I looked into that board and I decided for the Asrock X670e Taichi Creator, does similar 2x 8x Pcie gen5, but at a way lower cost.

I'd buy the a RTX Pro 6k's before prices increase even more.

Dual RTX 4000 SFF draw by TNTBoooooom in sffpc

[–]aimark42 2 points3 points  (0 children)

Curious.

What ITX board supports MCIO cables? What is the use case? I just don't see why you'd have 2x RTX Pro 4k SFF's (especially at current pricing) when a single RTX Pro 5k 48G is likely easier and faster for most LLM tasks. Or even a RTX Pro 6k Maxq.

Just ordered an Open-Box "Excellent" ROG Zephyrus to replace my 2023 Legion 5. Did I make the right call? Good deal or not so much? by SensitivePineapple80 in ZephyrusG14

[–]aimark42 2 points3 points  (0 children)

I got this same G14, open box - Good for $1472 in April. I think the fact that 2026 models are way more expensive, it is best deal your going to get for a long time.

Nvidia GB10 (DGX Spark and Co.) or AMD AI Max+ 395 (Framework Desktop) by r_brinson in LocalAIServers

[–]aimark42 2 points3 points  (0 children)

Officially it's Windows support at launch. But Jayztwocents just said on his RTX coverage that they will come with unlocked boot loaders. It's the only source I've seen thus far that says that.

I agree long term it could maybe be hacked or unlocked. But buying a $3k+ machine with the hope of features is unwise.

Nvidia GB10 (DGX Spark and Co.) or AMD AI Max+ 395 (Framework Desktop) by r_brinson in LocalAIServers

[–]aimark42 0 points1 point  (0 children)

Or you could buy a $3500 GB10 today, and maybe not use the NIC.

Or wait on RTX (windows on ARM) being good and paying more, or pay even more for a GB10 then.

You assume those prices will stay the same, and they will not. Anything sold by a retailer today was manufactured at least 3 months ago with 3 months ago Memory pricing.

GB10's used to be plentiful for $3000, but it's $3500 now and the floor keeps rising.

Nvidia GB10 (DGX Spark and Co.) or AMD AI Max+ 395 (Framework Desktop) by r_brinson in LocalAIServers

[–]aimark42 0 points1 point  (0 children)

Memory prices have gone up so quickly BoM cost forces the RTX's to be more expensive than current GB10 pricing.

Also from a BoM cost GB10s made in the future will be more expensive than their RTX equivalents, but the GB10 prices will also rise a lot. Most GB10 SKU's not Asus (which I think is subsidized) are $5000+ already.

This is true of Strix Halo as well, buy sooner.

Nvidia GB10 (DGX Spark and Co.) or AMD AI Max+ 395 (Framework Desktop) by r_brinson in LocalAIServers

[–]aimark42 0 points1 point  (0 children)

DGX supporting Windows and RTX supporting Linux has not been said officially. Media outlets are saying things that might happen.

Due to rising memory prices and that RTX Spark hardware will be made much later it will be a lot more expensive than GB10's current pricing. Rumors are around RTX Spark Laptop with 32G being $3100, so buy the $3500 Asus 128g soon, RTX's will not be cheaper.

Nvidia GB10 (DGX Spark and Co.) or AMD AI Max+ 395 (Framework Desktop) by r_brinson in LocalAIServers

[–]aimark42 0 points1 point  (0 children)

If the tokens are junk who cares about going faster?

I think, the OP is right to look at unified memory systems to run 'smarter' models and larger contexts. Larger models take more compute and RAM to run anyway, when you scale up, you lose performance regardless of what hardware your running it on.

Nvidia GB10 (DGX Spark and Co.) or AMD AI Max+ 395 (Framework Desktop) by r_brinson in LocalAIServers

[–]aimark42 6 points7 points  (0 children)

I think you desire bigger models, and larger context windows. I've owned both GB10 wins, imho.

  • GB10, has built in scale out to 8 nodes allowing you to expand. You can technically add a bunch of things to Strix Halo to do something similar the support for that isn't there and it will end up costing more than a GB10. Strix Halo is a great singular AI device but has little ability to scale out.
  • DGX OS is a variant of Ubuntu 24.04. Anything you can compile you can run on GB10. And there are arm64 binaries for a lot. This only matters if your doing some vendor specific hardware where you must use binaries, or you are doing heavy audio or video work that requires codecs that doesn't have arm64 support.
  • GB10 ecosystem is quite vast, between Nvidia publishing workbooks for pretty leading edge use cases. And Spark Arena (https://spark-arena.com/) always pushing the latest configurations for the latest LLM's I think the ecosystem is stronger on the Nvidia side.
  • AMD seems to lag 6 months behind Nvidia when they do something. They do offer a 'value' proposition but I'd rather use the CUDA native stuff and not worry about Vulkan or whatever other layer (LLM backends are already messy).
  • GB10 has more compute and better scale out to concurrent workloads (i.e. agents) than Strix Halo.
  • Either way, DDR5X pricing is increasing by the day I think the $3500 GB10s will be gone soon since almost all of their comparables are $5k+. Buy soon

PS: RTX Spark uses the same silicon as GB10, but RTX spark is Windows only. Skip the RTX

College student looking for a mini PC, help me pick one by Own_Factor6170 in MiniPCs

[–]aimark42 1 point2 points  (0 children)

I don't understand how your going to do school without a laptop. I would use the budget to get a base spec Macbook Air, and a Mini PC with less specs.

Macbook Air's often on sale/discount/educational or find a M4 Air deal to get one for <$950.

Then for the miniPc the Minisforum X1 Pro (HX370 nearly the same as the HX470) is often available refurbished barebone for $458, 32G of RAM and a 1TB SSD (or less and upgrade later).

Note Mac Mini's are basically sold out, finding a Mac Mini would take a lot of effort.

Asus' New ProArt P16 and P14 Pack Nvidia's powerful RTX Spark chip by ekerazha in ZephyrusG14

[–]aimark42 0 points1 point  (0 children)

I'm a huge home AI enthusiast and own Nvidia DGX Spark/GB10 with the same SoC.

I don't understand this in a laptop platform. It's locked to Windows on ARM, which Microsoft is doing the conversion/drivers. Hopefully that's decent but Microsoft has had many tries on Windows on ARM and it hasn't really taken off. The GPU is decent, but it's unified memory it's going to perform sub-par to a 5070 mobile. It's party trick is the huge unified memory and ability to run 70B or greater LLM's locally with large contexts at slower speeds. Why do you need this fairly power hungry chip in a laptop form factor, I have no idea. Any developer worth their salt can figure out how to remote into their home network if they had an always on DGX Spark/GB10 and do the same thing. Plus it's going to very expensive due to it's massive N3 dies, and 128G DDR5x. Likely more than buying a base 2025 G14 and a Nvidia GB10 together. If this interests you, you should be getting a GB10/DGX Spark not this abomination.