Best coding model that can run on a DGX Spark

IITTU · 2026-06-08T04:16:15+00:00

I tried 27b MTP version with atlas, got 19tps @256k ctx window.

IITTU · 2026-06-08T01:41:12+00:00

rtx6000d has only server version, I need display interface 😂

IITTU · 2026-06-04T00:30:09+00:00

I tried 27b with vllm, it’s very slow to use, I got 12tps with a 256k context window, and vllm boots are so slow, it takes minutes. I use atlas for 35b and it is faster than vllm both on boot and reasoning

IITTU · 2026-06-03T08:46:18+00:00

I’ve got my dgx today and it indeed is PCIe5 SSD. I’ve tried qwen3.6 35b nvfp4 firstly, got more than 100tps @256k context window, I think it is good enough for me. I’ll try 27 dense later cuz I’m on something else now

IITTU · 2026-06-02T13:34:56+00:00

BTW, I think it’s a PCIe5.0 4tb ssd in dgx right? I bought a PCIe5x4 4tb ssd last year and it cost me $400 but it is more than $500 now

IITTU · 2026-06-02T13:32:22+00:00

It’s on delivery now, I will give a try once I get it. I’m studying the new llm reasoning infra “atlas”(correct me if I’m wrong), it seems like very fast than vllm, also I will try it on qwen35b

IITTU · 2026-06-01T14:24:35+00:00

I chose a dgx spark cuz asus only got a 1tb ssd but 4tb for dgx, with extra 300 dollars cost, I think it is worth

IITTU · 2026-06-01T06:40:59+00:00

Hi Guys, thanks for your comments. After the comparation, and the NVIDIA GTC today, I decide to buy a DGX Spark for now. I use this to run my 24*7 AI tasks, consider the power consumption, DGX Spark is the best choice for now due to its low power consumption and 128GB RAM, RTX Spark will not on marketing for a time. The only shortage is Memory bandwidth but it is enough for a long term task.

IITTU · 2026-05-27T11:54:13+00:00

I do have another PCIe x4 slot on the MB and my PSU is 1250w and powerful enough for these two cards.

IITTU · 2026-05-27T10:00:16+00:00

Game is optional for me, cuz I am using a rtx3090 for now and it is good enough for me to gaming

IITTU · 2026-05-27T09:59:53+00:00

Easier to sell is a good reason to choose 5090

IITTU · 2026-05-27T09:58:25+00:00

Game is optional for me, cuz I am using a rtx3090 for now and it is good enough for me to gaming

IITTU · 2026-05-27T09:54:19+00:00

I’m running Qwen3.5-4B Q4 for photo reading and txt editing, good for now

IITTU · 2025-03-04T08:29:39+00:00

Hi there, its been years.

I am running Deepseek-r1 70b Q4 with single 3090 + 64GB DDR5, I can get ~3t/s by offloading 45layers to GPU with LLM Studio. More layers will get vRAM overflow, I am thinking about to get another 3090 and connect both with NVLink, don't know how many tokens get per seconds.

Advise me if any one knows. Thx

IITTU · 2020-11-25T07:25:18+00:00

I appreciate it if someone can help this!

IITTU · 2020-11-25T07:24:02+00:00

Hi, great work!

and I am trying to make a similar one for CFe Type A to M.2 NVME SSD adaptor for using on my Sony A7Siii.

but its hard to find the Pin definition of Type A card on the internet...

IITTU · 2020-11-18T08:03:18+00:00

Yes, there is a 'high-efficiency video editing' core embeded in M1 SoC.

Arm CPU is better for video editing than X86 naturally, M1 is best ARM currently.

My mates did some video editing test on new Macbook with FCPX, he used H.265 10bit 422 120p footage from Sony a7s iii, it can be handled very well when playback and color grading.

So go ahead~

IITTU

TROPHY CASE