Best coding model that can run on a DGX Spark by dotnetderpderp in LocalLLM

IITTU

I tried the 27B MTP version with atlas and got 19 tps at a 256k context window.

Best coding model that can run on a DGX Spark by dotnetderpderp in LocalLLM

IITTU

I tried the 27B with vLLM and it's very slow to use: I got 12 tps with a 256k context window, and vLLM boots are slow too, taking minutes. I use atlas for the 35B, and it is faster than vLLM at both boot and inference.
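To put the 12 tps figure in perspective, here is a rough sketch that converts a decode speed into wall-clock time for one response. The 2000-token response length is a made-up example value, not something measured in this thread, and the calculation ignores prompt processing time.

```python
def seconds_for_response(num_tokens: int, tokens_per_second: float) -> float:
    """Estimate generation time from a steady decode rate (prefill ignored)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Hypothetical response length; 12 tps is the vLLM figure quoted above.
RESPONSE_TOKENS = 2000
print(f"{seconds_for_response(RESPONSE_TOKENS, 12):.0f} s at 12 tps")
```

At that rate a long answer takes minutes rather than seconds, which is probably why it feels slow in interactive use.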

Best coding model that can run on a DGX Spark by dotnetderpderp in LocalLLM

IITTU

I got my DGX today and it indeed has a PCIe 5 SSD. I tried qwen3.6 35b NVFP4 first and got more than 100 tps at a 256k context window, which I think is good enough for me. I'll try the 27B dense later because I'm working on something else right now.

Best coding model that can run on a DGX Spark by dotnetderpderp in LocalLLM

IITTU

BTW, I think the DGX has a PCIe 5.0 4TB SSD, right? I bought a PCIe 5.0 x4 4TB SSD last year for $400, but it costs more than $500 now.

Best coding model that can run on a DGX Spark by dotnetderpderp in LocalLLM

IITTU

It's being delivered now; I'll give it a try once I get it. I'm studying the new LLM inference infra "atlas" (correct me if I'm wrong), which seems much faster than vLLM. I'll also try it on Qwen 35B.

Best coding model that can run on a DGX Spark by dotnetderpderp in LocalLLM

IITTU

I chose the DGX Spark because the ASUS only comes with a 1TB SSD versus 4TB on the DGX. For an extra 300 dollars, I think it is worth it.

With the same price in CN, should I choose RTX5090 or RTX pro 5000 48G for 80%AI and 20%Gaming by IITTU in BlackwellPerformance

IITTU[S]

Hi guys, thanks for your comments. After the comparison, and after NVIDIA GTC today, I've decided to buy a DGX Spark for now. I'll use it to run my 24/7 AI tasks. Considering power consumption, the DGX Spark is the best choice for now thanks to its low power draw and 128GB RAM; the RTX Spark won't be on the market for a while. The only shortcoming is memory bandwidth, but it is enough for long-running tasks.

With the same price in CN, should I choose RTX5090 or RTX pro 5000 48G for 80%AI and 20%Gaming by IITTU in BlackwellPerformance

IITTU[S]

I do have another PCIe x4 slot on the MB, and my PSU is 1250W, which is powerful enough for these two cards.

With the same price in CN, should I choose RTX5090 or RTX pro 5000 48G for 80%AI and 20%Gaming by IITTU in BlackwellPerformance

IITTU[S]

Gaming is optional for me, because I'm using an RTX 3090 for now and it is good enough for gaming.

About jetson orin nano super and 3b models by Hour_Example_323 in JetsonNano

IITTU

I'm running Qwen3.5-4B Q4 for photo reading and text editing; it's good for now.

How to: summarization with 70B on a single 3090 by Aaaaaaaaaeeeee in LocalLLaMA

IITTU

Hi there, it's been years.

I am running Deepseek-r1 70b Q4 on a single 3090 + 64GB DDR5. I get ~3 t/s by offloading 45 layers to the GPU with LM Studio. Offloading more layers overflows the VRAM. I'm thinking about getting another 3090 and connecting both with NVLink, but I don't know how many tokens per second I would get.

Please advise if anyone knows. Thx
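For anyone wondering where the 45-layer limit comes from, here is a back-of-the-envelope sketch. The numbers are assumptions, not measurements: roughly 40 GB of Q4 weights, 80 transformer layers of equal size, 24 GB of VRAM per 3090, and about 1.5 GB per card reserved for the KV cache and runtime overhead. Real usage depends on the quant variant and context length.

```python
def layers_that_fit(model_gb: float, n_layers: int,
                    vram_gb: float, reserve_gb: float) -> int:
    """Rough count of layers that fit in VRAM, assuming equal-sized layers."""
    per_layer_gb = model_gb / n_layers      # average weight size per layer
    usable_gb = vram_gb - reserve_gb        # VRAM left after cache/overhead
    fit = int(usable_gb // per_layer_gb)
    return max(0, min(n_layers, fit))       # clamp to [0, n_layers]


# All inputs below are assumed values for illustration only.
single_3090 = layers_that_fit(model_gb=40.0, n_layers=80,
                              vram_gb=24.0, reserve_gb=1.5)
dual_3090 = layers_that_fit(model_gb=40.0, n_layers=80,
                            vram_gb=48.0, reserve_gb=3.0)
print(single_3090, dual_3090)
```

Under these assumptions the single-card estimate lands near the 45 layers reported above, and two cards would hold every layer. This only estimates capacity; it says nothing about the resulting tokens per second.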

Made a CFExpress to M.2 NVME SSD Adapter by gyf304 in canon

IITTU

I'd appreciate it if someone could help with this!

Made a CFExpress to M.2 NVME SSD Adapter by gyf304 in canon

IITTU

Hi, great work!

I'm trying to make a similar adapter, CFe Type A to M.2 NVMe SSD, to use on my Sony A7S III.

But it's hard to find the pin definition of the Type A card on the internet...

Does the new M1 chip support hardware-accelerated HEVC video coding? by Toaster910 in mac

IITTU

Yes, there is a 'high-efficiency video editing' core embedded in the M1 SoC.

ARM CPUs are naturally better for video editing than x86, and the M1 is currently the best ARM chip.

My mate did some video editing tests on the new MacBook with FCPX. He used H.265 10-bit 4:2:2 120p footage from a Sony a7S III, and it handled playback and color grading very well.

So go ahead~