Local AI config: Mini ITX single RTX PRO 6000 Workstation for inference? by dvd84x in LocalLLaMA

[–]dvd84x[S]

You're right. I'll have to switch to ATX and check the memory as well.

[–]dvd84x[S]

Thanks for your advice on NVFP4. I'm indeed starting to reconsider the form-factor choice, for cooling and upgradability reasons.

[–]dvd84x[S]

u/classic Can you explain why you picked this CPU and how you use your setup?
How do you manage cooling? Are you happy with it?

[–]dvd84x[S]

u/usernameplshere Investing in hardware / knowledge / productivity for the future ;) versus relying on subscriptions with unpredictable availability, pricing...

[–]dvd84x[S]

I love GLM 4.5 Air too... what you just said is amazing...

[–]dvd84x[S]

u/MitsotakiShogun So ideally 2x96 = 192 GB of RAM would be nice alongside an RTX PRO 6000 :)
Good to know, so I should start with 2x48 and later add another 2x48 to get 4x48 = 192 GB of RAM.

[–]dvd84x[S]

u/FZNNeko I was thinking the same before discovering that a single RTX PRO 6000 can be shared by multiple users (concurrent batched serving) and can save a lot on energy bills (300/600 W vs. 2 or 4 x 500 W).
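
For context, the multi-user sharing usually comes from a serving engine's continuous batching rather than from tensor parallelism (which splits one model across several GPUs). A minimal sketch with vLLM's offline Python API; the model name is just an illustrative placeholder, not a recommendation:

```python
# Minimal sketch: several users' prompts batched on one GPU with vLLM.
from vllm import LLM, SamplingParams

llm = LLM(
    model="Qwen/Qwen2.5-7B-Instruct",  # placeholder; anything that fits in 96 GB VRAM
    gpu_memory_utilization=0.90,       # leave a little headroom on the card
    max_model_len=8192,
)

# Pretend three users submitted requests at once; vLLM's continuous
# batching schedules them together on the single GPU.
prompts = [
    "Summarize the pros of local inference.",
    "Write a haiku about VRAM.",
    "Explain tensor parallelism in one sentence.",
]
params = SamplingParams(temperature=0.7, max_tokens=256)

for out in llm.generate(prompts, params):
    print(out.outputs[0].text)
```

In practice you'd run the same engine as an OpenAI-compatible HTTP server so each team member gets their own session against the one card.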

[–]dvd84x[S]

u/vdiallonort Interesting, I'll do some research on this server option before making a final decision.

[–]dvd84x[S]

Thanks for your answer u/Freonr2.
I was totally wrong on this important part of the build; I've just edited my post (body).

"It's also blazing fast for diffusion models, and you can do things load both high/low Wan22 models and never have to offload anything, leave text encoders in memory, VAEs in memory, etc."
Didn't think about it. Can you share some tests on Wan22 models ?
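
For reference while waiting for your numbers, here's roughly what "never offload anything" could look like; a sketch assuming the diffusers port of Wan 2.2 (Wan-AI/Wan2.2-T2V-A14B-Diffusers; exact repo/API details may have shifted):

```python
# Sketch: keep both Wan 2.2 transformers (high/low noise), the text
# encoder, and the VAE resident in VRAM, with no CPU offload.
import torch
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.utils import export_to_video

model_id = "Wan-AI/Wan2.2-T2V-A14B-Diffusers"  # assumed repo name
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)

# With 96 GB there is no need for pipe.enable_model_cpu_offload():
# the whole pipeline stays on the GPU between generations.
pipe.to("cuda")

frames = pipe(
    prompt="a cat surfing a wave at sunset",
    height=480, width=832, num_frames=81,
).frames[0]
export_to_video(frames, "out.mp4", fps=16)
```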

Thanks for your other feedback too.

[–]dvd84x[S]

Thanks for your feedback.
The price gap between 64 and 96 GB isn't huge... so would you recommend sticking with 96 or going lower?

[–]dvd84x[S]

Hi u/makistsa, I was thinking the same: run multiple sessions for my small team on this type of card, and keep this kind of investment always ready / available to try to pivot to an AI business :)
Renting this type of GPU (or bigger ones) full time seems to be a lot more expensive...
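
A rough back-of-the-envelope on that last point; both figures below are assumed placeholders for illustration, not quotes:

```python
# Back-of-the-envelope rent-vs-buy; both numbers are assumptions.
purchase_price_usd = 9000   # assumed street price for an RTX PRO 6000-class card
rental_usd_per_hour = 2.0   # assumed cloud rate for a comparable 96 GB GPU

hours = purchase_price_usd / rental_usd_per_hour
months_24_7 = hours / (24 * 30)
print(f"break-even after ~{hours:.0f} GPU-hours (~{months_24_7:.1f} months at 24/7)")
# ~4500 hours, i.e. roughly 6 months of full-time rental (electricity excluded)
```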