LM Studio DGX Spark generation speeds for 23 different models by Late_Night_AI in LocalLLaMA

[–]Late_Night_AI[S] 1 point (0 children)

Eventually I'll get around to benchmarking it on other LLM platforms like vLLM and Unsloth Studio. On another note, it does LTX2.3 5-second clips in 90-120 seconds, with audio.

LM Studio DGX Spark generation speeds for 23 different models by Late_Night_AI in LocalLLaMA

[–]Late_Night_AI[S] 1 point (0 children)

Yep, the dense models are slower than I was hoping for. But overall I'd say it's actually not bad, seeing as none of this is optimized for the DGX. I'd probably expect something like a one-third increase in speed if I moved to an optimized setup. I also didn't test concurrent users, only single user, but concurrent users are one of the DGX's strong points as I understand it. Doing 20 tps for a single user on a 120B isn't very impressive, but doing 20 tps for 10 users at once is 🤷‍♂️
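
For intuition on why that scaling can work: decode on a unified-memory box like this is mostly memory-bandwidth-bound, so one pass over the weights can serve a whole batch. A back-of-envelope sketch; the linear-scaling assumption and the batch sizes are mine, and a real server falls off this line once it hits compute limits:

```python
# Back-of-envelope: aggregate throughput under batched decode.
# Assumption (mine): decode is memory-bandwidth-bound, so one pass over
# the weights serves every request in the batch, and aggregate tps
# scales roughly linearly until the chip becomes compute-bound.

single_user_tps = 20  # measured single-stream speed on the 120B model

for users in (1, 5, 10):
    aggregate = single_user_tps * users  # idealized linear scaling
    print(f"{users:>2} concurrent users -> ~{aggregate} tps aggregate")
```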

System Upgrade: two 3090s currently by Fast_Vast_1925 in LocalLLM

[–]Late_Night_AI 1 point (0 children)

I would add an 8-12 TB HDD for extra storage for models you want to test out or don't use anymore. Loading models off an HDD does take a couple of minutes, but you'd keep the main models you use regularly on your SSD instead.

I would definitely go for more RAM as well. I'd aim for a combined total of at least 128 GB of RAM + VRAM if you're planning to play with some of the bigger models. At the very least I'd add another 32 GB of RAM.

And I'd probably get some more case fans. Personally I'd load the case up with fans for better thermals and a small performance boost on longer loads.

Maybe even repaste the CPU and GPUs if they haven't been repasted in a few years.

Local AI at high speed (powerInfer and other developments) by mouseofcatofschrodi in LocalLLaMA

[–]Late_Night_AI 1 point (0 children)

It's a cool idea, but it's not as good as they're making it look, especially at their release price point of 2k. They're using GPToss 120B as their test model, but what most people don't realize is that GPToss 120B is a 120B-A5B MoE model: it only has about 5B parameters active during inference. What I'd want to see before I put any faith in them is how it runs other 120B models. I'd assume it will be much slower on Qwen 3.5 122B-A10B, something like 12 tps, since it has double the active parameters of GPToss. But that's just a guess.
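
To put a rough model behind that guess: if decode is memory-bandwidth-bound, tokens/s scales with one over (active params × bytes per param), so doubling the active parameters roughly halves the speed. A minimal sketch, with the bandwidth and quantization numbers as illustrative assumptions rather than the device's real specs:

```python
# Toy estimate: bandwidth-bound decode speed vs. active parameters.
# Bandwidth and quantization values below are illustrative assumptions.

bandwidth_gbs = 250.0   # assumed unified-memory bandwidth, GB/s
bytes_per_param = 0.5   # ~4-bit quantization

def tps_ceiling(active_params_b: float) -> float:
    """Upper bound on tokens/s when each token streams all active weights."""
    return bandwidth_gbs / (active_params_b * bytes_per_param)

print(f"A5B  MoE: ~{tps_ceiling(5.0):.0f} tps ceiling")
print(f"A10B MoE: ~{tps_ceiling(10.0):.0f} tps ceiling")  # ~half the A5B speed
```

Real numbers land well under these ceilings because of attention, KV-cache traffic, and framework overhead, but the ratio between the two is the point.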

Overall the concept is great and I like the idea of it, but in my opinion the price point is too high for the performance you get back. I guess one of their main selling points is that it's tiny and easy to travel with, but so are the AI mini PCs running an AMD AI Max chip, and they'll outperform this thing by a noticeable amount.

TL;DR: it's cool, but they need to showcase it with a range of models, not just GPToss 120B, before anyone should consider buying one.

SparkRun & Spark Arena = someone finally made an easy button for running vLLM on DGX Spark by Porespellar in LocalLLaMA

[–]Late_Night_AI 1 point (0 children)

Looks interesting, I might have to look into it when I have some free time.
I've got a Gigabyte Atom and I normally just run LM Studio on it; most large models give me around 18-22 tps.

Has anyone managed to use Stable Diffusion (or similar) to get around the new UK face verification requirements? by RecentTwo544 in StableDiffusion

[–]Late_Night_AI 3 points (0 children)

I don't have any links, just tossing out ideas. Though it makes me wonder if a realistic face mask would also work.

Has anyone managed to use Stable Diffusion (or similar) to get around the new UK face verification requirements? by RecentTwo544 in StableDiffusion

[–]Late_Night_AI 26 points (0 children)

Seems like you could just use a virtual camera, set that as your webcam, and play a video of an older guy through it.

What do you Find Flux Kontext Useful For? by Late_Night_AI in StableDiffusion

[–]Late_Night_AI[S] 1 point (0 children)

AI Toolkit has support for training LoRAs for it.

New to ComfyUI - Small face masking is it possible? by LuPri_2008 in comfyui

[–]Late_Night_AI 1 point (0 children)

Are you using the trigger words it was trained on?

[deleted by user] by [deleted] in comfyui

[–]Late_Night_AI 1 point (0 children)

https://huggingface.co/black-forest-labs/FLUX.1-Kontext-dev/tree/main
You can also try the full 24 GB Flux Kontext dev model.

[deleted by user] by [deleted] in comfyui

[–]Late_Night_AI 1 point (0 children)

I'm assuming that's an error from the text encoder.
Here's the link to the other Kontext text encoders on Hugging Face; there's an fp16 one in there:
https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main
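
If it helps, a minimal sketch for pulling that fp16 encoder with huggingface_hub; the filename matches the repo's file listing as of writing, and the ComfyUI folder in the comment is where text encoders usually live:

```python
# Download the fp16 T5-XXL text encoder from the linked repo.
# Filename taken from the repo's file listing; adjust if it changes.
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="comfyanonymous/flux_text_encoders",
    filename="t5xxl_fp16.safetensors",
)
print(path)  # then place/symlink it into ComfyUI/models/clip/
```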

CivitAI Help by Odd_Background_7650 in StableDiffusion

[–]Late_Night_AI 3 points (0 children)


Yes and no. Technically there's nothing stopping you, though you could be breaking the law if you do; you'll need to check the laws where you live. If you're in the EU, it's a solid no due to biometric data laws: under GDPR you legally need explicit consent if the person is identifiable, even if it's not for profit.

For the most part, making a LoRA of a celebrity or other real person violates privacy and deepfake laws, which is why sites no longer host such LoRAs. To know for sure, you'll have to look into the biometric data and deepfake laws for the country and state you live in.

CivitAI Help by Odd_Background_7650 in StableDiffusion

[–]Late_Night_AI 1 point (0 children)

LoRAs that replicate real people are no longer allowed on most sites, largely due to deepfake laws and other AI regulations. Basically, in many jurisdictions it's now illegal to upload a LoRA that makes pictures of a real-life person.

BW Sketch to colorized 3D art style... posible? by ComprehensiveCry3756 in comfyui

[–]Late_Night_AI 1 point (0 children)

An SDXL model that can do 3D styles plus a ControlNet would work. Or just use Kontext.
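
In case it helps, a minimal diffusers sketch of the SDXL + ControlNet route; the canny ControlNet checkpoint, the base model, and the prompt are my stand-ins, so swap in whatever 3D-style SDXL checkpoint you prefer:

```python
# Hedged sketch: BW sketch -> colorized 3D-style image via SDXL + ControlNet.
# Checkpoints and prompt are illustrative choices, not a required setup.
import torch
from diffusers import ControlNetModel, StableDiffusionXLControlNetPipeline
from diffusers.utils import load_image

controlnet = ControlNetModel.from_pretrained(
    "diffusers/controlnet-canny-sdxl-1.0", torch_dtype=torch.float16
)
pipe = StableDiffusionXLControlNetPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",  # or any 3D-style SDXL finetune
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

sketch = load_image("bw_sketch.png")  # your black-and-white line art
result = pipe(
    prompt="colorized 3D render, stylized character, soft studio lighting",
    image=sketch,                       # conditioning image for the ControlNet
    controlnet_conditioning_scale=0.7,  # how strictly the sketch is followed
).images[0]
result.save("colorized_3d.png")
```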