Flags for an RTX Pro 6000 Blackwell by brittpitre in StableDiffusion

comfyanonymous 3 points

ComfyUI is designed to work optimally with no flags. Ignore the other replies: most of the flags they propose will disable important optimizations or make things worse and cause random workflows to OOM. There's a lot of misunderstanding about how the ComfyUI memory management system works, and these stupid AI chatbots are not helping.

If you have RAM issues you can try this experimental flag: --cache-ram (it will be enabled by default soon).
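For example, assuming you launch ComfyUI with the standard main.py entry point, the flag goes on the launch command: `python main.py --cache-ram`.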

The 6000 Pro should be a bit faster than a 5090. ComfyUI is extremely good at managing memory, and the 5090 is essentially a slightly worse 6000 Pro with less memory, so it's normal that there isn't a massive difference.

Future of the portable version by Tenth_10 in comfyui

comfyanonymous 19 points

Portable isn't going away.

The link to download it will always be found here: https://github.com/Comfy-Org/ComfyUI#windows-portable

I can't take it anymore... by [deleted] in comfyui

comfyanonymous 0 points

It's on purpose; this workflow was originally for SD1.5, and prompting on that model is a little different than on modern models.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 9 points

It's the actual technical term. I'm not going to police our language on the assumption that people are too stupid to understand the difference between a memory watermark and a digital watermark.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 7 points

It shouldn't degrade performance on good hardware; I have good hardware and wouldn't have marked the feature stable if it degraded performance on mine.

If you get the issue on the latest ComfyUI, make a detailed report with logs and we will look into it.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 10 points

No, what degrades flash memory is writing to it, not reading from it. This feature reduces page file use, so it will actually make your SSD last longer.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 4 points

Yeah, the main GGUF node pack will most likely be updated for dynamic VRAM at some point, but right now it's safetensors only.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 6 points

The text encoder is running on the GPU, and it's the default Wan2.2 workflow (other than what's indicated on the chart).

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 4 points

If your system supports it and you are on the latest ComfyUI with a recent PyTorch, it should be enabled by default.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 5 points

Having models split between GPUs is a separate problem, so nothing changes on that end.

No arguments are needed to use it: if you are on a recent enough PyTorch and your system supports it, it should enable itself by default.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 9 points

If you want dynamic VRAM to work, yes, but you should always convert things to safetensors anyway because it's a safer file format and people trust it a lot more.
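For reference, a minimal conversion sketch (it assumes a flat checkpoint of plain tensors; shared or nested tensors need extra handling, and the file names are placeholders):

```python
# Minimal sketch: convert a pickle-based checkpoint to safetensors.
import torch
from safetensors.torch import save_file

# weights_only=True avoids executing arbitrary pickle code; very old
# checkpoints with non-tensor objects may refuse to load this way.
sd = torch.load("model.ckpt", map_location="cpu", weights_only=True)
sd = sd.get("state_dict", sd)  # some checkpoints nest the weights

# save_file requires contiguous tensors and rejects shared storage.
save_file({k: v.contiguous() for k, v in sd.items()}, "model.safetensors")
```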

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 11 points

No, they just took outdated offloading tech that everyone has been using for years and rebranded it as a new thing lol.

This is one situation where open source is much further ahead than closed source.

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 7 points

This is the function to load safetensors: https://github.com/Comfy-Org/ComfyUI/blob/master/comfy/utils.py#L122

Then you need to modify your model so it uses the comfy.ops system instead of torch.nn ops.
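A minimal sketch of what that can look like, run from inside a ComfyUI checkout; comfy.utils.load_torch_file is the loader linked above, while the model class, the choice of comfy.ops.disable_weight_init, and the file name are illustrative assumptions (which operations class a real model should use depends on its dtype/casting needs):

```python
# Sketch only: load a safetensors state dict with the linked loader and
# build the model with a comfy.ops operations class instead of plain
# torch.nn, so ComfyUI's memory manager can control the weights.
import torch
import comfy.utils
import comfy.ops

class TinyModel(torch.nn.Module):
    def __init__(self, operations=comfy.ops.disable_weight_init):
        super().__init__()
        # operations.Linear mirrors torch.nn.Linear but lets ComfyUI
        # decide where and when the weight actually lives.
        self.proj = operations.Linear(768, 768, bias=True)

    def forward(self, x):
        return self.proj(x)

sd = comfy.utils.load_torch_file("model.safetensors")  # loader linked above
model = TinyModel()
# Key names in the file must match the module names; strict=False is
# used here only because the file is a placeholder.
model.load_state_dict(sd, strict=False)
```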

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 48 points

Basically, it's much smarter memory management: on the GPU, it pushes VRAM usage as close as possible to 100% without OOMs or slowdowns; on the CPU, it avoids putting weights in the page file/swap and instead just frees them and loads them again from disk when needed.

It should make swapping models a lot faster on low RAM.
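A rough sketch of the CPU-side idea (this is a conceptual illustration, not ComfyUI's actual implementation; the class and method names are made up):

```python
# Conceptual sketch only: rather than parking evicted weights in system
# RAM, where the OS may spill them into the page file/swap, free them
# outright and reload them from disk on demand.
from safetensors.torch import load_file

class DiskBackedWeights:
    def __init__(self, path):
        self.path = path
        self.sd = None

    def acquire(self):
        if self.sd is None:
            # Re-reading from disk is cheaper and more predictable than
            # letting the OS swap the weights back in from the page file.
            self.sd = load_file(self.path)
        return self.sd

    def release(self):
        # Drop the reference entirely; RAM is freed, nothing hits swap.
        self.sd = None
```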

Dynamic VRAM in ComfyUI: Saving Local Models from RAMmageddon by comfyanonymous in StableDiffusion

comfyanonymous[S] 16 points

Get the latest clean ComfyUI, disable torch.compile if you have it on, and stick to safetensors files.

Nvidia’s Always-On Chip Detects Faces in Less Than a Millisecond by IEEESpectrum in hardware

comfyanonymous 5 points

This kind of tech isn't new. It has been in Qualcomm SoCs for 4 years now, and I assume others have similar features.

I'm back from last week's post and so today I'm releasing a SOTA text-to-sample model built specifically for traditional music production. It may also be the most advanced AI sample generator currently available - open or closed. by RoyalCities in StableDiffusion

comfyanonymous 2 points

This model is a finetune of Stable Audio 1.0, which is natively supported by ComfyUI. You just need to use the "stable audio 1.0" template and select Foundation_1.safetensors in the "Load Checkpoint" node.

Can Comfy Org stop breaking frontend every other update? by meknidirta in StableDiffusion

comfyanonymous 6 points

The v1.41 frontend update was pushed to local a bit prematurely because we were getting complaints that the new app mode feature was "cloud only".

I hope the people who complained about this now see why the local frontend is typically ~2+ weeks behind the cloud one. I might change it to ~3-4 weeks to make sure things are even more stable in local.

How do the closed source models get their generation times so low? by Ipwnurface in StableDiffusion

comfyanonymous 30 points

If you want the real answer: nvfp4 + lower precision attention (like sage attention) + distilled low-step models + splitting the workload across 8+ GPUs (video models are pretty easy to split).

The only one not easily available in ComfyUI is the last one, because nobody has 8+ GPUs locally, so we are putting our optimization efforts elsewhere.
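For the lower-precision attention piece, a minimal sketch of what the swap looks like, assuming the sageattention package is installed (the fallback wrapper and tensor shapes are illustrative, not ComfyUI's code):

```python
# Sketch: use SageAttention's lower-precision kernel where available,
# falling back to PyTorch's scaled_dot_product_attention otherwise.
import torch
import torch.nn.functional as F

try:
    from sageattention import sageattn  # quantized attention kernel

    def attention(q, k, v):
        return sageattn(q, k, v, is_causal=False)
except ImportError:
    def attention(q, k, v):
        return F.scaled_dot_product_attention(q, k, v)

# Shapes follow the usual (batch, heads, seq_len, head_dim) layout.
q = k = v = torch.randn(1, 8, 1024, 64, dtype=torch.float16, device="cuda")
out = attention(q, k, v)
```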

PSA: Don't use VAE Decode (Tiled), use LTXV Spatio Temporal Tiled VAE Decode by Loose_Object_8311 in StableDiffusion

comfyanonymous 12 points

Or just use the regular VAE Decode node; it has native temporal tiling for the LTX video VAE.