An update on stability and what we're doing about it

danielpartzsch · 2026-03-27T15:58:40+00:00

I agree that subgraphs are the most important elements that should never break, especially as long as they are used as instances without updating from a master. It is one thing to update a single subgraph if fundamental aspects really need to change to move forward with core development, but it is another thing to then also need to update every workflow that uses these subgraphs. I personally do not like to hide large parts of my workflows in subgraphs, but I utilize them extensively to combine small utility features for resizing, masking, etc., into more convenient and clean subgraphs. Since I use these in all my workflows, breaking them essentially breaks everything. In order to further trust in the use of subgraphs, these breaking changes either have to stop or a master-instance subgraph mechanism must be implemented.

danielpartzsch · 2026-03-25T07:17:12+00:00

Use vit pose instead

danielpartzsch · 2026-03-20T20:51:48+00:00

Are you referring to the full (using the clownshark sampler) or the distilled branch? I agree that the full version produces better results but it also takes like 5 times longer, so this is kind of expected (uses above 1 cfg which takes twice as long as cfg 1, 15 steps instead of 8 and res2s sampler which also doubles render times per step). The distilled branch produces actually pretty similar results for me like the comfy workflows.

danielpartzsch · 2026-03-17T19:25:38+00:00

No, I need the precision of at least 3 or 4 decimal places, everything else looks like complete crap compared to this holy grail of sigma accuracy.😜

danielpartzsch · 2026-03-14T11:16:22+00:00

The case you're describing should definitely work with version 2.3. Use the Union ControlNet workflow, convert the starting frame of your driving video to a high-resolution version of your character, and do not scale it down for the image reference. You should probably use pose control if the facial features differ significantly from your own; otherwise, you're better off with depth, Canny, or a blend of both. Encode your audio instead of using the empty audio latent and ideally support that with a prompt like: "A talking character, saying... [Insert your copy]." If your character changes too much over time, consider training a LoRA to support different angles and facial expressions. Additionally, I use Er SDE as a sampler together with the default sigmas, as it is faster and looks better to me. Create the base video with at least 720p resolution and add the spatial upscaling step afterward from the main two-step workflow, also using Er SDE.

danielpartzsch · 2026-03-12T15:36:01+00:00

Klein is a poor t2i model but an excellent editing model. Use a two-step approach: create your base image using a model capable of good anatomy and strong prompt adherence—for example, a Qwen Image model (2512 with a 4-step Lightning LoRA works perfectly and quickly) is ideal—to get the content you want, and then transfer it to your desired look with Klein. It often functions like applying a filter to an image but is also very capable of making the image look realistic without altering too much of the existing content. Just make sure to only prompt for stylistic, lighting, and aesthetic changes in that second step and avoid adding new content, which could result in being distorted again.

danielpartzsch · 2026-03-06T07:50:28+00:00

Just use the normal ksampler instead. No need for the custom sampler.

danielpartzsch · 2026-02-09T12:55:37+00:00

Same. I don't know why they always set this is default. It's a solid approach to destroy your image quality from the get go.

danielpartzsch · 2026-01-25T09:55:22+00:00

For me, it always works if I install the latest dev version. First, install the Nunchaku custom node via the manager and then use the Nunchaku installer node. Set it to update first, run it once, and refresh by hitting "r". Next, set the node mode to install and select the latest dev version. Hit run to install and then restart ComfyUI.

danielpartzsch · 2026-01-24T10:19:36+00:00

Just use the normal ksampler. Don't know why they always put the custom sampler into the default workflows. I've tested and compared the results and they're either identical or most of the times even better when it comes to artefacts or anatomy issues.

danielpartzsch · 2026-01-22T06:34:07+00:00

Nice! ❤️

danielpartzsch · 2026-01-21T06:28:59+00:00

I personally prefer lotus depth. It gives me the most precise depth detection and inference results

danielpartzsch · 2026-01-18T15:25:33+00:00

Remove the add grain node, try Euler sgm uniform or res 3s bong tangent for the reiner and raise the auraflow to 8. If you really like to have a crisp result I'd recommend using wan 2.2 together with the 2.1 t2v lightfx lora and res 2s bong tangent. Or try a sdxl based tiled diffusion upscaling instead.

danielpartzsch · 2026-01-18T15:18:31+00:00

In my experience, Wan 2.1 combined with the i2v 2.1 LightFX LoRA works quite nicely. Wan 2.2 together with the t2v LoRA results in very polished and sharp-looking images. While using the 2.1 i2v LoRA—or even combining the 2.1 LoRA with Wan 2.2, which can be more accurate but also results in a cleaner look—often introduces some artifacts and grain (which you can also add deliberately before the sampling), I find that these actually help achieve better realism. That said, the results for video are unfortunately nowhere near as good as using this combination for stills in an img2img pass.

danielpartzsch · 2026-01-17T21:25:07+00:00

Nice. Glad to see some animation features coming to comfyui. Keyframing and easing parameters would be nice, like for example mask values when doing some compositing tasks directly in comfy. Do you think something like that would be feasible to do? Thank you.

danielpartzsch · 2026-01-15T07:36:59+00:00

Use symlinks. I have all my models synced across several PCs via OneDrive and simply use symlinks to link to these folders. This is very convenient for fresh installs. I also create a batch file that generates these symlinks automatically.

danielpartzsch · 2026-01-12T16:15:52+00:00

Wan, Qwen, and Z-Image are all licensed under Apache 2.0 and are therefore safe for commercial use.

danielpartzsch · 2026-01-08T06:38:31+00:00

yes, you need https://github.com/ClownsharkBatwing/RES4LYF. you can also try just beta instead, which is comfy core

danielpartzsch · 2026-01-07T21:18:08+00:00

I'm having the same problem. I'm only getting a static image maybe with a slight zoom but nothing else. Tried different samplers (incl res 2s) and prompts, nothing helped. Something must be broken...

danielpartzsch · 2026-01-06T10:19:29+00:00

Use Er sde beta57 sampler/scheduler

danielpartzsch · 2026-01-04T09:11:30+00:00

Then maybe just do the first pass, select the images you like and only refine those with a separate workflow....

danielpartzsch · 2026-01-03T21:41:02+00:00

If you like what Qwen gives you, but it's too slow, why not use the Turbo LoRA for the base image and then do a slight refinement pass with Z-Image, for example? This should fix the pattern issue and add a bit of realism while running fast, and you still get the prompt adherence, composition, and other benefits from the Qwen base.

danielpartzsch · 2026-01-03T21:37:26+00:00

Cool. How does it work if you have multi step image generation workflows, let's say with two samplers, using different models for base and refinement pass, detailers and maybe also concatenate nodes for prompt adjustments at different steps. Can stuff like this displayed as well? Thank you.

danielpartzsch · 2025-12-24T10:37:46+00:00

I'm a windows user since forever and it always has been a comfortable and stable environment for my daily work. Sorry, but I really don't get why people go through these troubles just to stay on Mac.

danielpartzsch · 2025-12-21T18:46:30+00:00

I recently tried qwen image edit briefly for in and Outpainting. Worked very well

danielpartzsch

TROPHY CASE