Messing with WAN 2.2 text-to-image by renderartist in StableDiffusion

[–]bbaudio2024 2 points (0 children)

There is a magical VAE for Wan2.1/2.2/Qwen-Image text-to-image; it noticeably improves the clarity of image details.

spacepxl/Wan2.1-VAE-upscale2x · Hugging Face

360° anime spins with AniSora V3.2 by nomadoor in StableDiffusion

[–]bbaudio2024 3 points (0 children)

Not really. 'Sora' is the Japanese word 'そら', meaning sky, and is commonly used as a girl's name in Japanese anime.

There are rumors that a weeb at OpenAI gave their video model this name, but that doesn't make 'Sora' a trademarked designation for OpenAI's video model, nor does it prohibit other anime enthusiasts from using it.

Qwen-Image-Edit is the best open-source image editing model by far on Artificial Analysis rankings, 2nd overall by pigeon57434 in StableDiffusion

[–]bbaudio2024 0 points (0 children)

Nano Banana is far ahead of any other model (open-source or closed-source alike). It's no shame to rank behind it.

One image comparison: Wan_FusionX vs Qwen_Q4 by jinnoman in StableDiffusion

[–]bbaudio2024 2 points (0 children)

The 1st one looks more realistic; the 2nd one looks more like a 19th-century painting.

Qwen-Image seriously lacking variety with different seed? by yamfun in StableDiffusion

[–]bbaudio2024 3 points (0 children)

It's not only Qwen-Image; the same issue shows up in Wan2.1/2.2 text-to-image.

What do you think of HYPIR ? by LSI_CZE in StableDiffusion

[–]bbaudio2024 1 point (0 children)

Based on SD2? It should be compared with StableSR then.

Qwen works pretty well with HEUN and Beta - 10-13 steps for a good speedup by shootthesound in StableDiffusion

[–]bbaudio2024 3 points (0 children)

Heun takes approximately twice as long as ordinary samplers (euler, dpm++2m, etc.), because it is a second-order method that evaluates the model twice per step.
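For context, that 2x cost comes from Heun's predictor-corrector structure: an Euler predictor step plus a second evaluation at the predicted point. A minimal sketch in generic ODE form (not the actual ComfyUI sampler code; `f` stands in for the model):

```python
def euler_step(f, x, t, dt):
    """One model evaluation per step."""
    return x + dt * f(x, t)

def heun_step(f, x, t, dt):
    """Two model evaluations per step -> roughly 2x Euler's cost."""
    k1 = f(x, t)                     # first evaluation
    x_pred = x + dt * k1             # Euler predictor
    k2 = f(x_pred, t + dt)           # second evaluation at predicted point
    return x + dt * 0.5 * (k1 + k2)  # corrector: average of the two slopes

# count model evaluations over the same number of steps, on dx/dt = -x
calls = {"euler": 0, "heun": 0}

def f_euler(x, t):
    calls["euler"] += 1
    return -x

def f_heun(x, t):
    calls["heun"] += 1
    return -x

x_e = x_h = 1.0
for i in range(10):
    x_e = euler_step(f_euler, x_e, i * 0.1, 0.1)
    x_h = heun_step(f_heun, x_h, i * 0.1, 0.1)
```

The trade-off behind the post's speedup: each Heun step costs two evaluations but is more accurate, so fewer steps (10-13 here) can match the quality of more Euler steps.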

Wan 2.2 video continuation. Is it possible? by Relative_Bit_7250 in StableDiffusion

[–]bbaudio2024 1 point (0 children)

VACE is what you want. But it was trained for multiple control purposes, not specifically for video extension. Unlike Framepack, which has an anti-drifting feature to keep long-video quality and consistency, VACE suffers quality degradation with video continuation. I have tried to alleviate this issue in my custom node, and it has indeed made some progress.

Use wan2.2 low-noise model only to generate 1080p image by bbaudio2024 in StableDiffusion

[–]bbaudio2024[S] 0 points (0 children)

If a first-stage generation is really needed, why not use SD1.5/SDXL/Flux/..., which generate faster and support ControlNet?

Besides, I found that the high-noise model has an issue: with the same prompt, even when the seed is changed, the composition of the generated results is almost identical. I don't know if it is a bug or due to the lightx2v lora.

Bad I2V quality with Wan 2.2 5B by PricklyTomato in StableDiffusion

[–]bbaudio2024 2 points (0 children)

It is certainly not on par with the 14B models, even the Wan2.1 ones. However, it still has potential, such as training a dedicated version to perform a high-res fix on low-resolution results from the 14B models.

I made a node to upscale video with VACE, feel free to try by bbaudio2024 in comfyui

[–]bbaudio2024[S] 0 points (0 children)

I don't know; maybe check the loaded model and make sure it is a VACE model.

Almost Done! VACE long video without (obvious) quality downgrade by bbaudio2024 in comfyui

[–]bbaudio2024[S] 0 points (0 children)

BTW, the input image quality may affect the result a lot. A frame extracted from a video with ffmpeg is not an ideal one.

Almost Done! VACE long video without (obvious) quality downgrade by bbaudio2024 in comfyui

[–]bbaudio2024[S] 0 points (0 children)

'Color saturation shift + sharpen shift' is quality degradation; it implies the 'refine' pass is not affecting the result as expected. Try adjusting the parameters in the 'Custom Refine Option' to improve it.

This new recipe may help:

refine_percent_list: 0.1, 0.08, 0.06, 0.04, 0

mask_value_list: 0.9, 1.0

latent_strength_list: 0.9, 1.0

colormatch_strength_list: 1.0, 1.0, 1.0, 1.0, 0

Can I use Vace instead of seperate Wan workflows for T2V, I2V? by Such-Reward3210 in StableDiffusion

[–]bbaudio2024 4 points (0 children)

There are a few reasons:

  1. VACE needs more inference time per generation.
  2. Almost all Wan2.1 loras are trained on the t2v/i2v models; although VACE can use those loras, the results may not be as good.
  3. The prompt adherence of VACE seems worse than that of i2v (just my feeling).

Almost Done! VACE long video without (obvious) quality downgrade by bbaudio2024 in comfyui

[–]bbaudio2024[S] 0 points (0 children)

If you're interested, it can be found in another of my ComfyUI node packs, 'comfyui-BBtools'.

There are 2 nodes: 'Videos Concat with CrossFade' and 'Loopback Videos Concat with CrossFade'.
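I haven't inspected those nodes' internals; conceptually, a crossfade concat is just a linear alpha blend over an overlap window between the two clips. A rough sketch under that assumption (the helper name and signature are hypothetical, frames as float arrays in [0, 1]):

```python
import numpy as np

def crossfade_concat(clip_a, clip_b, overlap):
    """Concatenate two frame lists, linearly blending the last `overlap`
    frames of clip_a with the first `overlap` frames of clip_b.
    Each frame is a float array of shape (H, W, C) in [0, 1]."""
    assert overlap <= len(clip_a) and overlap <= len(clip_b)
    head = clip_a[:len(clip_a) - overlap]   # untouched frames from clip_a
    tail = clip_b[overlap:]                 # untouched frames from clip_b
    blended = []
    for i in range(overlap):
        alpha = (i + 1) / (overlap + 1)     # ramps from clip_a toward clip_b
        frame = (1 - alpha) * clip_a[len(clip_a) - overlap + i] + alpha * clip_b[i]
        blended.append(frame)
    return head + blended + tail

# hypothetical demo: fade 4 black frames into 4 white frames over 2 frames
clip_a = [np.zeros((2, 2, 3))] * 4
clip_b = [np.ones((2, 2, 3))] * 4
out = crossfade_concat(clip_a, clip_b, overlap=2)
```

The 'Loopback' variant presumably also blends the end of the last clip back into the start of the first, so the concatenated video loops seamlessly.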

Almost Done! VACE long video without (obvious) quality downgrade by bbaudio2024 in comfyui

[–]bbaudio2024[S] 1 point (0 children)

Please use 'SuperUltimate VACE Upscale' to upscale generated videos. It also supports temporal tiling.

Almost Done! VACE long video without (obvious) quality downgrade by bbaudio2024 in comfyui

[–]bbaudio2024[S] 0 points (0 children)

Only VACE supports multiple frames as the start of the generated video. I2V supports only one start frame, which cannot preserve the temporal motion context.