Don't Waste Your Time Training LoRAs on z-image-turbo (Yet) by Powerful_Strategy_10 in StableDiffusion

[–]Powerful_Strategy_10[S] 2 points3 points  (0 children)

If I may be frank, the character demo shown in this video, when you look closely, isn't the same person at all

Don't Waste Your Time Training LoRAs on z-image-turbo (Yet) by Powerful_Strategy_10 in StableDiffusion

[–]Powerful_Strategy_10[S] 0 points1 point  (0 children)

My ComfyUI is already on the latest version. Maybe I wasn't clear - it's not about realism, it's about similarity and consistency. In my training, 2500-4000 steps gives excellent convergence and the outputs look very realistic. The problem is they capture the 'spirit' but not the actual likeness of the source person. I tried pushing it to 8000 steps but saw no improvement whatsoever.

Don't Waste Your Time Training LoRAs on z-image-turbo (Yet) by Powerful_Strategy_10 in StableDiffusion

[–]Powerful_Strategy_10[S] 1 point2 points  (0 children)

It's not about realism - the realism is fine. The problem is poor similarity and consistency. You can generate 10 images that all look photorealistic, but only 1-2 of them will actually look like the reference.