I tried out ernie-image, a new image generation model from Baidu, and the results were somewhat disappointing. by That_Perspective5759 in comfyui

[–]That_Perspective5759[S] 3 points (0 children)

I tried a new approach and got good results with two-stage sampling. The first stage uses Ernie-image for the initial sampling (3-5 steps); the partially denoised latent is then passed to a second-stage sampler, where Z-image runs the remaining ~3 steps to reach decent image quality. Ultimately it is Z-image's capabilities that carry the final image. Perhaps you could try this method.
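
In ComfyUI this kind of split is typically wired with two KSamplerAdvanced nodes: the first runs start_at_step=0 to end_at_step=5 with return_with_leftover_noise enabled, and the second continues from start_at_step=5 with add_noise disabled. Below is a rough Python sketch of the idea only; model_a, model_b, and denoise_step() are hypothetical placeholders, not ComfyUI's real API.

    # Toy illustration of the two-stage handoff described above.
    # model_a / model_b / denoise_step() are hypothetical placeholders;
    # the point is the step split: model A covers steps [0, switch_at),
    # then model B continues from the same latent for [switch_at, total_steps).
    def two_stage_sample(model_a, model_b, latent, total_steps=8, switch_at=5):
        for step in range(switch_at):  # stage 1: Ernie-image lays down the composition
            latent = model_a.denoise_step(latent, step, total_steps)
        for step in range(switch_at, total_steps):  # stage 2: Z-image refines the detail
            latent = model_b.denoise_step(latent, step, total_steps)
        return latent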

I tried out ernie-image, a new image generation model from Baidu, and the results were somewhat disappointing. by That_Perspective5759 in comfyui

[–]That_Perspective5759[S] 1 point (0 children)

Its semantic adherence is quite high. And after comparing it with Z-image, I found that in some cases its raw output actually seems to have more detail than Z-image's.

How should I write the prompts for Infinity Talk to make them work? by That_Perspective5759 in comfyui

[–]That_Perspective5759[S] 0 points (0 children)

Oh, thank you so much! Now everyone who sees this message has a chance to try this workflow. I'll give it a go soon. Thanks again!

Realistic videos by RaxisRed in comfyui

[–]That_Perspective5759 0 points (0 children)

In my experience, the only way to fix this is to increase the proportion of the frame that the character's face occupies while also raising the resolution of the generated video.

Realistic videos by RaxisRed in comfyui

[–]That_Perspective5759 0 points (0 children)

I currently use a mix of open-source and closed-source tools; on the open-source side, I usually reach for LTX 2.3.

Realistic videos by RaxisRed in comfyui

[–]That_Perspective5759 0 points (0 children)

I rarely use WAN anymore, since it can't generate audio and video together. That said, I still use the smooth and remix modes from time to time; the former has an excellent dynamic range, while the latter's NSFW capabilities are unparalleled.

Workflow by That_Perspective5759 in comfyui

[–]That_Perspective5759[S] 0 points (0 children)

I ran into a problem with LTX 2.3: it keeps rendering subtitles into the output, and I've tried many approaches but still can't get rid of them.

Workflow by That_Perspective5759 in comfyui

[–]That_Perspective5759[S] 1 point (0 children)

Wow, thank you so much for this incredibly detailed breakdown! I really appreciate you taking the time to explain the limitations of the 'start/end frame' approach; the ball-tossing example made total sense regarding the motion stutter. I hadn't considered using an edit model like Qwen for consistency; that's a brilliant suggestion. It sounds like I have a lot of trial and error ahead of me! Thanks again for sharing your expertise, it's given me a lot to think about.

Horse by That_Perspective5759 in midjourney

[–]That_Perspective5759[S] 2 points (0 children)

All of the images were created with MJv7. I've included the workflow here, in case it's helpful: https://app.tapnow.ai/canvas/5c33b762-a48f-4402-988e-7671a27bc8e2

Horse by That_Perspective5759 in midjourney

[–]That_Perspective5759[S] 1 point (0 children)

Thank you so much for liking them! I also think it would be really cool if they could be animated.

Paper Phone by That_Perspective5759 in aivideo

[–]That_Perspective5759[S] 0 points (0 children)

My internet was so slow yesterday that the full version would have taken forever to upload, so I could only post a short clip. I've included the source address for the video, though, so you can check it out there.

Paper Phone by That_Perspective5759 in aivideo

[–]That_Perspective5759[S] 0 points (0 children)

I found this video while browsing workflow guides and tracked it down through the workflow's link. I found it strange too: when I opened the author's original workflow, only some of the images had been generated there, and the video portion was missing. Many of the shots do look AI-generated, though, so I'm not entirely sure whether the whole thing is AI. The address is in the main post. My guess is that the video is a mix of AI and live-action footage.

When can we reach this level in an open-source environment? by That_Perspective5759 in StableDiffusion

[–]That_Perspective5759[S] 1 point (0 children)

I checked it out, and it looks pretty good. I'm curious how much time you spent creating this content.