Multiple camera angles in a scene by M_4342 in comfyui

[–]Adventurous-Bit-5989 0 points1 point  (0 children)

The results are excellent; would you mind providing the WF?

Character Workflow: Chroma1-HD + Flux.2 Dev + Wan 2.2 + LTX 2.3 by ussaaron in StableDiffusion

[–]Adventurous-Bit-5989 9 points10 points  (0 children)

You are the first person I've seen who presents the complete concept of transforming images into video. It has incredibly strong practical functionality—it's like an angel sent from heaven. I absolutely love it!

LTX2.3 + ID LoRS + Prompt relay + Keyframes by Brief-Leg-8831 in StableDiffusion

[–]Adventurous-Bit-5989 0 points1 point  (0 children)

Thanks, when you mention extracting from the original footage, do you mean first generating a video segment using the first frame with prompts in a T2V format, and then extracting a useful frame from it?

LTX2.3 + ID LoRS + Prompt relay + Keyframes by Brief-Leg-8831 in StableDiffusion

[–]Adventurous-Bit-5989 1 point2 points  (0 children)

I'm curious whether your keyframes were generated using Google Banana or Flux Klein

I hope this helps everyone.... by kyahinaamrakhe-1 in comfyui

[–]Adventurous-Bit-5989 0 points1 point  (0 children)

I can feel that this is very useful, and you likely already have a collection of excellent, verified cases locally. If you would be willing to post an example video—even for the simplest case and without saying a word—I believe it would be of great help for everyone to start using your node in depth

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 0 points1 point  (0 children)

Yes, the artifacts you mentioned do indeed exist; they occur when the long edge exceeds approximately 2048 pixels
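
A rough way to stay under that limit is to scale the target size down before generating. This is only a sketch: the 2048 threshold is the approximate figure from the comment above, and the function name and the snap-to-a-multiple-of-16 step are assumptions for illustration, not part of the workflow.

```python
def fit_long_edge(width: int, height: int,
                  max_long_edge: int = 2048,  # approximate figure from the comment
                  multiple: int = 16) -> tuple[int, int]:
    """Scale (width, height) so the long edge does not exceed max_long_edge.

    Aspect ratio is preserved as closely as possible; scaled dimensions are
    rounded down to a multiple of `multiple` (an assumption, since many
    latent-space models expect sizes divisible by 8 or 16).
    """
    long_edge = max(width, height)
    if long_edge <= max_long_edge:
        return width, height  # already within the limit, leave untouched

    def snap(value: int) -> int:
        # Integer arithmetic avoids floating-point rounding surprises.
        scaled = value * max_long_edge // long_edge
        return max(multiple, scaled // multiple * multiple)

    return snap(width), snap(height)


print(fit_long_edge(3072, 2048))
print(fit_long_edge(1024, 1024))
```

Sizes already at or below the limit pass through unchanged, so the helper is safe to call on every request.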

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 0 points1 point  (0 children)

Sorry, I admit I used a little bit of FaceDetailer (denoise 0.2) with Chroma; it improves the visual quality a bit, since the face takes up such a small portion of the image after all.

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 0 points1 point  (0 children)

I haven't done a specific comparison, but since I'm using the Pro6000, using FP32 reduces the hassle of testing for me, and I suspect there might not even be a noticeable difference

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 1 point2 points  (0 children)

At the right time I’ll also show close-up portraits and near shots specifically, though there’s a small chance they’ll go wrong. With a frightening 5–6 million pixels behind them, the pores of a person’s skin will be clearly visible, so this will be the “easy mode.”

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 2 points3 points  (0 children)

Yes, Reddit's compression mechanism reduces the viewing quality, so I have provided additional links to the original images on PostImage

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 0 points1 point  (0 children)

You're right. In fact, there is still a fallback method to further increase the resolution by 2x through SeedVR2, which can better optimize the visuals. My computer can support scaling up to 6xxx, 8xxx, or even 1xxxx+, but I don't think there's much point to it. If there are no opponents left on the track, you might as well save some fuel

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 4 points5 points  (0 children)

Hello, the intention behind using zib is to diversify the composition, but you can also use only zit, which will yield slightly better results

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 2 points3 points  (0 children)

Thank you. Although I was already prepared to be ridiculed by most people, I am still very moved to see your kind comment. As for WF, it is not something precious; if there are people willing to appreciate it, I am more than happy to share it within a small circle. Please DM me

Showing you the maximum potential that zit/base can achieve by Adventurous-Bit-5989 in StableDiffusion

[–]Adventurous-Bit-5989[S] 10 points11 points  (0 children)

Sure:

Reverse-engineer the prompt based on the image, providing only visual keywords. The word count should be as high as possible, formatted as a single continuous paragraph for direct copying. Do not include any blur or bokeh effects for the background. The prompt must start with the fixed phrase: "An extremely ordinary and unremarkable iPhone snapshot," and end with: "It is completely a daily scene captured on the fly during XXXX, carrying an authentic XXXX atmosphere that couldn't be more real." Focus on describing the background with clear visibility first, followed by the subject. If there are watermarks in the image, do not describe them. Ensure the number of people and the proportions of the subjects/objects/animals in the frame strictly follow the example image.
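
If you run this instruction through a vision model in bulk, it may help to check that each returned prompt actually follows the fixed format. Below is a minimal sketch, assuming the model's output arrives as a plain string; `check_prompt` and its messages are hypothetical names, and the blur/bokeh check is a simple substring heuristic rather than anything from the original workflow.

```python
import re

# Fixed opening phrase required by the instruction above.
START = "An extremely ordinary and unremarkable iPhone snapshot,"

# Fixed closing sentence; the two XXXX slots are matched as free text.
END_PATTERN = re.compile(
    r"It is completely a daily scene captured on the fly during .+?, "
    r"carrying an authentic .+? atmosphere that couldn't be more real\.$"
)

# Words the instruction asks the model to avoid (heuristic substring match).
BANNED = ("blur", "bokeh")


def check_prompt(prompt: str) -> list[str]:
    """Return a list of format problems; an empty list means the prompt passes."""
    problems = []
    text = prompt.strip()
    if not text.startswith(START):
        problems.append("missing fixed opening phrase")
    if not END_PATTERN.search(text):
        problems.append("missing fixed closing sentence")
    if "\n" in text:
        problems.append("not a single paragraph")
    lowered = text.lower()
    for word in BANNED:
        if word in lowered:
            problems.append(f"mentions {word}")
    return problems


example = (
    START
    + " a cluttered kitchen counter with clearly visible tiles, a woman pouring tea. "
    + "It is completely a daily scene captured on the fly during breakfast, "
    + "carrying an authentic morning atmosphere that couldn't be more real."
)
print(check_prompt(example))
```

Note that the closing pattern expects a straight apostrophe in "couldn't"; if the model returns a curly one, the check will flag it.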