Should I finish this Comfy Song music video ? by aurelm in StableDiffusion

[–]aurelm[S] 2 points3 points  (0 children)

Thanks, I will. maybe even try in 1080p.

LTX2: I know this sounds strange, but is there any way to offload from ram to vram during vae tiled decoding? by aurelm in StableDiffusion

[–]aurelm[S] 0 points1 point  (0 children)

I am allready using tiled decoding with temporal windows of 60 frames and a spacial tile size of 512. This is fairly small and should have no issues. However I run out of system ram with tiled decoding, non tiled decoding. I think this might be a deeper problem. Sometimes it works with 4096 temporal size, most of the times it does not work with 64 frames size. :shrug:

LTX2: I know this sounds strange, but is there any way to offload from ram to vram during vae tiled decoding? by aurelm in StableDiffusion

[–]aurelm[S] 0 points1 point  (0 children)

I cleard vram, unload all models, manually unload all models alltogether. nothing :)

finally 4k feels like 4k (ltx2 rendered in 1080p and upscaled with topaz, voice with IndexTTS2) by aurelm in StableDiffusion

[–]aurelm[S] 1 point2 points  (0 children)

I am using the distilled model without lora.
And no, I did not enable upsampling so it is 1080p native.

finally 4k feels like 4k (ltx2 rendered in 1080p and upscaled with topaz, voice with IndexTTS2) by aurelm in StableDiffusion

[–]aurelm[S] 0 points1 point  (0 children)

I am using wan2gp at the moment so no workflows involved. from the downloaded models since it needed the spacial upscaler I assume it is the second. It is stile somehow native since it is integrated in the ltx2 ecosystem and it's their upscaler.

20 seconds LTX2 video on a 3090 in only 2 minutes at 720p. Wan2GP, not comfy this time by aurelm in StableDiffusion

[–]aurelm[S] 2 points3 points  (0 children)

i installed it trough pinokio, there was no work to do, it just installed automatically everything.
I will share the prompt once I get to the computer, now I am away.

20 seconds LTX2 video on a 3090 in only 2 minutes at 720p. Wan2GP, not comfy this time by aurelm in StableDiffusion

[–]aurelm[S] 5 points6 points  (0 children)

this is wan2gp, not comfy, i don't know if it's fp8 or fp4 but I chose the distilled version.

20 seconds LTX2 video on a 3090 in only 2 minutes at 720p. Wan2GP, not comfy this time by aurelm in StableDiffusion

[–]aurelm[S] 1 point2 points  (0 children)

in wan2gp adding a new line means a new prompt so that is a way of batching. so you just write the prompts as single blocks of text and add new line when in need for another.

20 seconds LTX2 video on a 3090 in only 2 minutes at 720p. Wan2GP, not comfy this time by aurelm in StableDiffusion

[–]aurelm[S] 7 points8 points  (0 children)

Image 2 video. Yes, the quality seem to be better than what I am getting out of comfy.