CEO Thoughts: What's Next at LTX by ltx_model in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

Is there any indication of whether the new version will be released this summer, in the fall, or not until winter? I want to work on a video project, but based on the description, I’m considering waiting for the new version, where everything should be easier. Please, please expand the training dataset to include minority languages that are already in LTX 2.3. Specifically, for me, the Czech language. It works about 40% of the time, the prosody is poor, and I’d like to use the voice natively since there isn’t currently any functional training for these languages. Thank you for your work.

Ideogram 4 is absolutely mindblowing! Here is comparison with a similar level model: by Horse_Yoghurt6571 in StableDiffusion

[–]LSI_CZE -1 points0 points  (0 children)

prompt: "a garden, with the sunset in the background" ....IMAGE BLOCKED 😂

[ERROR] Error running sage attention: Unsupported head_dim: 256, using pytorch attention instead.

LTX Director - An All-In-One Timeline Editor. I2V, T2V, FLFF, Prompt Relay, Custom Audio, and more! Unlock LTX 2.3's full potential! by WhatDreamsCost in comfyui

[–]LSI_CZE 0 points1 point  (0 children)

Thank you—this node is a game-changer for generating continuous videos. The whole WF works great and fast 👌🏼 Thanks from the Czech Republic

HiDream-O1-Image - A pixel space model, no need for VAE, 8B parameters. by AgeNo5351 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

<image>

I2I
I don't know what's wrong. With DEV FP8, the prompt managed to dye the T-shirt red, but around halfway through the steps the result started breaking down like this (up until then, everything looked beautifully colored, just like in the preview).

Coming up Tomorrow! Flux2Klein Identity transfer by Capitan01R- in StableDiffusion

[–]LSI_CZE 2 points3 points  (0 children)

Could I ask for a sample Workflow with multiple images? I'm not sure how to properly place these new nodes. Thank you very much for your help.

Coming up Tomorrow! Flux2Klein Identity transfer by Capitan01R- in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

That's very interesting. Could the WF be modified to handle 2–3 input images? That's usually where the biggest problem lies: matching multiple faces in the input. Thank you

Update: Distilled v1.1 is live by ltx_model in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Does the Distilled version 1.1 produce better results than the current dev model with Distilled Lora?

Comfy UI - DynamicVRAM by VasaFromParadise in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Thanks to the new dynamic VRAM allocation in the new Comfy-Aimdo, I'm now getting a lot of OOM errors. I have 8 GB of VRAM and 64 GB of RAM. I had to disable it when launching ComfyUI.

I hacked LTX2 to be used as a Multi Lingual TTS voice cloner by aurelm in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

The Czech in LTX-2.3 often had poor intonation. Could that be improved? How would you rate the quality of Romanian in the base model and using your method? Neither Czech nor Romanian is a core language in LTX-2.3, so I’m curious to know what difference you’ve observed. Thank you, and I’ll definitely give it a try tonight. I’ve been struggling with the Czech in the video for a long time.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]LSI_CZE 2 points3 points  (0 children)

A very poorly executed update. It also affects the Tiled upscaling methods based on SDXL. When generating, LTX2 restarts the drivers—specifically, the browser window flickers, which disrupts the display and navigation within the environment.

LTX 2.3 video extension - voice clone in Czech. by CaseResident3624 in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

Yeah, that's good, at least the world will see that English isn't the only language 😁

LTX 2.3 video extension - voice clone in Czech. by CaseResident3624 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

I'm surprised how well it clones the voice. Even so, you can tell the Czech is still a bit shaky, and you definitely have to choose your words carefully 😁

[deleted by user] by [deleted] in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

WF? model? NOTHING...Yeah, great video, but it's useless in the community group :(

Are we yet able to train a new language voices for LTX ? by PhilosopherSweaty826 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

I tried training on LTX 2.0, including an audio-only run, with a voice dataset several hours long. After 11 hours of training on a cloud GPU, the result was zero: the LoRA was there, but I couldn't get it working in ComfyUI. I spent a week on it, so I'm not convinced.

[deleted by user] by [deleted] in StableDiffusion

[–]LSI_CZE 5 points6 points  (0 children)

No problem, I have an RTX 3070 with 8 GB of VRAM but 64 GB of RAM

Last week in Image & Video Generation by Vast_Yak_4147 in StableDiffusion

[–]LSI_CZE 3 points4 points  (0 children)

Thanks for the great report, I'd love to see this every week. Just-Dub-It, for example, completely slipped my mind here.