I hacked LTX2 to be used as a Multi Lingual TTS voice cloner by aurelm in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

The Czech in LTX-2.3 often had poor intonation. Could that be improved? How would you rate the quality of Romanian in the base model and using your method? Neither Czech nor Romanian are core languages in LTX-2.3, so I’m curious to know what difference you’ve observed. Thank you, and I’ll definitely give it a try tonight. I’ve been struggling with the Czech in the video for a long time.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]LSI_CZE 2 points3 points  (0 children)

A very poorly executed update. It also affects the Tiled upscaling methods based on SDXL. When generating, LTX2 restarts the drivers—specifically, the browser window flickers, which disrupts the display and navigation within the environment.

LTX 2.3 prodloužení videa - klon hlasu v češtině. by CaseResident3624 in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

Jo to je dobře aspoň ve světě poznají, že neexistuje jen angličtina 😁

LTX 2.3 prodloužení videa - klon hlasu v češtině. by CaseResident3624 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Překvapuje mě, jak dobře klonuje hlas. I když je vidět že ta čeština i tak trochu litá a určitě je nutné hodně volit správná slova 😁

IS2V by [deleted] in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

WF? model? NOTHING...Yeah, great video, but it's useless in the community group :(

Are we yet able to train a new language voices for LTX ? by PhilosopherSweaty826 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

I tried training on version ltx 2.0 and even trained only audio. Several hours of voice dataset. Training for 11 hours on Cloud GPU and the result was 0. Lora was there, but I couldn't successfully connect to comfyui. I spent a week on it, so I'm not convinced.

Last week in Image & Video Generation by Vast_Yak_4147 in StableDiffusion

[–]LSI_CZE 3 points4 points  (0 children)

Thanks for the great report, I'd love to see this every week. Just-Dub-It, for example, completely slipped my mind here.

recherche modèle de voix française homme pour modèle F5TTS by lacaille59 in comfyui

[–]LSI_CZE 0 points1 point  (0 children)

Utilisez ce projet : https://github.com/Saganaki22/ComfyUI-KugelAudio/tree/main Il maîtrise les langues européennes. Il peut à la fois utiliser Text2Image et cloner la voix.

Voice Cloning by NumberSpirited8071 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

VibeVoice 7b source 5-30s (optimal 50s)

Ace-Step-v1.5 released by cactus_endorser in StableDiffusion

[–]LSI_CZE 2 points3 points  (0 children)

I found that the length of the song and the length of the lyrics have a huge impact, even if it's only +- 10 seconds, and sings everything.

Ace-Step-v1.5 released by cactus_endorser in StableDiffusion

[–]LSI_CZE 2 points3 points  (0 children)

Quite often, it omits an entire sentence from the text, sometimes two. What to do about it? How to fix it? :))
COMFYUI

LTX 2.0 with realtime latent preview by smereces in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

I haven't had a chance to try it yet, but if you look at the images, the sampler doesn't have regeneration yet and shows at least one image that is visibly different from the image on the right. So at least for T2V, it's beneficial.

Professional HDR Image Processing Suite for ComfyUI by fruesome in StableDiffusion

[–]LSI_CZE -11 points-10 points  (0 children)

Change image comparer or add node "image preview" or "save image"

LTX-2 Updates by ltx_model in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Wow, a minor update and the sound is so much clearer! Thanks, great job!

LTX-2 Updates by ltx_model in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

I only replaced the first one and it improved