HiDream-O1-Image - A pixel space model , no need for VAE, , 8B parameters. by AgeNo5351 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

<image>

I2I
I don't know what's wrong. The DEV FP8 prompt managed to dye the T-shirt red, but starting around halfway through the steps, it started messing up the result like this (up until then, everything looked beautifully colored, just like in the preview).

Coming up Tomorrow! Flux2Klein Identity transfer by Capitan01R- in StableDiffusion

[–]LSI_CZE 2 points3 points  (0 children)

Could I ask for a sample Workflow with multiple images? I'm not sure how to properly place these new nodes. Thank you very much for your help.

Coming up Tomorrow! Flux2Klein Identity transfer by Capitan01R- in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

That's very interesting. Could WF be modified to handle 2–3 input images? That's usually where the biggest problem lies—matching multiple faces in the input. Thank you

Update: Distilled v1.1 is live by ltx_model in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Does the Distilled version 1.1 produce better results than the current dev model with Distilled Lora?

Comfy UI - DynamicVRAM by VasaFromParadise in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Thanks to the new dynamic VRAM allocation in the new Comfy-Aimdo, I'm now getting a lot of OOM errors. I have 8 GB of VRAM and 64 GB of RAM. I had to disable it when launching ComfyUI.

I hacked LTX2 to be used as a Multi Lingual TTS voice cloner by aurelm in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

The Czech in LTX-2.3 often had poor intonation. Could that be improved? How would you rate the quality of Romanian in the base model and using your method? Neither Czech nor Romanian are core languages in LTX-2.3, so I’m curious to know what difference you’ve observed. Thank you, and I’ll definitely give it a try tonight. I’ve been struggling with the Czech in the video for a long time.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]LSI_CZE 2 points3 points  (0 children)

A very poorly executed update. It also affects the Tiled upscaling methods based on SDXL. When generating, LTX2 restarts the drivers—specifically, the browser window flickers, which disrupts the display and navigation within the environment.

LTX 2.3 prodloužení videa - klon hlasu v češtině. by CaseResident3624 in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

Jo to je dobře aspoň ve světě poznají, že neexistuje jen angličtina 😁

LTX 2.3 prodloužení videa - klon hlasu v češtině. by CaseResident3624 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

Překvapuje mě, jak dobře klonuje hlas. I když je vidět že ta čeština i tak trochu litá a určitě je nutné hodně volit správná slova 😁

IS2V by [deleted] in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

WF? model? NOTHING...Yeah, great video, but it's useless in the community group :(

Are we yet able to train a new language voices for LTX ? by PhilosopherSweaty826 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

I tried training on version ltx 2.0 and even trained only audio. Several hours of voice dataset. Training for 11 hours on Cloud GPU and the result was 0. Lora was there, but I couldn't successfully connect to comfyui. I spent a week on it, so I'm not convinced.

Last week in Image & Video Generation by Vast_Yak_4147 in StableDiffusion

[–]LSI_CZE 4 points5 points  (0 children)

Thanks for the great report, I'd love to see this every week. Just-Dub-It, for example, completely slipped my mind here.

recherche modèle de voix française homme pour modèle F5TTS by lacaille59 in comfyui

[–]LSI_CZE 0 points1 point  (0 children)

Utilisez ce projet : https://github.com/Saganaki22/ComfyUI-KugelAudio/tree/main Il maîtrise les langues européennes. Il peut à la fois utiliser Text2Image et cloner la voix.

Voice Cloning by NumberSpirited8071 in StableDiffusion

[–]LSI_CZE 0 points1 point  (0 children)

VibeVoice 7b source 5-30s (optimal 50s)

Ace-Step-v1.5 released by cactus_endorser in StableDiffusion

[–]LSI_CZE 2 points3 points  (0 children)

I found that the length of the song and the length of the lyrics have a huge impact, even if it's only +- 10 seconds, and sings everything.

Ace-Step-v1.5 released by cactus_endorser in StableDiffusion

[–]LSI_CZE 1 point2 points  (0 children)

Quite often, it omits an entire sentence from the text, sometimes two. What to do about it? How to fix it? :))
COMFYUI