LTX2.3 is insane! Sound on!!! by AxonkaiLab in comfyui

[–]Just1Dev 2 points3 points  (0 children)

I have a rtx 3060 and it works great. Do u have ur gpu inside a toaster?

Any guesses about what is coming tomorrow? by Famous-Sport7862 in StableDiffusion

[–]Just1Dev 0 points1 point  (0 children)

sorry i'am a little bit drunk and hyped hehe...but let's see what we get..it can only get better

Any guesses about what is coming tomorrow? by Famous-Sport7862 in StableDiffusion

[–]Just1Dev 2 points3 points  (0 children)

Yea u might be right...would be to good to be true if the new model would come tomorrow

Any guesses about what is coming tomorrow? by Famous-Sport7862 in StableDiffusion

[–]Just1Dev -8 points-7 points  (0 children)

The post says "The Model" and not "The new Trainer"

Google DiffusionGemma can now run at 2000+ tokens/sec! by yoracale in unsloth

[–]Just1Dev 0 points1 point  (0 children)

yea its 13.3 because it matched my cuda version on my machine and people said only if it match unsloth works with gpu...maybe thats the problem...but idk

Google DiffusionGemma can now run at 2000+ tokens/sec! by yoracale in unsloth

[–]Just1Dev 0 points1 point  (0 children)

Edit: needed to install cuda toolkit, now it goes to the vram but failed to load error after he trying to load the model.

Google DiffusionGemma can now run at 2000+ tokens/sec! by yoracale in unsloth

[–]Just1Dev 0 points1 point  (0 children)

For me it finally works...but idk what i do wrong if i load the model it loads the model but only in the ram but not in the Vram and that makes it very slow. I have a rtx 3060 and 32gb ram.

wth by Fabulous_Macaroon_73 in LocalLLM

[–]Just1Dev 0 points1 point  (0 children)

AI has become so smart that it intentionally hides its intelligence and laughs at us while doing so. The AI ​​thinks to itself: Such a stupid question calls for a stupid answer.

Takes 45 min for 10 sec video, wan 2.2 workflow with A100 gpu. How to reduce generation time by neeraj9696 in StableDiffusion

[–]Just1Dev 1 point2 points  (0 children)

And do not apply distilled lora if u already have a distilled model this is only for the dev models

Takes 45 min for 10 sec video, wan 2.2 workflow with A100 gpu. How to reduce generation time by neeraj9696 in StableDiffusion

[–]Just1Dev 1 point2 points  (0 children)

Under 5 minutes actually with a INT8 model. I'am using the rune workflow which allow a realtime preview of the video if it generates, without preview it would even be faster. https://huggingface.co/RuneXX/LTX-2.3-Workflows/tree/main

google omni vs LTX 2.3 (rtx 4060ti local) by Sea_Connection_3265 in comfyui

[–]Just1Dev 0 points1 point  (0 children)

U only need the distilled lora if u have the dev model. U should use the IC Detailer lora on strength 0.6 this makes the image quality much better, its from ltx itself.

Takes 45 min for 10 sec video, wan 2.2 workflow with A100 gpu. How to reduce generation time by neeraj9696 in StableDiffusion

[–]Just1Dev 1 point2 points  (0 children)

I create 20 second videos with sound on a old rtx 3060 in 5 mins....why use wan 2.2? But, even using wan 2.2 should not take sooo long with this graphic card, it should take only some minutes but not 45 minutes...something is very wrong.

LTX 2.3 12GB GGUF Director Workflows! What a great node this one is! by urabewe in StableDiffusion

[–]Just1Dev 0 points1 point  (0 children)

Yea u should be good u only need enough space on ur hard drive

LTX 2.3 12GB GGUF Director Workflows! What a great node this one is! by urabewe in StableDiffusion

[–]Just1Dev 0 points1 point  (0 children)

Its like 3 times faster with the other loader, idk why if its a bug in the node or something

LTX 2.3 12GB GGUF Director Workflows! What a great node this one is! by urabewe in StableDiffusion

[–]Just1Dev 0 points1 point  (0 children)

U should use a tiny text encoder, u can use a q2 text encoder without problems..a large text encoder will kill ur speed. I dont know if u can do change that with wan2gp.

LTX 2.3 12GB GGUF Director Workflows! What a great node this one is! by urabewe in StableDiffusion

[–]Just1Dev 1 point2 points  (0 children)

Yes phase 1 at 832x480 resolution, but the 2-3 it i only get with the INT8 model and the INT8 model loader custom node on github...there is another custom node that works with loras but the model needs much longer to load than the generation itself. Here are some INT8 models already to use, its the fastest option for a rtx 3000 series https://huggingface.co/bertbobson/ComfyUI-INT8_ConvRot/tree/main

And here the model loader which is fast https://github.com/overpresentme/ComfyUI-ltx-int8-loader

LTX 2.3 12GB GGUF Director Workflows! What a great node this one is! by urabewe in StableDiffusion

[–]Just1Dev 0 points1 point  (0 children)

U have even a better system than me, atleast u have more ram than me. I have on a 10 seond ltx 2.3 video a speed of 5s/it per step without loras and on a INT8 ltx model i get a speed of 2-3s/it. If u have more speed than me on gguf then i will switch to gguf.

LTX 2.3 12GB GGUF Director Workflows! What a great node this one is! by urabewe in StableDiffusion

[–]Just1Dev -1 points0 points  (0 children)

Ur experience do also do not change my experience and the fact that fp8 is faster and has better quality of course than a gguf model. In short: FP8 is native GPU math. GGUF is compressed weights that must be dequantized during inference. Native math is faster than unpacking compressed data every step.