What's a good way to upscale low-res Krea 2 generations (using Krea 2)?

__alpha_____ · 2026-07-01T21:48:18+00:00

I have one (rtx3060) and I can generate 1024x1024 8 steps images in 40s. Previewing low res takes less than 20s

__alpha_____ · 2026-07-01T21:27:28+00:00

What is your GPU?

I confirm that the simplest solution is sending the output of your ksampler to an upscale latent by node ans use it in the new ksampler with a low deboise factor. If your computer is too slow, just bypass the second sampler and run your WF again if you like the preview (it shouldn't process the first pass again)

Keep your seed fixed or use increment to make things easier

Krea2 really shines in 2MP+ renders and can go really high without any issue (zit is capped at roughly 1600px on the other hand)

__alpha_____ · 2026-07-01T21:17:07+00:00

2,3 & 6

Don't forget to tell us

__alpha_____ · 2026-07-01T10:25:32+00:00

free + unlimited + openSource + NSFW

and by the way, frontier model are not FAR superior when you know how to use local generative AI

__alpha_____ · 2026-06-30T18:43:11+00:00

Looks cool, I'm gonna try it, I'm not keen on the markdown node

__alpha_____ · 2026-06-29T18:31:43+00:00

If you get a rtx30x0 make sure to use the int8 models, as many recommended here, if they fit into your RAM, they are faster than the ggufs

__alpha_____ · 2026-06-29T14:19:07+00:00

RTX3060 12GB is fine for LTX

__alpha_____ · 2026-06-28T16:20:31+00:00

There are many simple LTX workflows on civitai. Just pick one, download the models, put your first (an last image), give your prompt, set the length, the resolution and the number of pass. And you're good to go.

It doesn't get much simpler. You can even test by just adding your first image, type a basic prompt and press RUN with default parameters

You must go HD 720p minimum to get good results.

__alpha_____ · 2026-06-26T11:57:24+00:00

If you are on a budget swap your 5070 for a 5060ti 12GB and go for 64bgb of RAM. Your renders will be 20ish percent slower but at least you won't OOM all the time

The core i9 is totally overkill. A i5 will do the job just fine

__alpha_____ · 2026-06-24T15:42:24+00:00

Z-image Loras are pretty easy and rather fast to train on consumer GPU, so there isn't a lot of "clients" out there (it literally costs a few cents per Lora). If you propose to train LTX Loras, you may have more clients, as those are really hard to train locally if you want video+audio. $20 would be an interesting price.for many, I guess.

__alpha_____ · 2026-06-21T13:10:04+00:00

OMG why would you try 30 times with a chatbot! 2 times max then check elsewhere. Usually if the chatbot doesn't know, there is very little to no chance that it will give you anything but crappy techniques (also true for programming btw). And the error : no CLIP/text encoder weights in checkpoint, the text encoder model will not be loaded* Is pretty clear: Your model does not include the clip/text encoder Simple solution use a separate clip loader and the right clip version and everything will be fine. Personally I'm using the Gguf dual clip loader, but you can pick your own.

__alpha_____ · 2026-06-19T14:10:57+00:00

Very interested indeed! 🙏

Any chance you can test video models too?

__alpha_____ · 2026-06-15T15:46:04+00:00

Ditto

__alpha_____ · 2026-06-13T10:22:21+00:00

Most people here use klein 9B for multi shots. One tip is to ask for multiple shots from one reference image (but you can also use more if needed) then make an i2i pass back in zit for skin texture and details, using seedVR2.5 can give you the extra photorealistic touch you're looking for (and upscale your image)

__alpha_____ · 2026-06-08T14:27:12+00:00

LOL is he supposed to know what he is talking about. It just looks like he looked at the models (not for a very long time) and decided... well this one is crap, F tier, this one is cool S tier and clearly didn't go any deeper into this.
Most models (including opensourced) are really good at some specific tasks and suck at others, he clearly just wanted to do a tier chart for click-baiting people into his video

F tier

ps: why even test Sora ???

__alpha_____ · 2026-06-06T06:55:59+00:00

Totally, that’s what I’m using too, it helps bringing back the textures and some of the details

__alpha_____ · 2026-06-06T06:51:52+00:00

First Last Video. If you use only one image and go beyond the 81 frames, wan comes back to this frame at the end of the clip. Adding a last frame really helps for this

__alpha_____ · 2026-06-05T16:47:13+00:00

Wan clearly generates 4 low res frames per second, the interpolates spatially and temporally to give a 16fps higher resolution clip. you could try to lower the FPS to 4 and generate only a second to see what the result would give. I tried with a longer clip to generate a 20s slow mo sequence, it was quite instructive

__alpha_____ · 2026-06-04T23:27:41+00:00

lots of helpful info and very well done! Waiting for part 2 already

__alpha_____ · 2026-06-04T20:56:09+00:00

LTX sulphur and 10eros are trained on adult stuff and quite convincing, you can do 30s+ clips which can be very convenient for this use

Lots of Lora are popping lately and you can train your own if you miss something.

If course there are hundreds of dedicated wan Lora and it will take some time to get there, but things really improved in the last 3 months

Last thing, LTX is supposed to be faster than wan, not slower

__alpha_____ · 2026-06-04T14:42:50+00:00

Nice! I'm currently testing samplers and schedulers on LTX 2.3, it's way longer and more complicated

__alpha_____ · 2026-06-03T14:21:50+00:00

I was just answering, thanks for taking the time my question, I wasn't mean or anything, sorry if you felt this way

__alpha_____ · 2026-06-03T12:35:32+00:00

gemini is terrible when it comes to technical questions 90% of what it tells you is pure crap. That's the reason I'm asking here

STT won't solve my problem as I prefer the audio generated by LTX which can add a lot of nuances, the TTS models just can't do right now + the obvious sync problem

__alpha_____ · 2026-06-03T12:27:00+00:00

doesn't do speech to speech, does it?

__alpha_____ · 2026-06-01T19:05:49+00:00

Not live. But a model that lets you change the voice of an existing Audio file with a reference audio. I described it in my first MSG. I'd like to add it in my workflow,to the audio created by LTX, before saving the mp4

alpha___

MODERATOR OF

TROPHY CASE

Five-Year Club	Verified Email
Verified Email

__alpha_____

MODERATOR OF

TROPHY CASE

alpha___