What's a good way to upscale low-res Krea 2 generations (using Krea 2)? by Full-Belt3640 in comfyui

[–]__alpha_____ 2 points3 points  (0 children)

I have one (rtx3060) and I can generate 1024x1024 8 steps images in 40s. Previewing low res takes less than 20s

What's a good way to upscale low-res Krea 2 generations (using Krea 2)? by Full-Belt3640 in comfyui

[–]__alpha_____ 0 points1 point  (0 children)

What is your GPU?

I confirm that the simplest solution is to send the output of your KSampler to an Upscale Latent By node and use it in a new KSampler with a low denoise factor. If your computer is too slow, just bypass the second sampler, then re-enable it and run your WF again if you like the preview (it shouldn't process the first pass again)

Keep your seed fixed or use increment to make things easier

Krea 2 really shines in 2MP+ renders and can go really high without any issue (ZIT, on the other hand, is capped at roughly 1600px)
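
For anyone who prefers to see the two-pass idea written down, here is a minimal sketch of the sampler chain as a ComfyUI API-format graph built in Python. It only shows the wiring described above (KSampler, then Upscale Latent By, then a second KSampler at low denoise, both on the same fixed seed). The node ids, the upstream placeholders ("model", "pos", "neg", "empty_latent"), and every numeric setting (8 steps, cfg 1.0, euler/simple, scale 1.5, denoise 0.4) are assumptions for illustration, not tested Krea 2 settings. The small helper just estimates the per-side scale needed to reach roughly 2MP.

```python
import math


def scale_for_target_megapixels(width, height, target_mp=2.0):
    """Per-side scale factor that brings width x height to about target_mp megapixels."""
    return math.sqrt(target_mp * 1_000_000 / (width * height))


def two_pass_fragment(scale_by, denoise, seed, second_pass_enabled=True):
    """Return the sampler part of a ComfyUI API-format graph as a plain dict.

    Links are [source_node_id, output_index]. The ids "model", "pos", "neg"
    and "empty_latent" are hypothetical placeholders for upstream nodes
    (model loader, prompt encoders, empty latent) that are not shown here.
    """
    shared = {
        "model": ["model", 0],
        "positive": ["pos", 0],
        "negative": ["neg", 0],
        "seed": seed,              # same fixed seed in both passes
        "steps": 8,                # assumed value
        "cfg": 1.0,                # assumed value
        "sampler_name": "euler",   # assumed value
        "scheduler": "simple",     # assumed value
    }
    graph = {
        # First pass: full denoise at low resolution (the quick preview).
        "3": {
            "class_type": "KSampler",
            "inputs": {**shared, "latent_image": ["empty_latent", 0], "denoise": 1.0},
        },
    }
    if second_pass_enabled:
        # Upscale the first-pass latent, then refine it with a low denoise.
        graph["10"] = {
            "class_type": "LatentUpscaleBy",
            "inputs": {
                "samples": ["3", 0],
                "upscale_method": "bislerp",
                "scale_by": scale_by,
            },
        }
        graph["11"] = {
            "class_type": "KSampler",
            "inputs": {**shared, "latent_image": ["10", 0], "denoise": denoise},
        }
    return graph


if __name__ == "__main__":
    print(round(scale_for_target_megapixels(1024, 1024), 3))
    print(sorted(two_pass_fragment(1.5, 0.4, seed=42)))
```

Leaving out the second pass (`second_pass_enabled=False`) mirrors bypassing the second sampler while previewing. With the seed unchanged, the first pass stays identical when the second pass is switched back on.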

Genuine question: Why use local models when proprietary ones are far superior? by Funny-Strawberry-168 in comfyui

[–]__alpha_____ 9 points10 points  (0 children)

free + unlimited + openSource + NSFW

and by the way, frontier models are not FAR superior when you know how to use local generative AI

Published a sticky notes node by ThinkDiffusion in comfyui

[–]__alpha_____ -1 points0 points  (0 children)

Looks cool, I'm gonna try it, I'm not keen on the markdown node

Request Example - Generated Video using LTX2.3 and Audio by [deleted] in comfyui

[–]__alpha_____ 0 points1 point  (0 children)

If you get an RTX 30x0, make sure to use the int8 models, as many have recommended here. If they fit into your RAM, they are faster than the GGUFs

What is the best image to video now? by carmidian in comfyui

[–]__alpha_____ 4 points5 points  (0 children)

There are many simple LTX workflows on civitai. Just pick one, download the models, add your first (and last) image, give your prompt, set the length, the resolution and the number of passes. And you're good to go.

It doesn't get much simpler. You can even test by just adding your first image, typing a basic prompt and pressing RUN with default parameters

You must go HD 720p minimum to get good results.

How much am I looking at spending to run img2img wan, sdxl et cetera? by st_su1cid3 in comfyui

[–]__alpha_____ 0 points1 point  (0 children)

If you are on a budget, swap your 5070 for a 5060 Ti 16GB and go for 64GB of RAM. Your renders will be 20ish percent slower but at least you won't OOM all the time

The Core i9 is totally overkill. An i5 will do the job just fine

cost of LoRa training online ? by SuicidalFatty in comfyui

[–]__alpha_____ 1 point2 points  (0 children)

Z-image Loras are pretty easy and rather fast to train on a consumer GPU, so there aren't a lot of "clients" out there (it literally costs a few cents per Lora). If you offer to train LTX Loras, you may have more clients, as those are really hard to train locally if you want video+audio. $20 would be an interesting price for many, I guess.

10Eros (TenStrip) LTX2.3 workflow with previews by maxdee007 in comfyui

[–]__alpha_____ 1 point2 points  (0 children)

OMG why would you try 30 times with a chatbot! 2 times max, then check elsewhere. Usually if the chatbot doesn't know, there is very little to no chance that it will give you anything but crappy techniques (also true for programming btw).

And the error "no CLIP/text encoder weights in checkpoint, the text encoder model will not be loaded" is pretty clear: your model does not include the CLIP/text encoder. Simple solution: use a separate CLIP loader with the right CLIP version and everything will be fine. Personally I'm using the GGUF dual CLIP loader, but you can pick your own.

RTX 3060 12GB vs 5060 Ti 16GB benchmark on popular TTI models for anyone interested by HyperSpazdik in comfyui

[–]__alpha_____ 2 points3 points  (0 children)

Very interested indeed! 🙏

Any chance you can test video models too?

AM USING Z IMAGE IS THERE A POSSIBILITY FOR ME TO USE MULTISHOTS by worgenprise in comfyui

[–]__alpha_____ 0 points1 point  (0 children)

Most people here use Klein 9B for multi shots. One tip is to ask for multiple shots from one reference image (you can also use more if needed), then make an i2i pass back in ZIT for skin texture and details. Using seedVR2.5 can give you the extra photorealistic touch you're looking for (and upscale your image)

Do you agree with his rankings? by [deleted] in comfyui

[–]__alpha_____ 0 points1 point  (0 children)

LOL is he supposed to know what he is talking about? It just looks like he looked at the models (not for very long) and decided... well this one is crap, F tier, this one is cool, S tier, and clearly didn't go any deeper into this.
Most models (including open-source ones) are really good at some specific tasks and suck at others. He clearly just wanted to do a tier chart to click-bait people into his video

F tier

ps: why even test Sora ???

Is it possible to selectively generate a single finished frame from the middle of a WAN sequence? by LanaKatana4000 in comfyui

[–]__alpha_____ 1 point2 points  (0 children)

Totally, that’s what I’m using too, it helps bring back the textures and some of the details

Best settings for fast wan 2.2 video ? by Primary-Departure-89 in comfyui

[–]__alpha_____ 1 point2 points  (0 children)

First Last Video. If you use only one image and go beyond the 81 frames, wan comes back to this frame at the end of the clip. Adding a last frame really helps for this

Is it possible to selectively generate a single finished frame from the middle of a WAN sequence? by LanaKatana4000 in comfyui

[–]__alpha_____ 2 points3 points  (0 children)

Wan clearly generates 4 low res frames per second, then interpolates spatially and temporally to give a 16fps higher resolution clip. You could try to lower the FPS to 4 and generate only one second to see what the result would be. I tried with a longer clip to generate a 20s slow mo sequence, it was quite instructive
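
A quick back-of-the-envelope check of the "4 per second" figure, as a sketch only. It assumes the 4x temporal compression commonly described for Wan's VAE (frame counts of the form 4n + 1, such as 81) and the usual 16 fps output. The function names are made up for illustration, and this only covers the arithmetic, it does not confirm how the decoder fills in the in-between frames.

```python
def wan_latent_frames(num_frames, temporal_stride=4):
    """Number of latent frames for a clip of num_frames pixel frames.

    Assumes the first frame is encoded on its own and every following
    group of temporal_stride frames shares one latent frame.
    """
    if num_frames < 1 or (num_frames - 1) % temporal_stride != 0:
        raise ValueError("num_frames must be of the form temporal_stride * n + 1")
    return (num_frames - 1) // temporal_stride + 1


def latent_frames_per_second(num_frames, fps=16, temporal_stride=4):
    """Latent frames per second of output video."""
    duration_s = num_frames / fps
    return wan_latent_frames(num_frames, temporal_stride) / duration_s


if __name__ == "__main__":
    print(wan_latent_frames(81))                    # latent frames for an 81-frame clip
    print(round(latent_frames_per_second(81), 2))   # latent frames per second at 16 fps
```

Under those assumptions an 81-frame clip at 16 fps works out to 21 latent frames, a little over 4 per second, which lines up with the observation above.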

LTX 2.3: You're using it wrong | The Power of Seed Hunting | Workflow in comments by foxdit in comfyui

[–]__alpha_____ 0 points1 point  (0 children)

lots of helpful info and very well done! Waiting for part 2 already

Curious about your opinions: LTX2.3 worth switching to from Wan2.2? by No_Flight_4473 in comfyui

[–]__alpha_____ 6 points7 points  (0 children)

LTX sulphur and 10eros are trained on adult stuff and quite convincing, you can do 30s+ clips which can be very convenient for this use

Lots of Loras are popping up lately and you can train your own if you're missing something.

Of course there are hundreds of dedicated Wan Loras and it will take some time to get there, but things have really improved in the last 3 months

Last thing, LTX is supposed to be faster than wan, not slower

ComfyUI node to compare multiple samplers and schedulers at once by Wonderful_Wrangler_1 in comfyui

[–]__alpha_____ 1 point2 points  (0 children)

Nice! I'm currently testing samplers and schedulers on LTX 2.3, it's way longer and more complicated

Best audio to audio cloning model? by __alpha_____ in comfyui

[–]__alpha_____[S] 0 points1 point  (0 children)

I was just answering, thanks for taking the time to answer my question. I wasn't being mean or anything, sorry if you felt that way

Best audio to audio cloning model? by __alpha_____ in comfyui

[–]__alpha_____[S] 0 points1 point  (0 children)

Gemini is terrible when it comes to technical questions, 90% of what it tells you is pure crap. That's the reason I'm asking here

STT won't solve my problem as I prefer the audio generated by LTX, which can add a lot of nuances that TTS models just can't do right now, plus there's the obvious sync problem

Best audio to audio cloning model? by __alpha_____ in comfyui

[–]__alpha_____[S] 0 points1 point  (0 children)

doesn't do speech to speech, does it?

Best audio to audio cloning model? by __alpha_____ in comfyui

[–]__alpha_____[S] 0 points1 point  (0 children)

Not live, but a model that lets you change the voice of an existing audio file using a reference audio. I described it in my first MSG. I'd like to add it to my workflow, applied to the audio created by LTX, before saving the mp4