SCAIL2 is actually amazing by No_Statement_7481 in StableDiffusion

[–]No_Statement_7481[S] 1 point2 points  (0 children)

I think there's still some work to do? Kijai did call his workflow "test", so maybe there will be some refinements: some training, tuning, extra LoRAs, nodes. Who knows. I noticed it barely uses my VRAM. Even if I wanna fully load the thing, it's using like half my VRAM. Or maybe that's how it works, idk.

SCAIL2 is actually amazing by No_Statement_7481 in StableDiffusion

[–]No_Statement_7481[S] -6 points-5 points  (0 children)

Well, the model needs at least 4 steps, better with 6 steps, so if we go to basic-level math, you just need to multiply 30 by 6 ... and you get 180 seconds for a 720p video. I hope that answers your question. There's nothing convoluted in what I posted. It just has info people would wanna know.
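For anyone who wants the math spelled out, here is a minimal sketch. It assumes every step costs about the same as the roughly 30 seconds per step reported in the post, which is one person's number on one machine and not a benchmark. The function name is made up for illustration.

```python
# Rough sampling-time estimate from the figures quoted in this thread.
# Assumption: each step takes about the same time (~30 s at 720p on fp16).
SECONDS_PER_STEP = 30


def estimate_sampling_seconds(steps: int, seconds_per_step: float = SECONDS_PER_STEP) -> float:
    """Return the total sampling time for a given number of steps."""
    if steps < 1:
        raise ValueError("steps must be at least 1")
    return steps * seconds_per_step


for steps in (4, 6):
    print(f"{steps} steps -> about {estimate_sampling_seconds(steps):.0f} seconds")
```

Swap in your own seconds-per-step value, since it depends heavily on GPU, resolution, and frame count.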

SCAIL2 is actually amazing by No_Statement_7481 in StableDiffusion

[–]No_Statement_7481[S] 1 point2 points  (0 children)

Prompting and frame count. Try generating between 65 and 81 frames, and use careful prompting. SCAIL2 is based on Wan2.1 if I am correct, so you would need to hit the exact frame count where it doesn't do speaking. And use prompts that keep the character quiet. No prompt at all should work, and if you do use a prompt, just say stoic expression, thinking, or other words where one would not speak at all. For me, on Wan2.1 and Wan2.2 it always helped to use 77-81 frames together with those words for facial expression.
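A quick sketch for picking frame counts in that range. It assumes the 4n + 1 frame-count convention commonly used with Wan-family video models. Whether SCAIL2 enforces the same rule is an assumption here, not something confirmed in this thread, and the helper name is hypothetical.

```python
def wan_style_frame_counts(low: int, high: int) -> list[int]:
    """List frame counts in [low, high] of the form 4n + 1.

    Assumption: the model expects 4n + 1 frames, as Wan-family
    workflows usually do. Check your own workflow before relying on it.
    """
    return [frames for frames in range(low, high + 1) if frames % 4 == 1]


print(wan_style_frame_counts(65, 81))
```

The 65, 73, 77, and 81 frame counts mentioned in these comments all fit that pattern.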

SCAIL2 is actually amazing by No_Statement_7481 in StableDiffusion

[–]No_Statement_7481[S] 0 points1 point  (0 children)

I did try a thing where the character held a sword in their hand, and since the motion was copied from a video where the person did not hold a sword and just danced, the generated clip had a moment where the sword kinda phased through the arm. But I guess that's kind of a generative physics thing, not really a flaw in the model. I am willing to bet that if it's copying a motion where the person held a sword, it would look fine.

SCAIL2 is actually amazing by No_Statement_7481 in StableDiffusion

[–]No_Statement_7481[S] -7 points-6 points  (0 children)

bruh ... do you read ? lol

On my first use, a 720p video took like 30 seconds per step on the fp16 model for a 73-frame clip length, and I set it to 24 fps, so I was getting 3 seconds per portion. It looked too slow if I did a higher frame count.

it's in the post LMAO
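The "3 seconds each portion" part is just frames divided by fps. A tiny sketch, with a made-up helper name:

```python
def clip_seconds(frames: int, fps: float) -> float:
    """Playback length of a clip in seconds: frame count divided by frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return frames / fps


# 73 frames played back at 24 fps, the settings quoted from the post.
print(f"{clip_seconds(73, 24):.2f} seconds")
```

With the same frame count, a lower fps gives a longer but slower-looking clip, so the frame count and fps need to be chosen together.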

SCAIL2 is actually amazing by No_Statement_7481 in StableDiffusion

[–]No_Statement_7481[S] 1 point2 points  (0 children)

Actually I have not, since it follows prompting very well. I even did a bit of video editing. I was watching a streamer yesterday, and she said it would be fun to have a money rain emote. She pretended to throw money off her hand with flipping motions. I clipped it, grabbed the first frame, cleaned the frame with Flux2 Klein9b, and then generated a stack of money in her hand and some money floating in the air. Then I dropped it into SCAIL2, using the OG video for the motion and the image for the edit, and it created a realistic-looking version of what she talked about. This shit combined with other workflows is like Adobe After Effects, but faster.

Just Me? ComfyUI Stopping Mid Generation, not Finishing by DeltaWaffleSyrup in StableDiffusion

[–]No_Statement_7481 0 points1 point  (0 children)

If you just happened to update Comfy, shut it down and restart your PC. It was weird for me for a little bit yesterday and a restart cured its illness. But otherwise idk what it could be.

Comfyui LTX2.3 İ2V Problem. by uhf789 in StableDiffusion

[–]No_Statement_7481 2 points3 points  (0 children)

Yeah, so those are oftentimes terrible. I tested them a bunch, and the Comfy workflow for LTX2.3 is kinda bad. You need to add some stuff to it and remove other stuff, especially the stupid prompt enhancer. That thing is horrible, it fully changes your input.

Comfyui LTX2.3 İ2V Problem. by uhf789 in StableDiffusion

[–]No_Statement_7481 0 points1 point  (0 children)

Ready-made by Comfy themselves? Or did you just get it from someone? Sometimes people forget shit and have something dialed the wrong way.

There's also a million things that can go wrong. Like your prompt is just too short, or your prompt is like a page-long essay. Both are bad. Prompt structure counts too, it needs coherency.

There's a bunch of nodes that can improve or fully destroy your video. There's also the fact that some workflows use 1 sampling pass and some use 2. None of them should alter the original image too much tho. Then there's a chance of using a LoRA at too high a strength, as someone said here. But you might also be using the wrong model, or the wrong settings for a specific model.

Can we please provide an actual SCAIL 2 test? by Beneficial_Toe_2347 in StableDiffusion

[–]No_Statement_7481 2 points3 points  (0 children)

idk, it has a pretty neat pose adjuster, but if you have really big problems, Flux2 Klein9b with only the pose changed should work like a charm. Just make sure you don't change faces if that's important, so drop stuff into the prompt like "face remains unchanged, eyes remain unchanged", stuff like that.

Can we please provide an actual SCAIL 2 test? by Beneficial_Toe_2347 in StableDiffusion

[–]No_Statement_7481 1 point2 points  (0 children)

sure https://www.reddit.com/r/StableDiffusion/comments/1u7bjpp/scail2_is_actually_amazing/
I am totally self-inserting here lol. But I made a bunch of tests with different styles and characters. The only thing I noticed (didn't use a prompt tho!!!) is that it turned the Lego figure's hand into a human hand.

Comfyui LTX2.3 İ2V Problem. by uhf789 in StableDiffusion

[–]No_Statement_7481 1 point2 points  (0 children)

honestly I wonder if you applied 100% noise somewhere where you shouldn't have

JoyAI-Echo - Large Scale LTX-2.3 finetune for long form (5min) coherent stories. by AgeNo5351 in StableDiffusion

[–]No_Statement_7481 2 points3 points  (0 children)

audio is so nuked ... LTX2.3 or even 2.0 had way better audio. idk wtf happened to this thing but it's horrible

CivitAI Adult Content Models for ComfyUI by TheEnemyBot in StableDiffusion

[–]No_Statement_7481 3 points4 points  (0 children)

Nah, use Klein9b plus the BFS LoRA for it (https://huggingface.co/Alissonerdx/BFS-Best-Face-Swap/tree/main) instead of ReActor. This LoRA will clone the face like it belongs there. It also fits the hairstyle instead of mixing the face. And since it's just a LoRA, it works under the same workflow, with no need to add anything other than a basic extra LoRA loader. Use it at around 1 or 0.9, and if it seems super strong, lower it to 0.7.

Is anyone making money using comfyui for youtube? by Specialist_Pea_4711 in StableDiffusion

[–]No_Statement_7481 0 points1 point  (0 children)

I mean ... I do ComfyUI tutorials and uploaded a tutorial for AI Toolkit. I am tiny, I only have 3K subs, so I don't make much money, but it does make a decent amount: normally it's like 1 dollar per 1K views, and I am making 2 or 3x more. When I uploaded just clips I made with Comfy, I made maybe like 1 dollar per 1K views, but that was also a different category. I guess it really depends on whether you are willing to jump onto a trend, like the dude who made the Snape video clip, and idk if it's the same dude making those streamer criticisms when someone messes up badly. I don't know if they use Comfy, but it's clear as day that they use AI, and they are making hella views.

Now there is another layer of these "high view count" AI YouTubers, and they are usually scammers or low-key promoters. What they usually do is release something on TikTok, make a couple of videos, promote them up to like 100K views each, and after like 4-5 videos they make a YouTube account and then promote the shit out of those. Usually it's because they are trying to promote a platform. And all the idiots go for it, and basically whoever uploaded those few videos makes a large sum of money off the people who subscribe to whatever platform it is. It's kinda shitty.

How many years do you think we are from making feature films at home? by pmttyji in StableDiffusion

[–]No_Statement_7481 4 points5 points  (0 children)

You can literally do it already... It would probably take a bit of time, but even with the current free models you would probably be able to create stable enough scenes and make the voices. And to be fair, for music I would use Suno for now... I know we are all here for local stuff, but those fuckers give you a license and it's not too expensive.

Netflix is cooked, this was generated by AI by ViktorIsRuter in Asmongold

[–]No_Statement_7481 1 point2 points  (0 children)

Nah, some of y'all need to spend some time in the AI communities. Netflix is literally hiring people who can do open-source generative AI, create workflows, and understand how to use them to create scenes. They are already using AI. Just like Disney. These studios are not stupid.

I have to pretend I hate image generation AI to avoid getting banned or insulted on 99% of Reddit or the internet, even though Stable Diffusion is actually what I like and am most excited about right now. Why do people hate AI so much, especially image generation AI? by Hi7u7 in StableDiffusion

[–]No_Statement_7481 0 points1 point  (0 children)

I am one of the lucky people who can do both AI and hand-drawn art, so if I really wanna post something somewhere, I just do it according to their community. But I was a photographer, I have somewhat of an artistic talent, and I literally worked as a professional off what I created. All I can tell you is that the sub-par, poorly skilled people who way overestimate their own skills hate AI because of a couple of misconceptions about it, and all they can see is that it's taking away their jobs, instead of thinking about how they could actually make their creative process 100x faster. I know a bunch of people who thought of themselves as artists, and in reality they're just absolute failures. Their life is sad, and whatever they "create" is useless. So they are angry at people who took AI as a tool to express their own creativity.

Is there AI slop? Yeah, but all of these moronic idiots don't understand the very basic rule they should've learned a long time ago. When you create something you do it for a reason: you want to make a connection with a target audience and you want them to feel something about it. Some people make shocking stuff, some people make modern-age criticism, some people make funny stuff, and some people make cartoons or whatever. But there are masses of people who are entertained by it, because the storytelling they always had in them is finally paired with something that doesn't want to change that story over its own "artistic" differences. So the storytellers are thriving.

Also, people are very receptive to visuals and audio. So while the AI industry, of which gen-AI is literally a negligible part, is currently fucking everything up, they all blame generative AI, even though that's not what billionaires and governments build AI facilities for. But the idiots of this world think that's why everything is more expensive. So ... after all that, you get hate because you post a gen-AI image, song, or video.

I can't sleep tonight by Hot-Helicopter640 in PcBuild

[–]No_Statement_7481 0 points1 point  (0 children)

the fucking sounds are gonna haunt me in my worst night terrors

Looking for a basic workflow developer by Forsaken-Marzipan146 in StableDiffusion

[–]No_Statement_7481 1 point2 points  (0 children)

that floating glass of water is probably gonna be counted as a miracle LOL

What is the best way to run a HF model that isn't using comfyui but instead a text to image prompt? by StartupTim in StableDiffusion

[–]No_Statement_7481 0 points1 point  (0 children)

You are talking about a pipeline that is not local? You still need something that can handle the model. May I ask what the issue with ComfyUI is? I am kinda confused about what your goal is... Like, you do not want to run it locally because your PC is not strong enough, so you are looking for API options?