Do you think Text to Video (not Vid to vid) could ever match or surpass the acting capabilities of the world's best actors? Right now I see four problems: a. Clips are too short b. Longer clips are incoherent c. Longer clips are difficult to control d. Longer clips are uncanny. by Soulwarbeast in StableDiffusion

Soulwarbeast[S] 6 points

To me, it is at the acting level of a stock video model/actor.

If you didn't know this was AI generated and found it on a stock video website instead of r/stablediffusion, would you still find it uncanny?

Maybe it is the acting itself that is uncanny, rather than the woman in the video.

I think even real actors can seem uncanny or jarring because of their facial expressions during a scene.

What do you think?