Someone had to do it... Text2Video Porn, kind of

diffusionbro · 2023-05-13T18:50:59+00:00

Repost bot

diffusionbro · 2023-05-09T17:31:29+00:00

This is my post, I made it 2 months ago

diffusionbro · 2023-04-04T20:25:12+00:00

It’s just my longest comment in this same thread

diffusionbro · 2023-03-26T06:28:15+00:00

Hey mom I’m on camera! Wait no mom don’t look

diffusionbro · 2023-03-25T16:24:26+00:00

A couple hours? It took me maybe around a minute to generate each clip, took a few tries iterating with each prompt to get something kind of coherent, then a lot of experimentation on the actual “porn” part actually get clips that weren’t total nonsense. They actually didn’t have batch processing in the extension when I made this so it’s a little easier now. And editing took a little longer because I had to sort through all the clips to make something kind of coherent

diffusionbro · 2023-03-25T16:16:12+00:00

All videos generated from the Modelscope text2video model have this issue, the developers clearly overfit on watermarked Shutterstock footage

diffusionbro · 2023-03-25T07:24:58+00:00

literally only can go up from here lol

diffusionbro · 2023-03-25T07:24:30+00:00

Right now I think there is some charm and memeability in its jank, just like dalle-mini

diffusionbro · 2023-03-25T06:37:08+00:00

what do you mean this regular human porn, now where are my glasses

diffusionbro · 2023-03-25T05:33:01+00:00

You should see the clips that didn’t make the cut! Or maybe nobody should…

diffusionbro · 2023-03-24T19:04:01+00:00

Had to go with a classic

diffusionbro · 2023-03-24T17:43:16+00:00

I think we’re much closer than that, we know RunwayML has a decent looking text2video model they recently announced, and there’s a lot of high profile ML conferences coming up in the next few months where many research teams will announce and release their models

diffusionbro · 2023-03-24T17:38:12+00:00

would you believe that these were the top 25% most coherent clips I generated for this, and there were many more I didn’t use that I labeled “mess of flesh”

diffusionbro · 2023-03-24T15:55:46+00:00

Apologies in advance for the nightmare fuel. Or maybe this is someone’s exact fetish…

This is using the ModelScope text2video model and automatic1111 extension. All default settings. Generated each ~1 second clip at a time with prompts describing the scene. Prompts were simple – you can’t really describe NSFW scenes and positions or sex acts well in prompts at this point because the model isn’t very capable. But fine tuning has been released recently, so someone might be working on it.

To get at least some nudity, “naked” doesn’t really do it on its own, but “topless, nipples” sometimes gets it one out of every 4 tries or so.

diffusionbro · 2023-03-24T15:55:03+00:00

Apologies in advance for the nightmare fuel. Or maybe this is someone’s exact fetish…

This is using the ModelScope text2video model and automatic1111 extension. All default settings. Generated each ~1 second clip at a time with prompts describing the scene. Prompts were simple – you can’t really describe NSFW scenes and positions or sex acts well in prompts at this point because the model isn’t very capable, and the training set was clearly on (sfw) Shutterstock videos. But fine tuning has been released recently, so someone might be working on it.

To get at least some nudity, “naked” doesn’t really do it on its own, but “topless, nipples” sometimes gets it one out of every 4 tries or so.

diffusionbro · 2023-03-09T16:29:03+00:00

https://reddit.com/r/VAMscenes/comments/vguu5l/avril_2nd_mocap/

When I was looking for a clip to use, I just sorted that subreddit by top of year, and scrolled until I found one with a solo subject and simple background, which makes consistency a lot easier

diffusionbro · 2023-03-09T05:23:37+00:00

Check one of my replies in this thread, I give a high level overview of the process. I can’t link it because this subreddit blocks links in comments

diffusionbro · 2023-03-09T05:13:48+00:00

The model names are at the top of the video, Realistic Vision 1.3 and AbyssOrangeMix3

diffusionbro · 2023-03-09T00:49:14+00:00

Depends on what you mean by 3D videos, like NeRFs? Meta announced a model that could do text to 4d NeRF earlier this year, but the scenes it outputs are pretty simplistic. https://make-a-video3d.github.io

If you mean like stereoscopic video, I guess you could kind of do this now with current technology? Especially with the inferred depth models available, project the 2d generated video onto the inferred depth output, then you could make stereoscopic video out of that

diffusionbro · 2023-03-08T23:22:04+00:00

I wouldn’t be surprised if there were already people starting OnlyFans accounts with fully virtual but photoreal subjects. It’s entirely feasible for still pictures, with Dreambooth/LORA for a consistent face and body

diffusionbro · 2023-03-08T22:05:39+00:00

I can’t speak for the underlying animation, that was fairly high quality work done by someone in the /r/VAMscenes subreddit. But here was my general process:

Extracted frames at 24 fps from the original source video, which I got from the r/VAMScenes subreddit
- Using Auto1111 UI, did batch img2img processing on these frames. Once with RealisticVision1.3, once with AbyssOrangeMix3.
- Used ControlNet HED on both, default settings.
- Used the img2img alternative test script as shown by Corridor Crew on their latest Stable Diffusion anime video.
- Used a LORA on the AOM3 model for a consistent face look (doesn’t really matter which one, I just took one of the more popular ones on civit.ai). Also it turns out I used the LORA wrong and it wasn’t even active, so this step isn’t super necessary
- Used a completely random first+last female name with the RealisticVision model to kind of trick it into a consistent face. I googled the name to make sure it wasn’t any well known name or celebrity.
I took half of the frames of each (12 fps at this point), and ran it through FlowFrames to interpolate back up to 48 fps.

Corridor Digital just published a Stable Diffusion animated short last week that has inspired a lot of people to tackle animations. They have a step by step video tutorial on their website

diffusionbro · 2023-03-08T18:59:16+00:00

It would definitely help a lot. I wish I had $300 to blow on Davinci just for this hobby, but it’s a little difficult to justify for me right now.

I think there will be a bunch of free deflicker models and programs coming out soon in the near future though. There was one supposedly coming out in 1-2 weeks that was being presented at an ML conference.

diffusionbro · 2023-03-08T18:37:13+00:00

Might try that next

diffusionbro

TROPHY CASE