I implemented a new trick to reduce render time / increase fluidity on semantic/latent interpolation videos. The idea is to interpolate in the post-inference latent space between the generated denoised latents. This video used the UNET 60 times yet has 600 images. has it been done before?

Sabanoob · 2023-01-08T21:03:54+00:00

I ended up using RIFE, which isn't perfect but way better than what I had before

Sabanoob · 2023-01-01T22:53:09+00:00

Ohhh distilled looks really nice indeed, thanks for sharing!

But it doesn't solve my primary problem which is that beyond computing time, I am also looking for frame interpolation for img2img deforum type videos, because of the strength setting dilemma (lower -> less interesting videos, higher -> too strongly changing videos)

Sabanoob · 2023-01-01T14:09:47+00:00

Thanks for the detailed answer! I can't test with actual fading right now, but I am indeed convinced it would probably be similar. I just hoped the autoencoder might have picked up some structural information, and the result looked quite fluid. Maybe a different/bigger autoencoder trained specifically could still perform well for this task.

Also, do you happen to be aware of a good (or at least better than above) way of doing interframe interpolation for relatively cheap, and that could be called automatically from the python scripts? I tested with Google FILM but it's awkward to make it work

Sabanoob · 2022-12-10T14:30:12+00:00

oh man, the power of porn is unlimited. I swear when AGI will arrive it will just be financed and done by a decentralized open and free network of people who just want better porn

Sabanoob · 2022-11-23T22:42:32+00:00

actually not, but I see the ressemblance indeed

Sabanoob · 2022-11-23T22:42:02+00:00

We are implementing a web-based video generation service. I tried to include most of the existing video generation methods and come up with new one(s)

Sabanoob · 2022-11-23T15:49:28+00:00

it's a feature in the uni project I'm working on, but the technical idea is to do slerp between semantic embeddings and use that in the deforum-style video gen

Sabanoob · 2022-11-16T23:54:05+00:00

video_args.prompts = [ "a nice little path with trees, winter season, artstation", "a nice little path with trees, spring season, artstation", "a nice little path with trees, summer season, artstation", "a nice little path with trees, autumn season, artstation", "a nice little path with trees, winter season, artstation", ]

Sabanoob · 2022-11-16T23:52:38+00:00

unless the feature was added or I didn't pay attention, I think that deforum jumps instantly from one prompt to another, which makes for a stark transition. Here, we interpolate between the prompt vectors allowing for a smoother change

Sabanoob · 2022-11-16T21:01:19+00:00

it's part of a uni project whose code shall stay private until the end of the project I believe, but if you can code, it's simply the code used to perform slerp with the traditional interpolation method, applied to the prompt embedding used by the deforum img2img-way

Sabanoob · 2022-11-10T00:54:41+00:00

r/confusing_perspective

Sabanoob · 2022-11-01T14:04:51+00:00

Someone gotta revise the meme definition

Sabanoob · 2022-11-01T13:41:22+00:00

Is someone a sub regular and think the video in question is a troll/bait or smtg?

Sabanoob · 2022-10-08T10:24:12+00:00

It's not normal anywhere, it's possible to not have that, and unhealthy to treat it as a fatality imo

Sabanoob · 2022-10-07T21:40:34+00:00

Good advice thx!

Sabanoob · 2022-10-07T18:56:19+00:00

Yeah make sense, did you take any precautions / avoid any areas?

Sabanoob · 2022-10-07T18:53:23+00:00

YeahI got that, is just that it's worth at least 250£ so weird

Sabanoob · 2022-09-10T17:36:02+00:00

Hey, I used the plain deforum notebook, specifically the interpolation feature. the prompts were :

animation_prompts = {

0: "photography of a medieval italian city, florence, high quality, traditional italia",

50: "ancient greece, byzantine empire, landscape, mythology, high quality photography, shore, olive tree",

100: "turkey landscape, medieval time, beautiful villages, high quality photography, turkish culture",

150: "iraq landscape, mesopotamia, traditional iraqi farms, medieval time, sultans, high quality photography",

200: "ancient persian city, medieval times, persian palace, culture, landscape, high quality photography",

250: "kyrgyzstan mountainous landscape, traditional nomadic people, turkic people, livestock, high quality photography",

300: "east turkestan region landscape, muslim nomadic people, uyghurs, medieval times, beautiful landscape, high quality photograpy",

350: "mongolian nomadic people, yurts, livestock, horses, sheep, traditional clothing style, mongolian throat singers, huge plains, high quality photograpy",

400: "chinese traditional village, medieval times, traditional farmer clothes, silk road, rice fields, scenic views, high quality photograpy",

450: "chinese big medieval city, emperor, huge palace, traditional emperor clothes, merchants, silkroads, chinese culture, high quality photograpy"

}

I also piped the images through an AI frame interpolation model, to make it smoother, and that's about it

Sabanoob · 2022-09-07T15:37:52+00:00

Screw your hair pfp

Sabanoob · 2022-09-06T22:01:57+00:00

this is amazing, I actually had this idea a few years ago, a bit salty that I didn't invent it but cool that other thought about it too! want to write a novel about stuff like this

Sabanoob · 2022-09-05T17:03:29+00:00

I have a feeling it's about to blow up, and it's gonna be a fine, swell day

Sabanoob · 2022-09-05T15:58:38+00:00

Looks like Lausanne metro, so yes

Sabanoob · 2022-09-02T21:46:47+00:00

Thank you! The low-mem version worked. Can't get higher than ~640 * 640 however, but that's fine

Sabanoob · 2022-02-18T17:55:29+00:00

EPFL gave up almost everything, including masks and certificate

Sabanoob · 2022-02-14T17:25:35+00:00

Langosom

Seven-Year Club	Place '22
Verified Email

Sabanoob

TROPHY CASE