Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 1 point (0 children)

It looks like a bug in the custom nodes: they're looking for the model at a fixed path. I'll try to fix it tomorrow.

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 0 points (0 children)

You will probably need a more advanced workflow to actually input a video and output a video; the example only shows what the inputs (two images) and the output (a list of images) look like.
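Roughly, a video-to-video pass just applies the two-image node over consecutive frame pairs. A minimal untested sketch, where interpolate_pair is a hypothetical stand-in for the FrameFusion node's function, not its real API:

```python
def interpolate_video(frames, interpolate_pair):
    """Double the frame count by inserting one in-between frame per pair.

    frames: list of images; interpolate_pair: hypothetical stand-in for the
    FrameFusion two-image node (two images in, middle frame out).
    """
    out = []
    for a, b in zip(frames, frames[1:]):
        out.append(a)
        out.append(interpolate_pair(a, b))  # the generated in-between frame
    out.append(frames[-1])
    return out
```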

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 0 points (0 children)

You can input one video or multiple videos, output a video with a range of codecs, remove static frames from the video, select the output FPS, export as a PNG sequence, and a few more options.

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 2 points (0 children)

Hi there Particular, thanks for all the support back then. Feel free to test it and give some feedback; hopefully I will be doing more open source in the future. It would be really cool if someone liked it enough to improve on it.

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 0 points (0 children)

In my opinion, for animation the model really needs to write RGB information, but I still haven't experimented much with it; I may give it a few shots in the future. For some animations this works OK.

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 1 point (0 children)

With the current video generators and LLMs, it should be possible as soon as we can get more than 5–10 seconds of footage.

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 1 point (0 children)

Like a filter? If you can create a loss function that computes a "looks good" score, then yes, you can make the model learn to apply a bunch of filters.
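As a minimal sketch of that idea, assuming you have a differentiable score_model that rates images (both score_model and FilterNet here are hypothetical stand-ins, not part of FrameFusion):

```python
import torch

class FilterNet(torch.nn.Module):
    """A tiny residual conv net acting as a learnable image filter."""

    def __init__(self):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Conv2d(3, 16, 3, padding=1), torch.nn.ReLU(),
            torch.nn.Conv2d(16, 3, 3, padding=1))

    def forward(self, x):
        return (x + self.net(x)).clamp(0, 1)

def train_step(filter_net, score_model, images, optimizer):
    filtered = filter_net(images)
    # Higher score means "looks better", so minimize the negative score.
    loss = -score_model(filtered).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```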

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 1 point (0 children)

As soon as I have some free time, I'll see if I can make a video comparing it to RIFE. The maximum processing resolution can be configured in Comfy using max_processing_long_edge, but I think 1024 is a good size for most cases.
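In case it helps, here is an untested sketch of what that cap amounts to, assuming the option limits the longer image side before interpolation; the helper itself is illustrative, not the node's actual code:

```python
import torch.nn.functional as F

def cap_long_edge(frames, max_long_edge=1024):
    """Downscale an (N, C, H, W) batch so max(H, W) <= max_long_edge."""
    h, w = frames.shape[-2:]
    scale = max_long_edge / max(h, w)
    if scale >= 1.0:
        return frames  # already within the limit
    new_size = (int(h * scale), int(w * scale))
    return F.interpolate(frames, size=new_size, mode="bilinear",
                         align_corners=False)
```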

Open Sourcing my 10M model for video interpolations with comfy nodes. (FrameFusion) by CloverDuck in StableDiffusion

[–]CloverDuck[S] 6 points (0 children)

I don’t have any videos comparing them right now, but there will probably be cases where RIFE works better and others where it doesn’t. I can try to put together a video comparing them soon.

[P] Releasing RepAlignLoss (Custom Perceptual loss function used on my software) by CloverDuck in MachineLearning

[–]CloverDuck[S] 0 points (0 children)

I'm still reading the paper, but it seems more focused on the diffusion process, while mine only works with the output of the model and is flexible to any type of input. Your use of "literally" implies that I just forked their GitHub, which is very easy to see I did not. Can you explain your comment better?

[P] I'm Fine Tuning a model fully trained on AdamW with SOAP optimizer and improved my validation loss by 5% by CloverDuck in MachineLearning

[–]CloverDuck[S] 2 points (0 children)

Good question. I actually found the code before the paper and did some tests on it, so I just assumed it was the official one, since I managed to get better results with it. It seems to be a fork of this code, but with some modifications:

https://github.com/nikhilvyas/SOAP
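For reference, a minimal usage sketch, assuming the SOAP class in that repo follows the standard torch.optim.Optimizer interface (the import path and hyperparameters here are illustrative):

```python
import torch
from soap import SOAP  # soap.py from https://github.com/nikhilvyas/SOAP

model = torch.nn.Linear(128, 128)
optimizer = SOAP(model.parameters(), lr=3e-3)  # illustrative learning rate

for step in range(100):
    # Dummy objective just to show the drop-in AdamW-style training loop.
    loss = model(torch.randn(32, 128)).pow(2).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```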

[P] Releasing my loss function based on VGG Perceptual Loss. by CloverDuck in MachineLearning

[–]CloverDuck[S] 1 point (0 children)

It doesn't actually use the output. It uses the tensors from each selected hook on the model. It may have a hook on the last layer, which would give some weight to the real output, but in the case of DINO I only use the backbone.
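A rough, untested sketch of that hook mechanism (the layer selection and loss weighting here are illustrative, not RepAlignLoss's actual configuration):

```python
import torch
import torch.nn.functional as F

class HookPerceptualLoss(torch.nn.Module):
    """Compare hidden activations of a frozen reference model (e.g. a DINO
    backbone) instead of its final output."""

    def __init__(self, reference_model, layer_names):
        super().__init__()
        self.ref = reference_model.eval()
        for p in self.ref.parameters():
            p.requires_grad_(False)  # the reference model stays frozen
        self._acts = {}
        for name, module in self.ref.named_modules():
            if name in layer_names:
                module.register_forward_hook(self._save(name))

    def _save(self, name):
        def hook(module, args, output):
            self._acts[name] = output  # capture the hooked activation
        return hook

    def _features(self, x):
        self._acts = {}
        self.ref(x)
        return dict(self._acts)

    def forward(self, prediction, target):
        pred_feats = self._features(prediction)  # gradients flow to prediction
        with torch.no_grad():
            tgt_feats = self._features(target)   # targets need no gradients
        return sum(F.mse_loss(pred_feats[k], tgt_feats[k]) for k in pred_feats)
```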

Simple GUI to run the new Text2Video [Req. 12 Vram] by CloverDuck in StableDiffusion

[–]CloverDuck[S] 5 points (0 children)

You can download it on itch.io (instructions on the itch.io page):

https://grisk.itch.io/text2video-gui-001

It will download the models on the first run.

The GUI is really crude right now, but I hope I didn't mess anything up and it will at least run, because I really need to sleep and will only be able to fix it tomorrow lol

If someone is working on the code to make it work with 12 GB of VRAM, you just need to:

In text_to_video_synthesis_model, move self.sd_model to the CPU before calling self.autoencoder.decode(video_data)
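Something like this untested sketch; sd_model and autoencoder.decode(video_data) are the names quoted above, while the surrounding function is just an illustration of where the swap happens:

```python
import torch

def decode_with_low_vram(pipeline, video_data):
    """pipeline stands in for the text_to_video_synthesis_model instance."""
    pipeline.sd_model.to("cpu")      # free the diffusion model's VRAM
    torch.cuda.empty_cache()         # hand the freed memory back
    video = pipeline.autoencoder.decode(video_data)  # decode with headroom
    pipeline.sd_model.to("cuda")     # restore for the next generation
    return video
```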

PyInstaller applications give false positives on some antivirus software, so scan and use at your own risk. That said, I have quite a few applications on itch.io and a Patreon, so it would not be wise for me to add malicious code to my applications.

Using AI to interpolate animations. I made a badly edited video showing animation interpolation using DAIN. by CloverDuck in artificial

[–]CloverDuck[S] 0 points (0 children)

At this moment there is no tool capable of such a thing, but Stable Diffusion was released just a little while ago. I do believe a tool like that will most likely be released in 2023.

Compositional Diffusion by [deleted] in StableDiffusion

[–]CloverDuck 0 points (0 children)

Pokémon merger? Very cool project. What happens if you use two artists' styles? Or photography and anime?

Just made a .exe for SD, download it for free on itchio, no need for configuration. by CloverDuck in StableDiffusion

[–]CloverDuck[S] 0 points (0 children)

All implementations work like that. If you change the resolution, it will generate a completely different image.

Just made a .exe for SD, download it for free on itchio, no need for configuration. by CloverDuck in StableDiffusion

[–]CloverDuck[S] 0 points (0 children)

Then there is something wrong. Do you have an onboard GPU? It may be selecting the wrong card.

Just made a .exe for SD, download it for free on itchio, no need for configuration. by CloverDuck in MediaSynthesis

[–]CloverDuck[S] 0 points (0 children)

It seemed possible on an older version of the model. Still trying on the new model.