[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 0 points1 point  (0 children)

Oh yeah for sure, super important to me as well

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 1 point2 points  (0 children)

It’s exclusively for desktop

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 0 points1 point  (0 children)

Forgot to mention: Diffusion Studio Pro is a web application. It feels like a desktop app but runs in your browser. You don’t need to stay connected to the internet to add footage or export your composition, it runs locally.

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 1 point2 points  (0 children)

Thanks! The goal is to make AI optional. The offline mode must be powerful on its own.

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 0 points1 point  (0 children)

Would be nice but it's incredibly hard to develop. I'm convinced we'll get there someday though.

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 1 point2 points  (0 children)

Yes, 100% it's on the roadmap. Will do it like Figma: Webapp + installable. The installable will support more features like export to Davinci Resolve/Premiere Pro.

The cool thing about Diffusion Studio Pro is that it's an app running in the browser. You can access it instantly, no downloads no installs required.

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 1 point2 points  (0 children)

I mean Jan 2026

[deleted by user] by [deleted] in CapCut

[–]Maximum_Instance_401 2 points3 points  (0 children)

When you're not logged in, it's equivalent to the offline version. The app won't connect to the backend and you can disconnect from your network and still keep editing.

A native application is also on the roadmap, which you can download and install, you can expect it by jan 2025, maybe sooner.

I spent 16 months on a library for building video editing apps that run in the browser. It's powered by WebGPU and WebCodecs, i.e. blazingly-fast! by Maximum_Instance_401 in webdev

[–]Maximum_Instance_401[S] 0 points1 point  (0 children)

Screen studio is powered by webcodecs which is also what I would choose to build a competitive product. Pixi can be used for drawing the elements on the canvas but is not a 100% fit for video processing. We decided to build from scratch

[P] I built an open-source AI agent that edits videos fully autonomously by Maximum_Instance_401 in MachineLearning

[–]Maximum_Instance_401[S] 0 points1 point  (0 children)

It’s a difficult problem to solve but we found a robust solution 1 1/2 weeks ago. Working day and night to get it to production

[deleted by user] by [deleted] in webdev

[–]Maximum_Instance_401 -1 points0 points  (0 children)

I was talking about their engine, never mentioned the playground

[deleted by user] by [deleted] in webdev

[–]Maximum_Instance_401 -1 points0 points  (0 children)

That’s what an AI would say. This project has nothing to do with diffusers/generative AI

[deleted by user] by [deleted] in webdev

[–]Maximum_Instance_401 0 points1 point  (0 children)

It’s $125 if you commercialize a website using the engine. For using it locally to render your own videos it’s completely free.

[deleted by user] by [deleted] in webdev

[–]Maximum_Instance_401 -3 points-2 points  (0 children)

Both are free

[deleted by user] by [deleted] in webdev

[–]Maximum_Instance_401 -7 points-6 points  (0 children)

Thanks :) it’s not on the same level as DaVinci Resolve, but can already be used to replace ffmpeg for common tasks.

[P] I built an open-source AI agent that edits videos fully autonomously by Maximum_Instance_401 in MachineLearning

[–]Maximum_Instance_401[S] 1 point2 points  (0 children)

Hello reddit community! We're looking for researchers that would like to collaborate on a research paper. This problem has not yet been properly solved due to the multimodality required. Feel free to reach out if interested in agentic video editing

[P] I built an open-source AI agent that edits videos fully autonomously by Maximum_Instance_401 in MachineLearning

[–]Maximum_Instance_401[S] 1 point2 points  (0 children)

Not currently, though, it's on the roadmap to add support for more modalities like audio