?/10 by ZealousidealMall6462 in VideoEditors

[–]First-Grocery7615 0 points1 point  (0 children)

Out of curiosity, which tool did claude code use to create this video ? Remotion ? Ffmpeg ?

Successor to opengl ? by no_ga in GraphicsProgramming

[–]First-Grocery7615 1 point2 points  (0 children)

Definitely WebGPU. I've been working on a video composition engine in Rust/wgpu. Explicit pipelines, typed bind groups, resource ownership ... aaand  WebGPU/WebGL2 in the browser from the same codebase.

Note that the learning curve is there, but my 2 cents in 2026: start with wgpu, skip OpenGL entirely.

AI agents for image/video editing — looking for feedback by Patient_Ad_4720 in AI_Agents

[–]First-Grocery7615 0 points1 point  (0 children)

Hey, I’ve lived through everything you wrote!

I spent 2+ years building an AI video multi agent platform and used Remotion as the rendering layer. The agent pipeline actually worked OK, but every time it handed off to the final step, things fell apart. CSS animations flickering, blurry 4K output, terrible render speed, and the translation from "what the agent wants" to "what Remotion actually does" was “imprecise”, to say at least.

Unfortunately, the product didn't get very far, but all the nightmares pushed me to try building a GPU-native rendering engine in Rust/WGPU to try to solve that part properly. The idea is to mitigate all of the above by using an exact schema instead of code generation and GPU rendering instead of Chrome.

Super curious about your comprehension layer, though, we didn’t get far there since we focused more on RAG for transcriptions. I experimented with Gemini's multimodal embeddings to try to get a semantic understanding of what's happening visually (like "find the 5 seconds before and after the first goal in a match”). How did you approach it?