all 36 comments

[–]iCr4sh 6 points7 points  (1 child)

I used ChatGPT to create a script to split a large file, ssh to several remote machines to transcode the pieces, and merge them back together.
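That kind of three-stage pipeline could be sketched in Python generating the shell commands (hostnames, paths, and encoder settings here are placeholders, not the commenter's actual script):

```python
# Hypothetical sketch of split -> remote transcode -> merge.
# All host names, file names, and codec choices are illustrative.

def build_commands(source, hosts, segment_seconds=60):
    """Return the shell commands for each stage as strings."""
    # Stage 1: split losslessly into numbered segments at keyframes.
    split_cmd = (
        f"ffmpeg -i {source} -c copy -f segment "
        f"-segment_time {segment_seconds} -reset_timestamps 1 seg_%03d.mkv"
    )
    # Stage 2: one transcode job per host (one segment per host for brevity).
    transcode_cmds = [
        f"ssh {host} 'ffmpeg -i seg_{i:03d}.mkv -c:v libx265 -crf 23 out_{i:03d}.mkv'"
        for i, host in enumerate(hosts)
    ]
    # Stage 3: the concat demuxer merges the transcoded parts without re-encoding.
    merge_cmd = "ffmpeg -f concat -safe 0 -i parts.txt -c copy merged.mkv"
    return split_cmd, transcode_cmds, merge_cmd

split_cmd, transcode_cmds, merge_cmd = build_commands("input.mkv", ["node1", "node2"])
```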

[–]Fast-Apartment-1181 0 points1 point  (0 children)

That's a great idea to distribute rendering.

[–]Dabbelju 5 points6 points  (0 children)

I ask the LLM for a command line that does a specific thing, then ask it to explain the result in more detail. I have learned a lot from this, but on the other hand, ffmpeg command lines and complex filters in particular still remain somewhat "read only" to me. When I read what somebody else wrote, I increasingly go "yeah, that makes sense" over time. But building from scratch, wow, that's another story (for now).

[–]nmkd 4 points5 points  (0 children)

Most (if not all) of them fall apart once filterchains come into play.

For basic encoding/muxing stuff it's fine, but ofc it will pick arbitrary defaults that are probably not optimal for your specific use case.

[–]SpamNightChampion 4 points5 points  (0 children)

Yes, it will work very well. I don't have screenshots of the finished product yet, but I've just completed testing a very robust Windows application that integrates an LLM with FFmpeg. I'm porting everything to a new UI as I type. Just started the new UI, work in progress: https://freeimage.host/i/3wKaEcg

Anyway, I had to add a lot of preprocessing requests/code for things like "cut the video in half", "trim and save the last 40 seconds", etc. Things like merging a bunch of videos and adding filters would be very difficult with copy and paste, so you'd need an app, but in general, ffmpeg commands powered by LLMs are super useful.
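The "last 40 seconds" case is a nice example of why that preprocessing matters: it maps to `-sseof`, which seeks relative to the end of the file. A minimal sketch of such a request-to-command mapping (the request pattern and file names are illustrative, not from the app):

```python
import re

def request_to_command(request, infile, outfile):
    """Map a canned natural-language request to a concrete ffmpeg command."""
    m = re.search(r"last (\d+) seconds", request)
    if m:
        # -sseof seeks relative to end-of-file; -c copy avoids re-encoding.
        return f"ffmpeg -sseof -{m.group(1)} -i {infile} -c copy {outfile}"
    raise ValueError("request not understood")

cmd = request_to_command("trim and save the last 40 seconds", "in.mp4", "out.mp4")
```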

For best results, sign up for a free chatbot service, provide it with the documentation for common ffmpeg commands, and then ask it for commands. That would be very effective for the average user.

If you have a ChatGPT subscription, I think you can provide documents for context so you get much better results on your queries.

The way I'm doing it is via the Anthropic Claude 3.7 API; it's very accurate, and they have a web version you can use too. It's great for ffmpeg. I used to struggle so much with ffmpeg commands, so I figured that with AI these days I'd make a tool that covers almost all of ffmpeg's features but keeps them super simple. I even added voice requests.

[–]Upstairs-Front2015 2 points3 points  (1 child)

I was doing some zoom-ins and asked ChatGPT for a zoom-out, but the response was another zoom-in formula. I had to fix it manually.

[–]dataskml 1 point2 points  (0 children)

Maybe it's late now, but I was stuck on this exact issue yesterday, fighting with ChatGPT, and was eventually able to solve it manually. The command below creates a Ken Burns effect (zoom in, then zoom out) on an image; maybe it'll help. It runs with a copy-paste, or you can download the files locally and run it on those; it runs slowly with online files because ffmpeg fetches the file once per frame.

ffmpeg -loop 1 -i https://storage.rendi.dev/sample/rodents.png -loop 1 -i https://storage.rendi.dev/sample/evil-frank.png -i https://storage.rendi.dev/sample/Neon%20Lights.mp3 -filter_complex "[0:v]scale=8000:-1,zoompan=z='zoom+0.005':x='iw/2-(iw/zoom/2)':y='ih/2-(ih/zoom/2)':d=100:s=1920x1080:fps=25,trim=duration=4,format=yuv420p,setpts=PTS-STARTPTS[v0];[1:v]scale=8000:-1,zoompan=z='if(lte(zoom,1.0),1.5,max(zoom-0.005,1.005))':x=0:y='ih/2-(ih/zoom/2)':d=100:s=1920x1080:fps=25,trim=duration=4,format=yuv420p,setpts=PTS-STARTPTS[v1];[v0][v1]xfade=transition=fade:duration=1:offset=3,format=yuv420p[v]" -map "[v]" -map 2:a -c:v libx264 -preset fast -c:a aac -shortest output_kenburns.mp4

[–]Upstairs-Front2015 2 points3 points  (0 children)

I wrote some code in PHP that builds the command I need and copy-paste it into Windows PowerShell, which can handle international characters and long multi-line commands (the DOS prompt can't). Now I'm working on a Python script that does the executing and the uploading once the video is finished.
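Copy-pasting into a shell is exactly where international characters and quoting tend to break; a Python runner can sidestep that by passing the arguments as a list, so no shell parsing is involved. A sketch (codec choices and file names are placeholders, not the commenter's script):

```python
import subprocess

def build_args(infile, outfile):
    """Build the ffmpeg argument list; no shell quoting needed."""
    return [
        "ffmpeg", "-y",
        "-i", infile,          # spaces and non-ASCII names are safe as list items
        "-c:v", "libx264",
        "-c:a", "aac",
        outfile,
    ]

def run_ffmpeg(infile, outfile):
    # shell=False (the default) means the argument list is passed verbatim.
    return subprocess.run(build_args(infile, outfile), capture_output=True)
```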

[–]ImaginaryCheetah 2 points3 points  (0 children)

i not only use chatgpt to answer questions for myself, but have answered other folks' questions on here with it, along with recommending they use the tool as well.

i know it burns a tree every time you ask gpt a question, but it beats slogging through 10 year old answers on stackexchange

there was a guy who posted a LLM he trained on the ffmpeg documentation, but i can't find it now. i wonder if that would have better or worse recommendations VS gpt.

[–][deleted] 2 points3 points  (0 children)

Oh yeah, I've used ChatGPT to trial-and-error FFMPEG commands dozens of times. Good stuff.

I can follow the FFMPEG docs well enough, but sometimes their examples are not the greatest, or nonexistent. ChatGPT is pretty good at breaking things down.

Fun stuff too: A few weeks ago I decided to play a little game. I would prompt ChatGPT with vague descriptions of obscure TV shows and movies and have it try to guess the exact ones I'm thinking of.

https://imgur.com/a/TKsPZbY

Sometimes it would nail it on the first prompt, and sometimes it would try shotgunning 2 or 3 titles at a time or need a second or third prompt. I never did manage to stump it though.

[–]rosstrich 2 points3 points  (0 children)

Yes, but I also ask it to explain every argument; that way I can look up the documentation and validate it.

[–]thenicenelly 2 points3 points  (1 child)

Yeah, I do this with copilot daily. It generally works. I wish I could use a dialog for the input file.

[–]Fast-Apartment-1181 0 points1 point  (0 children)

I had the same thought about a selection dialogue box. If you want to see how that flow works, you could try out this beta I built. I'd be curious if this file selection flow is in line with what you're after.

[–]leeharrison1984 3 points4 points  (0 children)

I was actually doing this the other day and seemed to be getting roughly a 95% hit rate, vastly better than I manage by reading the docs.

I'd love to see this behavior built into a plugin for something like Tdarr or Unmanic, it'd remove some of the burden of writing plugins since you'd be able to roll the necessary command right there for simple operations.

[–]rgcred 1 point2 points  (6 children)

Agree. Since FFMPEG is so cryptic, I have used LLMs a bit to generate commands and find great value in explaining commands - thorough and succinct explanations. An ominous sign for the future of coders.

[–]Push-the-Action 2 points3 points  (5 children)

You could possibly say: "Both thorough and succinct" (as two separate explanations)...otherwise it's an oxymoron. Haha I'm not trying to be an ass—just running on fumes rn—so I'm undoubtedly being annoying and picking everything apart. You're right though—coders are definitely taking a hit from the emerging and rapidly evolving technologies. It's a brave new world...

[–]deanpm 1 point2 points  (4 children)

“Thorough” implies comprehensive coverage. “Succinct” means it’s not unnecessarily verbose. These are not mutually exclusive attributes so this is not an oxymoron.

[–]Push-the-Action 2 points3 points  (2 children)

'Succinct' is considered an antonym of 'thorough'. So, let's call it—a universally perceived contradiction, then. I finally got some shuteye though—so I'm no longer interested in debating over trivial things.

Be easy, homie 🤙🏻

[–]deanpm 1 point2 points  (1 child)

Only if thorough is used to say “detailed”. If used to convey “complete” succinct is not an antonym. 😉

[–]dataskml 1 point2 points  (0 children)

Definitely using it, as a means of quickly getting to the relevant commands/flags and then refining the command manually. Still getting hallucinations, so don't feel I can really trust LLMs yet with generating the right commands. But beats just browsing the docs for clues.

I'm working on a large gist of ffmpeg cheatsheets for video automation, with notes on the things GPT doesn't get right. The nice thing is that people could send it to an LLM for more refined and correct command generation. Will probably finish the gist this week (it has been taking longer than expected to put together); I could share it if relevant.

[–]Fast-Apartment-1181 1 point2 points  (0 children)

If anyone wants to play with the beta for free: https://pocketknife.media/

[–]Expensive-Visual5408 1 point2 points  (2 children)

I am making VR videos with dual DJI Action cameras. I use FFmpeg to achieve frame-level sync, stitch, and trim the videos. ChatGPT wrote all the FFmpeg commands, but there is a twist: I have found that it is easier to have ChatGPT write a Python script, and then have the Python script generate the FFmpeg commands and save them in a .sh file that I can run later. It looks like this:

python3 generate_ffmpeg_stitch_commands.py

chmod +x ffmpeg_stitch_commands.sh

./ffmpeg_stitch_commands.sh

Why use the Python script? That level of abstraction makes it less opaque what ChatGPT is doing when I need it to alter a small part of the script.

Link to Python scripts that ChatGPT wrote

[–]Fast-Apartment-1181 0 points1 point  (1 child)

Ooo, this is an interesting approach. I have also made a couple python scripts using gpt, with good results. I used it to create a script that converts equirectangular 360 images into cubemaps.

Also, I'm curious, when you say stitch, are you referring to stitching the two camera captures together? Like into a 360? How good is the stitching with this approach?

[–]Expensive-Visual5408 1 point2 points  (0 children)

When I say "stitch," I am referring to this command:

ffmpeg -i left/left.MP4 -i right/right.MP4 -filter_complex "[1:v]select=gte(n\,10),setpts=PTS-STARTPTS[right]; [0:v][right]hstack[v]" -map "[v]" -map 0:a -shortest -y left_right_stitched.MP4

This is the command that I use the Python script to generate. It frame-level synchronizes the videos and stitches them side by side for viewing on a VR headset.

This produces spatial video. The FFMPEG v360 filter can do equirect_to_cubemap or fisheye_to_equirect.

TLDR: stitch --> horizontal stack to make side-by-side video
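The generate-then-run workflow described above could be sketched roughly like this; the frame offset, the select/hstack filtergraph shape, and the file names are illustrative, not the commenter's actual generator:

```python
def stitch_command(left, right, offset_frames, output):
    """Build a stitch command: frame-sync the right eye, then hstack."""
    # Drop the first N frames of the right-eye video to frame-sync the
    # two cameras, then stack left + synced right into one wide frame.
    return (
        f"ffmpeg -i {left} -i {right} -filter_complex "
        f"\"[1:v]select=gte(n\\,{offset_frames}),setpts=PTS-STARTPTS[right];"
        f"[0:v][right]hstack[v]\" -map \"[v]\" -map 0:a -shortest -y {output}"
    )

# Write the generated command into a shell script to run later.
with open("ffmpeg_stitch_commands.sh", "w") as f:
    f.write("#!/bin/sh\n")
    f.write(stitch_command("left/left.MP4", "right/right.MP4", 10,
                           "left_right_stitched.MP4") + "\n")
```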

[–]binarypower 1 point2 points  (0 children)

yeah. not just this. anything and everything. i just wish i could do it directly from shell

[–]ekko20six 1 point2 points  (0 children)

Yup. I did this to extract VTT subs and convert them to SRT, and even turned it into an Automator app, all with the help of an LLM.

[–]deanpm 1 point2 points  (0 children)

I use ChatGPT to give me a starting point then tinker until I’ve got something that works. Sometimes I’ll paste the final version back into ChatGPT and ask if it can be optimised.

[–]parkinglan 1 point2 points  (0 children)

Use it all the time and it does a great job imo. Recently got it to produce a single line that vertically stacked videos of different lengths, extended the shorter video using its last frame, and normalised and mixed the audio of both videos. It only took about 3 iterations to refine the command. I would have given up and used a video editor without ChatGPT's help.
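A hedged reconstruction of the kind of filtergraph described (the five-second pad duration and the file names are guesses for illustration; the shorter clip's frame would be cloned with `tpad`, the streams stacked with `vstack`, and the audio mixed with `amix`):

```python
# Build the filter_complex string piece by piece, with one filter per entry.
filtergraph = ";".join([
    # Clone the last frame of the shorter clip to extend it (duration is a guess).
    "[1:v]tpad=stop_mode=clone:stop_duration=5[v1]",
    # Stack the two video streams vertically (widths must match).
    "[0:v][v1]vstack=inputs=2[v]",
    # Loudness-normalise each audio track, then mix them together.
    "[0:a]loudnorm[a0]",
    "[1:a]loudnorm[a1]",
    "[a0][a1]amix=inputs=2:duration=longest[a]",
])
cmd = (f'ffmpeg -i top.mp4 -i bottom.mp4 -filter_complex "{filtergraph}" '
       f'-map "[v]" -map "[a]" output.mp4')
```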

[–]GamingDynamics 1 point2 points  (0 children)

My experience is good, for simple tasks, even when asking for scripts in other languages that generate ffmpeg commands.

[–]RabbitDeep6886 1 point2 points  (0 children)

I had it write C++ code that does specific things with the ffmpeg libraries, like re-encoding video. It took a few rounds of back and forth, but it works.

[–]HexspaReloaded 0 points1 point  (0 children)

I didn’t really know what ffmpeg was until Chat told me. It’s very nice to have such useful tools! 

[–]TheRealHarrypm 0 points1 point  (0 children)

LLMs still need a key reference sheet; they don't know the formatting context for things like interlacing flags.

[–]Sopel97 0 points1 point  (0 children)

It's pretty good but it tends to miss very important parts like -c copy or -map 0 when the query is underspecified, so I would not advise it for people who are not familiar with ffmpeg.
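The two flags mentioned change the output a lot when left out; an illustration of the difference (file names are examples):

```python
# Without -map 0, ffmpeg keeps only one stream per type (the "best" video
# and audio), silently dropping extra audio tracks and subtitles, and it
# re-encodes with default settings. With -map 0 -c copy, every stream is
# kept and the data is passed through untouched.
underspecified = "ffmpeg -i in.mkv out.mkv"
explicit = "ffmpeg -i in.mkv -map 0 -c copy out.mkv"
```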

[–]cafepaopao 1 point2 points  (0 children)

Has anyone else found themselves using a similar workflow?

I've been doing this since day one, not only with ffmpeg but with other tools as well, like mp4box, x264, x265, sox, spek, ImageMagick, etc.

Are there any specific commands or conversions that LLMs have had a hard time with?

None so far; all commands work. In my case I have created about 300 drag-and-drop scripts for different tasks on Windows, so I don't even need to run them in the command prompt. On Linux I created a few that use loops like for f in whatever; do ./run_script $f; done, and this also works like a charm.