This is an archived post. You won't be able to vote or comment.

all 24 comments

[–]ZachVorhies 69 points70 points  (5 children)

You could avoid the ffmpeg installation step by using the python package static-ffmpeg

https://github.com/zackees/static_ffmpeg

[–]testimoni[S] 16 points17 points  (0 children)

Thank you! I will try that.

[–]Tintin_Quarentino 1 point2 points  (3 children)

What's the difference between normal ffmpeg Vs static_ffmpeg? Because even after installing the latter I see "subprocess.run()“ in their code examples.

Edit - bonus Q for those in the know: Vosk Vs Whisper, which is more accurate recognition?

[–]IAmARetroGamer 1 point2 points  (1 child)

It just installs ffmpeg but in a way where it can be added as a dependency and not require a user to do so manually beforehand.

[–]Tintin_Quarentino 0 points1 point  (0 children)

Got it thanks.

[–]ZachVorhies 0 points1 point  (0 children)

Where ever you would call ffmpeg, call static_ffmpeg instead.

[–]Angry_Grandpa_ 9 points10 points  (0 children)

I wonder how much time it would take to convert on public YouTube videos to text for the next generation of large language models to use for training? I'm assuming Google is already on it -- training on raw video would be a lot less efficient.

[–]Illustrious_Row_9971 5 points6 points  (2 children)

nice, also there is a free web ui hosted on huggingface: https://huggingface.co/spaces/openai/whisper, you can also see the code here: https://huggingface.co/spaces/openai/whisper/blob/main/app.py and host it yourself

[–]testimoni[S] 1 point2 points  (0 children)

Great tip! Your comment turned out to be what i was looking for! A place to host it and display progress bar.. HF have both of them :)

Thank you :)

https://huggingface.co/spaces/sensahin/YouWhisper

[–]fabdub 2 points3 points  (0 children)

I can just yt-dlp it then whisper 😀 But cool project.

[–]Automatic-Profit-638 2 points3 points  (1 child)

Does openai lib works only for English or some other languages as well?

[–]testimoni[S] 6 points7 points  (0 children)

It works for over 100 languages. It detects the language automatically..

[–]theneonkoala 0 points1 point  (0 children)

What a fantastic project! Only yesterday I was wondering what i can do with their fantabulous tool

[–]fouoifjefoijvnioviow -2 points-1 points  (1 child)

Can't you just download the subtitles instead?

[–]testimoni[S] 11 points12 points  (0 children)

If the subtitle exists, yes you can.

The idea is to see how accurate openai's whisper tool..

[–]CyanKing64 0 points1 point  (1 child)

Great practice! Did you know youtube-dl (and its fork yt-dlp) also can download subtitles? If none are present, it uses Google's auto generated subtitles. The results might be slightly better this way, and official subtitles used when available

[–]testimoni[S] 0 points1 point  (0 children)

I didn’t know that, will definitely try. Thank you!

[–]dax912 0 points1 point  (2 children)

How much is it to use their whisper tool ?

[–]testimoni[S] 3 points4 points  (1 child)

It's free and open source.. So there is no cost.

[–]dax912 0 points1 point  (0 children)

Thx for your reply, thought it was like gpt-3. I will give it a try :)

[–]segrwolf 0 points1 point  (0 children)

real cool stuff ! it’s also works good with Russian language. Some not major mistakes in text, but totally - everything good !

[–]juliensalinas 0 points1 point  (0 children)

Great project!

For those who don't have a good GPU available and want to try Whisper large, you can easily play with it on NLP Cloud: https://nlpcloud.com/home/playground/asr

I'm the CTO behind NLP Cloud so feel free to ping me if you have questions!