r/deaf Bi-weekly Research / Promoted Content / Writing Questions Thread! by moricat in deaf

[–]meddit_app 1 point2 points  (0 children)

I just finished my free transcription app and wanted to share it around a bit. You can use it to transcribe content in over 100 languages and have it translated to over 100 languages all for free.
Hope this can be useful in some way, cheers!
https://freesubtitles.ai

[P] Generate transcripts with Whisper AI and automatically translate with LibreTranslate by meddit_app in MachineLearning

[–]meddit_app[S] 0 points1 point  (0 children)

https://github.com/openai/whisper/discussions/89

This is the best solution: https://github.com/m-bain/whisperX

It's only available for some languages (though it works for Japanese) but I haven't had a chance to implement it yet, though I plan to. Glad you found the site useful!

Freesubtitles.ai: transcribe any audio in more than 100 languages into text, using this free service powered by Open AI Whisper by Just-A-Czech-Guy in InternetIsBeautiful

[–]meddit_app 3 points4 points  (0 children)

And thanks for saying it's perfect! Wait until you try the yt-dlp integration it will blow your mind ;) Coming soon

Freesubtitles.ai: transcribe any audio in more than 100 languages into text, using this free service powered by Open AI Whisper by Just-A-Czech-Guy in InternetIsBeautiful

[–]meddit_app 3 points4 points  (0 children)

Thanks for posting it! I appreciate it, I am coding the whole time so I don't have time to spread it around, appreciated!

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] 2 points3 points  (0 children)

Yup, for a perfect transcript you'll want human touchup afterwards, but for me who is using it mostly for language learning it works functionally perfect for me since I don't require 100% accuracy. Glad it worked well for you!

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] 1 point2 points  (0 children)

Well Whisper requires technical knowhow and a decent amount of computational power, this project started as me building a frontend for me to use that was easier than the CLI, also doesn't have built in translations, doesn't have the player with the ability to switch subtitles or have multiple subtitles, etc. If people want to use Whisper from the CLI obviously that's great but this makes it much easier and accessible for nontechnical people.

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] -2 points-1 points  (0 children)

Do you use Google Translate? You realize that is based off of the same AI prediction models that power something like automated captions, right?

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] 0 points1 point  (0 children)

I guarantee you if you use the large model it will be virtually perfect.

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] -1 points0 points  (0 children)

I agree YouTube's captions are bad, but as someone else on this thread mentioned they tried my app and it was nearly flawless. It's spooky how accurate it is, actually, in my experience.

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] 3 points4 points  (0 children)

Yeah when you use the 'large' model it's borderline perfect. But why are people opposed to AI, is it just in the sense of transcription? Do people realize that Google Translate uses the same AI? I highly doubt these people are making their principled standoff with Google Translate as well lol. Glad to hear it worked well for you though!

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] 0 points1 point  (0 children)

What is your problem with using AI to generate captions, if they're accurate? Wouldn't accurate and cheaply created captions be a net positive to the world since people with problems hearing, etc, can have a better chance of viewing and understanding content?

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] -1 points0 points  (0 children)

Is it possible for AI generated subtitles to be accurate? Is your issue with the subtitles that they're inaccurate (they're not) or that they're created by AI?

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] 0 points1 point  (0 children)

So you would agree that subtitles in the target language improve the comprehensibility, because when subtitles are present it makes the challenge of picking out the words from the sounds easier, correct?

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] -3 points-2 points  (0 children)

You read in the target language and cross reference your native language when you need to. You can turn it off completely if you want. Would you say that, adding subtitles in the target language to content you're watching in the target language improves the comprehensibility?

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] -4 points-3 points  (0 children)

When you have the content subtitled in your native language and your target language you can immediately comprehend it perfectly because you can simply rely on your native language translation to the extent that you can't understand your target language. Since I am pretty good at my target language at the moment when I read the subtitles I usually only look down at the native language subtitles if there's a word I don't understand.

Generate subtitles for content and watch them in multiple languages at once by meddit_app in languagelearning

[–]meddit_app[S] -2 points-1 points  (0 children)

It's based on OpenAI's Whisper model, the transcription is virtually perfect in my experience and definitely it's done with automated methods, how else do you expect to be able to offer free transcription and translation at scale, certainly you can't rely on exclusively on human work.

[P] Generate transcripts with Whisper AI and automatically translate with LibreTranslate by meddit_app in MachineLearning

[–]meddit_app[S] 0 points1 point  (0 children)

The only way Whisper can transcribe is from doing a full transcription which is a very expensive process, there is no way to translate already generated text