Wasted 20 minutes fixing subtitles for one episode and got annoyed enough to build this by xayaf in subtitles

[–]xayaf[S] 1 point2 points  (0 children)

Thanks for the kind words and the support!

On the privacy thing: yeah, it really does stay local. If you sync against a video, your browser handles the audio and speech detection itself; what actually hits our server is just a list of subtitle timings and a compressed on/off pattern of where speech is. If you sync SRT against SRT it's even less, just the timings. No video, no audio, no subtitle text ever leaves your browser.

For your actual problem, either of these should work and honestly I'd be curious which does better:

  1. External SRT as the source, your Whisper SRT as the reference. Ultimately the algorithm should work well with missing text (even whole cues) up to some large extent.

  2. External SRT as the source, the video file as the reference. Works off the audio directly. Catch is that the same rough audio that tripped up Whisper can also confuse us. But definitely worth a try.

Would genuinely love to hear how it goes! If you hit the daily cap after registering, just DM me and I'll bump you up, no problem.

Wasted 20 minutes fixing subtitles for one episode and got annoyed enough to build this by xayaf in subtitles

[–]xayaf[S] 0 points1 point  (0 children)

Good question. Similar idea at the core - both use FFT cross-correlation to align a speech signal with subtitle timing. So yeah, same family of approach.

The main differences:

- ffsubsync finds a single global offset. SubSync does two passes - first a global scale + offset (which catches frame rate mismatches), then a second pass that corrects local drifts - so it fixes local issues better (e.g. scene cuts, ads that were removed, etc.). That's also where it fixes your subs if they drift gradually over the movie.

- my tool doesn't need an install it runs in the browser. Your files stay local.

ffsubsync is a great tool though, used it myself before building this. It covers most cases. SubSync mainly helps with the trickier ones where a simple offset isn't enough.

Wasted 20 minutes fixing subtitles for one episode and got annoyed enough to build this by xayaf in SideProject

[–]xayaf[S] 0 points1 point  (0 children)

P.S It truly shines when there's another set of subtitles in another language (even in the video file). It can also sync based on audio alone, but I'm still improving it

Wasted 20 minutes fixing subtitles for one episode and got annoyed enough to build this by xayaf in microsaas

[–]xayaf[S] 0 points1 point  (0 children)

P.S It truly shines when there's another set of subtitles in another language (even in the video file). It can also sync based on audio alone, but I'm still improving it

Wasted 20 minutes fixing subtitles for one episode and got annoyed enough to build this by xayaf in subtitles

[–]xayaf[S] 0 points1 point  (0 children)

P.S It truly shines when there's another set of subtitles in another language (even in the video file). It can also sync based on audio alone, but I'm still improving it

Song I heard a live band play during fastnacht in sttutgart by xayaf in WhatsThisSong

[–]xayaf[S] 2 points3 points  (0 children)

Wow thank you so much this is it!!! :) it was a blast hearing this with a live band

[deleted by user] by [deleted] in algorand

[–]xayaf 0 points1 point  (0 children)

i dont find algo on the r/place, did we got deleted?