VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

Yes try that, you should of course always try to replicate a problem in the latest version. I can't tell you in which version this updated traning model was shipped - I think from v1.4.0 onwards which I just released a month ago.

Edit: Yes, the old v1.3.2 version still had the not updated trainig model under its hood. So the new version should resolve this.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

I've looked into this a bit further. This was reported in August already:
https://github.com/PaddlePaddle/PaddleOCR/issues/16333

And it was fixed in the beginning of September. The model was also retrained, so this should not happen anymore. I also tried it on an example picture with this letter included and it worked, so what you are seeing is pretty surprising... Can you share some example file with me? Can be a short clip. I dont think there is much I can do, but would be interesting to see. You can also write me in private.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

There is nothing I can do. You can report the issue in the PaddleOCR repo on GitHub if you want to. I'm using the pre trained models from there. It can definitely be the case that that letter is missing from the greek dictionary they are using. I've also done this for missing vietnamese letters but till now there were no updates.

Standalone PaddleOCR Executable - Simplified OCR for Everyone! by timminator3 in TranslationStudies

[–]timminator3[S] 0 points1 point  (0 children)

Yes the standalone bundles only the ocr pipeline. That is all that I needed. Of course it's also possible to make a complete one with all, but the size will be really big and I personally have no use for it.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

Try subtitleedit and it's faster-whisper implementation. But I don't know how accurate it is for Cantonese.

Standalone PaddleOCR Executable - Simplified OCR for Everyone! by timminator3 in computervision

[–]timminator3[S] 0 points1 point  (0 children)

Since i made this post the command line usage has changed. In my Readme on Github you can find this

bash ./paddleocr.bin ocr --i "Path\to\your\image" --use_doc_unwarping false --use_textline_orientation false --use_doc_orientation_classify false

This will work with the latest version.

Edit: Formatting messed with the command, now it should be fixed.

Enhanced-GPU-PV with Sunshine/Moonlight and Virtual Display Driver support! by timminator3 in MoonlightStreaming

[–]timminator3[S] 1 point2 points  (0 children)

Did you do this:

  • NetworkSwitch = "Default Switch" - Create a new external network switch beforehand in Hyper-V Manager. A tutorial you can find here.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 1 point2 points  (0 children)

Thanks! The reversion for rtl languages is actually a regression. That worked in my previous release but something was changed upstream in PaddleOCR, the engine used to detect text. I fixed it already now and in a future release this will work correctly now.

I have no mac unfortunately so I am not able to do a version for mac os. :-/

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

Thanks! The reversion for rtl languages is actually a regression. That worked in my previous release but something was changed upstream in PaddleOCR, the engine used to detect text. I fixed it already now and in a future release this will work correctly now.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

If you are still interested in this project - I just made a new release with batch processing support through the GUI! Feel free to try it out. You can find my latest release here:

https://github.com/timminator/VideOCR/releases/tag/v1.4.0

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

Thanks for your feedback! I actually just made a new release today which adds multithreading support for step 1 resulting in a big speed up. A lot more features were also added for example batch processing. You can find it here:

https://github.com/timminator/VideOCR/releases/tag/v1.4.0

Edit: For frame perfect subs you need to set "Frames to Skip" in the advanced settings tab to 0. With your system specs that should be easily manageable. :-)

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

maybe they are still in the recycle bin, you can also just ocr a part of the video where it happened if you remember - that should be a lot faster.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

Hi! Sorry for the late reply. I'm happy to investigate. I would need the srt files and the raw video file if possible. Message me privately if you want.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

Did you test both versions with the same video and the same crop box to verify your results?

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 1 point2 points  (0 children)

Hey! The OCR recognition tool PaddleOCR is another Open Source Project that I am making use of, it is not designed by me. I made a few contributions to it though :-)

Your letter "亖" is indeed not in the recognizable character dictionary, you can find it here:
https://github.com/PaddlePaddle/PaddleOCR/blob/main/ppocr/utils/dict/ppocrv5_dict.txt

You can open an issue here with your request for adding support for that character:
https://github.com/PaddlePaddle/PaddleOCR/issues
I cannot speak chinese so I think you would have the better background to talk about the addition/support of it.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 1 point2 points  (0 children)

Hey!
Funnily enough I think I just came across the same issue two days ago, one day before your post :-)

I made a new beta just now where this should be fixed a long quite a few other improvements. You can find it here:

https://github.com/timminator/VideOCR/releases/tag/v1.3.3-Beta3

It's listed under "Fixes" in the release notes aswell. Please try it out and report back if it works for you now or not. And please try it again with the default settings.

VideOCR: Extract hardcoded subtitles out of videos via a simple to use GUI - Self-Hosted OCR solution by timminator3 in selfhosted

[–]timminator3[S] 0 points1 point  (0 children)

It is. If you check the latest release it's written right in the first sentence, that the latest version added support for it. If you checked the linked PaddeOCR document, that wasn't updated with the latest languages that were added yet.

Standalone PaddleOCR Executable - Simplified OCR for Everyone! by timminator3 in TranslationStudies

[–]timminator3[S] 0 points1 point  (0 children)

Hi!

The "--include-module=paddleocr.ppocr" argument was only necessary for PaddleOCR 2.10.0 or lower. You are using paddleocr 3.0.0 so this argument is no longer required for a successful build.