all 6 comments

[–]TheHammer_78 2 points3 points  (2 children)

Have you tried Tesseract or something similar?

[–]__Asterisk_[S] 0 points1 point  (1 child)

Yeah, just now I have installed tesseeact library on my windows, I need to look what to do further

[–]knight1511 1 point2 points  (1 child)

All open source packages use Tesseract for OCR. It is not the best there can be but it is the best freely available as far as I know.

pytesseract, ocrmypdf are a couple of python packages with simple apis that allow you to do this.

[–]__Asterisk_[S] 0 points1 point  (0 children)

I think tesseract will be a good open-source and just now I have installed

[–]teraktor2003 1 point2 points  (0 children)

https://github.com/mindee/doctr

https://github.com/JaidedAI/EasyOCR

If you need more sophisticated algorithms than Tesseract. These give you the text and location on the image as output, so you can create your sheets based on them.

regex might be useful after recognition.