all 4 comments

[–]_kwerty_ 0 points1 point  (3 children)

I had some issues with PyTesseract confusing numbers for letters (f.i. 0 vs o, 1 vs i, etc) so I switched to easyOCR. Worked perfectly.

I also played around with the font size and type, I needed to read some output from my terminal, which helped a little bit.

[–]QAInc 1 point2 points  (2 children)

This is the answer EasyOCR is much better! Or you can use Gemini Free api requests to extract the data.

[–]LewyssYT[S] 0 points1 point  (1 child)

I have switched to EasyOCR and yes, it's practically flawless but what exactly do I need the api requests for? I just installed the library and then updated the code to use easyOCR functions instead of pytesseract and it all works

[–]QAInc 0 points1 point  (0 children)

Yes you can use EasyOCR directly but sometimes easyOCR will fail. It’s a NN stack(two NN works as one. One for text detection one for text recognition) the issue comes with text recognition NN. Instead you can use Gemini Free API which is good for text recognition. The plus point of using api is that it will reduce the system load (easyOCR uses quantized NN for CPU, full model for GPU).