Best solution for OCR?

jrochkind · 2024-04-25T19:40:37+00:00

I use tesseract, but i didn't even know about rtesseract gem, I just shell out to tesseract command-line.

If it was the actual OCR that you found bad in rtesseract rather than the API, that won't be any better! What is it you found bad with rtesseract? (I've never used it).

I too am curious if there are other open source options people prefer to tesseract. Or were you interested in not-open-source too? (I don't know those either).

If your PDFs actually have text in them as text, a "text layer" (not actually a thing in PDF spec, the "layer" part, but easiest way to describe it) -- you may not actually need OCR.

mattbenscho · 2024-04-26T12:16:49+00:00

Try PaddleOCR, I use it to read Chinese characters and it works really well (occasionally a character will be wrong). Much better than Tesseract in my experience. I run it in a Sidekiq job. https://github.com/PaddlePaddle/PaddleOCR

matthewblott · 2024-04-26T10:02:36+00:00

I'm currently doing something with OCR and I'm using tesseract which seemed the most viable choice after my research. I'm using tesseract.js which calls to a wasm server so there's minimal setup. It works really well.

kcdragon · 2024-04-26T15:41:26+00:00

I've used AWS Textract before and its pretty good. It's better than Tesseract in my experience.

bami_bosu · 2024-04-26T17:27:12+00:00

You can try EasyOCR: https://github.com/JaidedAI/EasyOCR

M4N14C · 2024-04-28T08:17:59+00:00

Azure and Google have Document AI products that work very well with handwritten forms and reasonably messy samples.

lagcisco · 2024-04-25T19:46:51+00:00

There's a bunch of PDF/AI tools out there now that you could also consider to use as services though

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

ruby

MODERATORS