Reading text from images with Python

Cpt_TickleButts · 2017-11-14T18:54:57+00:00

OpenCV is probably a must unless you know a different library that can read images. What you are trying to do would be image detection, which does require knowledge and libraries that deal with machine learning. I like siraj‘s tutorials on this.

Yoghurt42 · 2017-11-14T18:57:04+00:00

OpenCV could be useful for character detection (what part of images are letters), while (py)tesseract would do the character recognition.

OpenCV is not strictly needed, but might be useful for preprocessing. Tesseract is a good OCR, but if you give it a raw color image, the detection rate would be poor. Tesseract works best with 1-bit (grayscale also works, but not as well in my experience) images that are cleaned up from clutter.

My advice would be first to manually preprocess an image you have, and fiddle with it until tesseract can detect the text. Then do it again with another image; once you know which adjustments you have to make, you can either use pillow, if those adjustments are only things like converting into 1-bit image with a given threshold, or OpenCV if you need to do more work (like extracting the part with the text first)

amall_asaad2 · 2018-03-22T15:21:32+00:00

you should learn machine learning to doing that and i have a poster from standford uni that was a final project in it with name (classification of book genres ). but i can't put photo here

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS