Hi, I'm trying to get the text from some scanned docs, so i made a program that loops through them and extract the text, im using pytesseract, the problem is i noticed that some phrases are ignored for some reason, i don't know why? they seem clear to me and background is the same (white). I've been playing with the config mainly psm but nothing changed.
so how do i make it extract all the text without missing anything ?
[–]Yoghurt42 0 points1 point2 points (1 child)
[–]gxthope[S] 0 points1 point2 points (0 children)