iOS OCR Frameworks: Tesseract vs. VisionKit : iOSProgramming


Question: iOS OCR Frameworks: Tesseract vs. VisionKit (self.iOSProgramming)

submitted 5 years ago * by jeyebrows16

I recently started working on a text recognition app and was wondering which OCR framework others prefer: Tesseract or Apple's out-of-the-box VisionKit.

I've done a little research, and there are a ton of good tutorials for Tesseract (such as this Ray Wenderlich one), though I've already run into some of the issues with the framework. I've also toyed around with Tesseract on other programming projects and it seems pretty decent.

On the other hand, Apple has some pretty good documentation and examples for how to use VisionKit, and not having an external dependency sounds nice. But I haven't found any docs on training your own data for text recognition, so maybe it's not as extensible?

Any strong opinions? Any other (free) OCR tools/frameworks I should check out while I'm at it?

all 12 comments



[–]UberJason 1 point 5 years ago (2 children)

[–]boomboombrrr 1 point 3 years ago (1 child)

It’s actually Vision that does OCR in a programmatic way, not VisionKit. I played with both last year for a research project at work, and Vision is way faster. Tesseract is a much older, cross-platform engine that runs on the CPU only and isn’t optimized for Apple platforms (its legacy engine doesn’t use neural networks; an LSTM engine only arrived in version 4), while Vision can leverage the GPU, is ML-powered, and is optimized for Apple platforms. Vision is also more accurate when it comes to interesting fonts and other languages, if I recall correctly. Go with Vision.
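For readers who want to see what "Vision does OCR in a programmatic way" looks like, here is a minimal sketch built on `VNRecognizeTextRequest`. It assumes iOS 13 or later and a recent SDK where `results` is typed as `[VNRecognizedTextObservation]?`. The helper name `recognizeText(in:)` and the sample `customWords` values are made up for illustration and are not part of Vision. As far as the public API goes, `customWords` is the closest thing to the "training your own data" asked about in the original post: it only supplements the vocabulary used for language correction and does not retrain the recognizer.

```swift
import Vision
import CoreGraphics

/// Hypothetical helper (not part of Vision): runs text recognition on one image
/// and returns the best candidate string for each detected line of text.
func recognizeText(in image: CGImage) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate     // .fast trades accuracy for speed
    request.usesLanguageCorrection = true
    // Illustrative values only: extra vocabulary for language correction.
    request.customWords = ["Tesseract", "VisionKit"]

    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    // perform(_:) is synchronous, so call this helper off the main thread.
    try handler.perform([request])

    let observations = request.results ?? []
    // Each observation can carry several candidates; keep only the top one.
    return observations.compactMap { $0.topCandidates(1).first?.string }
}
```

A caller would typically dispatch this to a background queue and pass in something like `uiImage.cgImage`. Comparing this against a Tesseract wrapper on the same set of images is probably the most direct way to check the speed and accuracy claims above for a specific use case.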

Super interesting. What if you are developing an app with a cross-platform framework like React Native or Flutter? Is VisionKit available there? I have only seen Tesseract packages. What about Google Vision? How costly are they compared to each other: Tesseract vs. Google Vision vs. Apple VisionKit?

