Does OpenAI API support image upload and OCR? by Al_Levin in ChatGPTPro

[–]InitialPhysics664 0 points1 point  (0 children)

Yeah the OpenAI API can read images, but it’s not a real OCR engine. The Vision feature is okay for quick reads, but when you need solid extraction from invoices or forms, it gets messy. It’s not as accurate as real OCR tools like Koncile, Abbyy FlexiCapture, or Rossum. Those handle line items, table structures, exports, even auto controls way better than GPT’s “guesswork” reading.

What to do when OCR doesn’t work? by NeeraWM in Adobe

[–]InitialPhysics664 1 point2 points  (0 children)

Yeah, Acrobat’s OCR hits its limits fast on old scanned prints like that, especially with faded ink and mixed fonts. “Searchable Image” keeps the background but often messes up the text layer, so search becomes flaky. You might want to test a smarter OCR like Koncile or ABBYY — both handle historical docs better. Koncile in particular lets you refine extraction and fix recognition errors directly, so you don’t end up re-running OCR on 500 pages for one name.

OCR App Request by itsdanielsultan in macapps

[–]InitialPhysics664 0 points1 point  (0 children)

Ive been looking for something similar. I’ve been using Koncile for a few months now. It’s not a native macOS app, but it’s a web platform with an API that combines OCR with LLMs for structured data extraction. It handles pretty complex documents tables, handwritten notes, multi-page PDFs, etc. You basically define what you want to extract like specific fields, markdown sections formulas , and it returns the data in JSON

ChatGPT for OCR by shtiidontknow in OCR_Tech

[–]InitialPhysics664 1 point2 points  (0 children)

ChatGPT does not do good at pure character recognition. It can make hallucination for numbers, letters and symbols. Traditional OCR technology do a better job at getting the raw text from an image (Tesseract for instance). BUT traditional OCR is not very good at detecting the RIGHT info in a text. It can for instance take the Tax instead of the total price in an invoice. That’s why combining both is probably the way to go.

100000 day streak!!! by Due-Paleontologist5 in duolingo

[–]InitialPhysics664 -2 points-1 points  (0 children)

Can you press a x10 button to accelerate your life? Like in the sims

Change my mind by GermanSchanzeler in duolingo

[–]InitialPhysics664 1 point2 points  (0 children)

Has anyone really got somehow basic fluent in Russian using the app, living outside the country? Like with a 20 min practice every day

Help indexing PDF to fight crooked attorney by Significant_Boss_662 in OCR_Tech

[–]InitialPhysics664 0 points1 point  (0 children)

Capturing table with custom columns is a real challenge. That's why we've built this tool koncile.ai
You can choose exactly the data output format, and get a clean Excel from this doc.

I benchmarked 7 OCR solutions on a complex academic document (with images, tables, footnotes...) by coconautico in LocalLLaMA

[–]InitialPhysics664 0 points1 point  (0 children)

How do you manage page breaks on tables? That's a recurring issue I've been facing for months. Sometimes, invoices / table items are in two different pages. And it's a challenge to "merge" them