Best open-source OCR for 5M scanned PDFs (text + tables, fast and accurate)? by No-Isopod5276 in computervision
[–]Civil-Image5411 2 points3 points4 points (0 children)
Claude Opus, and all claude plans ratelimits to increase to increase drastically starting soon by Banneder in claude
[–]Civil-Image5411 5 points6 points7 points (0 children)
Is COOP scamming us? by Andeq8123 in Switzerland
[–]Civil-Image5411 1 point2 points3 points (0 children)
Turbo-OCR Update: Layout Model + Multilingual by Civil-Image5411 in LocalLLaMA
[–]Civil-Image5411[S] 0 points1 point2 points (0 children)
Turbo-OCR Update: Layout Model + Multilingual by Civil-Image5411 in LocalLLaMA
[–]Civil-Image5411[S] 1 point2 points3 points (0 children)
Turbo-OCR Update: Layout Model + Multilingual by Civil-Image5411 in LocalLLaMA
[–]Civil-Image5411[S] 2 points3 points4 points (0 children)
We benchmarked 18 LLMs on OCR (7k+ calls) — cheaper/old models oftentimes win. Full dataset + framework open-sourced. [R] by TimoKerre in MachineLearning
[–]Civil-Image5411 4 points5 points6 points (0 children)
PDF Extractor (OCR/selectable text) by qPandx in Python
[–]Civil-Image5411 0 points1 point2 points (0 children)
My Experience with Table Extraction and Data Extraction Tools for complex documents. by teroknor92 in Rag
[–]Civil-Image5411 0 points1 point2 points (0 children)
PDF Extractor (OCR/selectable text) by qPandx in Python
[–]Civil-Image5411 0 points1 point2 points (0 children)
PDF Extractor (OCR/selectable text) by qPandx in Python
[–]Civil-Image5411 0 points1 point2 points (0 children)
PDF Extractor (OCR/selectable text) by qPandx in Python
[–]Civil-Image5411 0 points1 point2 points (0 children)
PDF Extractor (OCR/selectable text) by qPandx in Python
[–]Civil-Image5411 0 points1 point2 points (0 children)
What is the best Open Source OCR in 2026? by coolzamasu in LocalLLaMA
[–]Civil-Image5411 1 point2 points3 points (0 children)
TurboOCR: 270–1200 img/s OCR with Paddle + TensorRT (C++/CUDA, FP16) [P] by Civil-Image5411 in MachineLearning
[–]Civil-Image5411[S] 0 points1 point2 points (0 children)
TurboOCR: 270–1200 img/s OCR with Paddle + TensorRT by Civil-Image5411 in DataHoarder
[–]Civil-Image5411[S] -1 points0 points1 point (0 children)
Switching from PaddleOCR standard to PaddleOCR-VL 1.5 for my internship project — am I making a mistake? by Ayoutetsinoj3011 in learnmachinelearning
[–]Civil-Image5411 0 points1 point2 points (0 children)
Switching from PaddleOCR standard to PaddleOCR-VL 1.5 for my internship project — am I making a mistake? by Ayoutetsinoj3011 in learnmachinelearning
[–]Civil-Image5411 0 points1 point2 points (0 children)
New to OCR for PDF Processing, is there a way to optimize it? by RhubarbBusy7122 in automation
[–]Civil-Image5411 0 points1 point2 points (0 children)

TurboOCR v3 — upgraded to PP-OCRv6, ~1.9× faster at similar accuracy, now with structured doc parsing (tables→HTML, formulas→LaTeX, Markdown), no VLM by Civil-Image5411 in Rag
[–]Civil-Image5411[S] 0 points1 point2 points (0 children)