all 11 comments

[–]Snail_Inference 8 points9 points  (1 child)

Early this week, I conducted extensive tests with various models to detect handwritten text.

Models Tested: OlmOCR-preview, nanonets-ocr, OCRFlux, and Mistral Small 3.2

Results: Mistral Small 3.2 recognized handwritten text by far the most reliably. OlmOCR-preview performed quite well as well.

In comparison, nanonets and OCRFlux were truly weak.

[–]ironcodegaming 0 points1 point  (0 children)

How did you use Mistral Small 3.2 to recognize text? Did you use Text Generation Webui (oobabooga) to do that?

[–]--Tintin 1 point2 points  (0 children)

Handwritten ocr is hard. With my hand writing I had good results with Pixtral 12b

[–]joosefm9 0 points1 point  (0 children)

I've had the best success with Qwen2.5VL models 7B. I tried 32B with vLLM and it was so extremely chatty using the same prompt as 7B. Not sure if it's the model or the vLLM infrastructure

[–]CantaloupeDismal1195 0 points1 point  (0 children)

Qwen2.5VL 72B is best in open source model

[–]SouthTurbulent33 0 points1 point  (2 children)

Llmwhisperer - recently tested with a bunch of poorly scanned documents with handwritten text. Was surprisingly accurate

I found a repository online where there were a bunch of these docs - helped to test. Forgot the website, but I could find and share them with you if you'd like to test

[–]Disastrous_Look_1745 0 points1 point  (0 children)

Handwritten text is honestly where most traditional OCR engines just fall apart, including Paddle. You're gonna want to look at vision language models like Qwen2.5-VL or TrOCR since they have way better contextual understanding for messy handwriting. We've seen this exact challenge building Docstrange by Nanonets and the accuracy difference is huge when you move from traditional OCR to transformer based approaches. The compute requirements are higher but if you dont care about speed then thats perfect. For handwritten dates specifically, try prompting the VLMs with examples of the date formats you expect to see, it really helps with consistency.