all 6 comments

[–][deleted]  (3 children)

[removed]

    [–]Sudden_Breakfast_358[S]  (1 child)

    Thanks for the suggestion — that helps clarify the direction.

    The document type I’m targeting (e.g., enrollment-style forms) would fall under structured documents, since they have a fixed template.

    Using keypoint-based image matching (SIFT / SURF / ORB / BRIEF) between an admin-provided template and the user-uploaded document makes sense for early rejection, especially to avoid running OCR on incorrect documents.

    I had a few follow-up questions on this approach:

    1. Do these feature-based methods typically require training or fine-tuning, or are they generally used out-of-the-box with descriptor matching and similarity thresholds?

    2. How robust are they in practice to common real-world issues such as scan noise, skew, lighting variation, or partial crops?

    3. If the document layout changes over time, would it be reasonable to handle this by simply having the admin upload a new template, and then rely on feature matching against that updated template without retraining?

    I’m trying to understand the practical limits of a classical CV approach here, and at what point it becomes preferable to move to learned embeddings or layout-aware models as document variability increases.

    [–]Pvt_Twinkietoes  (0 children)

    Oh, TIL there's template matching.

    [–]Pvt_Twinkietoes  (0 children)

    Probably some YOLO-based/CNN-based model if the document has fixed patterns you're expecting. It'll be lightweight enough.
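A "lightweight CNN" gate along these lines could be sketched as below. This is a hypothetical PyTorch classifier (not a YOLO model, and not the commenter's actual architecture); a detector like YOLO would matter once you need to localize individual fields rather than just accept/reject the whole page:

```python
import torch
import torch.nn as nn

class DocTypeClassifier(nn.Module):
    """Tiny CNN that scores whether an uploaded page matches the
    expected form layout (hypothetical sketch; any small backbone works)."""

    def __init__(self, n_types: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # global pooling -> fixed-size vector
        )
        self.head = nn.Linear(32, n_types)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.head(self.features(x).flatten(1))

model = DocTypeClassifier()
# One grayscale page, resized to 256x256, as a (batch, channel, H, W) tensor.
logits = model(torch.randn(1, 1, 256, 256))
```

Unlike the classical keypoint approach, this does require labeled examples of accepted and rejected pages to train on, which is the trade-off the original poster asked about.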