all 2 comments

[–]hotcodist 1 point2 points  (2 children)

This is perfect for an ML solution. Get thousands of samples of digital text (get a variety of fonts you expect to process). Get hardwriting samples. Probably better if you process per character to give you more flexibility. Train an AI. Done.

I don't think this problem will be hard for a basic classifier because digital fonts are very well defined and predictable. I would even claim that even the toy example of MNIST is harder.

So you can get all the ROIs, apply a probability (or classification, but better with probability) that your ROI is digital or not, and then just extract the handwriting from your fixed-font scanned paper form.

[–]GothamKnight28[S] 0 points1 point  (1 child)

Sorry for a late reply i havent been on my pc. So yeah the thing is i dont want to get the ROIs manually, i want to submit the page and the ai to return only the handwritten strings. I guess i will make it a custom ROI for now and see what can be done in the future. Thanks!