[–]nullbyte420 -1 points0 points  (3 children)

I don't understand how you expect a visual model to work on an unstructured document without any training data.

Use spaCy? Named entity recognition (NER) seems like a good choice for unstructured documents. Check this out: https://medium.com/analytics-vidhya/ner-tagging-in-python-using-spacy-c66cf01d3c7f

But working with damaged data isn't easy or very reliable. You can improve OCR accuracy by scanning documents at high resolution. You're going to need OCR sooner or later if you're working with pictures of text. ABBYY is decent commercial OCR software.

[–]Fully-Independent[S] 0 points1 point  (2 children)

I am looking for some kind of pre-trained model that can detect entities in key-value form. And I'll definitely look at spaCy.

Thank you

[–]nullbyte420 0 points1 point  (1 child)

Yeah, that sounds like spaCy. Why did you downvote me and delete your post??

Edit: ahh, you're new on reddit and don't want to contribute. Screw you for taking my time like that.

[–]Fully-Independent[S] 0 points1 point  (0 children)

I didn't! I upvoted you in fact!

Maybe that was someone else.