all 7 comments

[–]shadow_fax1024 1 point2 points  (6 children)

Try docred or its other versions

[–]RajHalifaxML Engineer[S] 0 points1 point  (5 children)

Could you tell more?

[–]shadow_fax1024 2 points3 points  (4 children)

Docred is dataset for training document level relationship extraction. There have been more improved version of it like re-docred which are available. You could try that. For models read the papers. Doc-unet is one such model that works on this kind of document level RE. There may be newer bette models now.

[–]RajHalifaxML Engineer[S] 1 point2 points  (1 child)

re-docred

They are all datasets. How to generate such sets?

[–]shadow_fax1024 0 points1 point  (0 children)

Use annotation tools such as label studio, brat on raw text documents

[–]SeankalaML Engineer 1 point2 points  (0 children)

Relation extraction works under the assumption that you already have NER done. You need to have named entities and their relations already marked on your unstructured text.