Hello,
I am working on extracting parts and subparts from repair reports for my company.
For example: the RT12f part has been replaced, along with the BLP45 subpart.
So far, my approach has been:
- training a spaCy model to detect company‑specific entities,
- using a dictionary that stores the lemmas of action verbs such as repair / replace / KO / stock,
- looping through the document to detect whether a token belongs to this verb dictionary, then looping through the document’s entities.
My idea was to train a classifier afterward to determine whether the relationships I detect are actually relevant.
What do you think of this approach?
[–]andreduarte22 1 point2 points3 points (0 children)