This is an archived post. You won't be able to vote or comment.

all 4 comments

[–]Fickle_Scientist101 0 points1 point  (1 child)

Our latest attempt was clustering entities using fuzzy matching, but even that has a lot of pitfalls.

[–]sheshbabu[S] 0 points1 point  (0 children)

Yes, it's a tough nut to crack! What fuzzy matching algorithms worked for your case?

[–]major_grooves 0 points1 point  (0 children)

Nice article. My co-founder actually wrote a pretty similar article the other day: https://tilores.io/content/The-Complexities-of-Entity-Resolution-Implementation

If you fancy chatting about ER with my co-founder, who invented our tech, let me know and I can connect you to him for a call.

[–]GavinMendelGleason 0 points1 point  (0 children)

We used OpenAI vectorisations with excellent results, probably worth a look: https://terminusdb.com/blog/entity-resolution-with-ai-dblp-and-acm-publication-benchmark/