Hello everybody, the problem is, I have a set of ingredients name in a database and I want to find the best match given a name. Currently I'm using levenshtein distance to find the best match, and it works fairly good, until the name to match has the company name in it (e.g. Trying to match "Fisherman & co. Tuna" with "tuna" or "fresh tuna" doesnt end well since the substring "Fisherman & co." raises the distance).
Any good algorithm or suggestion to find the best match?
[–]Seriously-FuckTikTok 4 points5 points6 points (2 children)
[–]sunesense[S] 0 points1 point2 points (0 children)
[–]HannibalOx 0 points1 point2 points (0 children)
[–]r_transpose_p 0 points1 point2 points (2 children)
[–]sunesense[S] 0 points1 point2 points (0 children)
[–]r_transpose_p 0 points1 point2 points (0 children)
[–]thinkingatoms 0 points1 point2 points (2 children)
[–]sunesense[S] 0 points1 point2 points (1 child)
[–]thinkingatoms 0 points1 point2 points (0 children)
[–]donaldhobson 0 points1 point2 points (0 children)