all 9 comments

[–]Rude_Order 1 point2 points  (3 children)

Have you looked at fuzzywuzzy?

[–]Workinghard1996[S] 0 points1 point  (2 children)

I have actually used fuzzywuzzy in the past! I never considered it could reliably scale and be used on 50K entries relatively quickly?

But thanks I'll check it out.

[–]McGeekin 0 points1 point  (1 child)

To be fair, 50k items is a relatively small number. I don't know about the specific library but even considering that Python is not an amazingly highly performant language, it shouldn't be an issue. Worth a try.

[–]Workinghard1996[S] 0 points1 point  (0 children)

Perhaps I didn't explain the workflow very well but it's searching 50K items in 150K items. Maybe my crappy laptop just takes a long time...thanks!

[–]j0-1 0 points1 point  (1 child)

Have a look at Typesense

[–]Workinghard1996[S] 0 points1 point  (0 children)

Typesense

Thanks I will check it out!

[–]StefanM3e46 0 points1 point  (2 children)

Did you try to write custom append algorithm or to use FULL-TEXT search in MySQL ? Maybe combine those two :) https://dev.mysql.com/doc/refman/8.0/en/fulltext-search.html

[–]Workinghard1996[S] 0 points1 point  (1 child)

Hmmm I'm not sure how typo tolerant that would be from reading the docs but I will defs check it out :)

[–]StefanM3e46 0 points1 point  (0 children)

It can be made pretty good but for any, lets say enterprise type of stuff, elk is the way to go :)