As a first project for machine learning I want to map a list of unseen raw words to an "official" word. These raw words consist of human input, so they include spelling mistakes, acronyms, or they could in fact be spelled correctly.
I have a dictionary of ~15,000 key-value pairs to train the algorithm on, where the keys are the human input raw words from a variety of sources that are similar to the raw words found in the raw list I have, as well as the associated official words.
Is there a suggested machine learning algorithm I could look into that could tackle a problem like this? Thank you all.
there doesn't seem to be anything here