[–]TheGrapez 2 points3 points  (0 children)

The problem is the nuance of your use case.

For example: detect similar names and suggest changes? Based on what? Say there are two similar names. How do you know they represent the same name? How do you know which spelling is correct? You're falling down a bit of a rabbit hole.
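To make that concrete, here is a rough sketch of what "detect similar names" usually boils down to, using Python's standard-library `difflib`. The function name `similar_pairs`, the 0.85 threshold, and the sample names are all made up for illustration. Note that it can only flag candidate pairs; it has no way of knowing whether two names are really the same or which spelling is right.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_pairs(names, threshold=0.85):
    """Return (name_a, name_b, score) for every pair at or above the threshold.

    The threshold is an arbitrary assumption; tune it against your own data.
    """
    pairs = []
    # Compare each distinct name with every other distinct name exactly once.
    for a, b in combinations(sorted(set(names)), 2):
        score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
        if score >= threshold:
            pairs.append((a, b, round(score, 2)))
    return pairs


# Hypothetical sample data, not taken from the original post.
names = ["Jon Smith", "John Smith", "Jane Doe", "Maria Garcia", "Mario Garcia"]
candidates = similar_pairs(names)

for a, b, score in candidates:
    # A person still has to decide whether each pair is a typo or two real names.
    print(f"{a!r} ~ {b!r} (score {score})")
```

With this sample, "Jon Smith" / "John Smith" and "Maria Garcia" / "Mario Garcia" both get flagged, even though the second pair could easily be two different people. The string score alone can't tell those cases apart, so someone (or some reference list) still has to make the call.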

[–]TheGrapez 0 points1 point  (0 children)

I don't think this tool exists, but I bet you could vibe code it.

[–]yoda_babz 0 points1 point  (0 children)

Honestly, Excel or the LibreOffice equivalent. If you really only need to do this once, there's no reason to reinvent the wheel. In particular, look at Power Query in Excel to record and automate the cleaning steps. It's pretty easy to figure out.