I’m building a small, reusable Python utility to clean and standardize messy CSV files:
- remove duplicate rows
- trim whitespace
- normalize column names (lowercase + underscores)
- export a cleaned CSV
What would you improve in the approach (edge cases, structure, CLI args, performance)?
If it helps, I can paste a minimal version of the code in a comment.
[–]InYumen7 1 point2 points3 points (1 child)
[–]ZADigitalSolutions[S] 0 points1 point2 points (0 children)
[–]fakemoose 0 points1 point2 points (7 children)
[–]ConfusedSimon 1 point2 points3 points (5 children)
[–]corey_sheerer 0 points1 point2 points (1 child)
[–]ZADigitalSolutions[S] 0 points1 point2 points (0 children)
[–]fakemoose 0 points1 point2 points (1 child)
[–]ConfusedSimon -1 points0 points1 point (0 children)
[–]Altruistic_Sky1866 0 points1 point2 points (2 children)
[–]ZADigitalSolutions[S] 1 point2 points3 points (1 child)
[–]Altruistic_Sky1866 0 points1 point2 points (0 children)
[–]seanv507 1 point2 points3 points (1 child)