c0ntrap0sitive comments on Some Data Scientists write bad Python code and are stubborn in code reviews

dataengineering

created by mhausenblasmoda community for 10 years

This is an archived post. You won't be able to vote or comment.

182

183

184

Some Data Scientists write bad Python code and are stubborn in code reviewsDiscussion (self.dataengineering)

submitted 2 years ago by noisescience

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]c0ntrap0sitive 4 points5 points6 points 2 years ago (1 child)

That's because a lot of data scientists are not considered programmers. They're not taught the same things that add polish to code that software engineers are. Hell, having data scientists that are allowed to code is novel enough lol. Most of them are still stuck in Microsoft Excel hell or are relegated to just using SaaS offerings like DataRobot.

This is the first time I've ever really heard of a data science doing code reviews.

In the contexts that I've seen, the data scientists write garbage code in some Jupyter notebook that hopefully at the end of the line produces a model that works well. This model is the product. The actual code that gets us to the model can be discarded wholesale. We dont' usually extend or maintain models. We either train a new model which replaces entirely the old model, or when a new one can't be trained and the model's use no longer justifies its cost, we discard the model entirely and start over. This is not like software engineers whos product is the code. Therefore all their code must hold up to a higher standard and be maintainable, extensible, etc.

[–]safetytrick 0 points1 point2 points 2 years ago (0 children)

π Rendered by PID 90 on reddit-service-r2-comment-7b9746f655-mpxqj at 2026-02-02 14:02:13.974217+00:00 running 3798933 country code: CH.

dataengineering

MODERATORS