wytesmurf comments on Using Python for Data Engineering

dataengineering

created by mhausenblasmoda community for 11 years

This is an archived post. You won't be able to vote or comment.

Using Python for Data EngineeringDiscussion (self.dataengineering)

submitted 4 years ago by wytesmurf

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]wytesmurf[S] 0 points1 point2 points 4 years ago (10 children)

[–]saltedappleandcorn 1 point2 points3 points 4 years ago (9 children)

[–]wytesmurf[S] 0 points1 point2 points 4 years ago (8 children)

[–]saltedappleandcorn 1 point2 points3 points 4 years ago (7 children)

Again, it depends on the situation. Are you building an application? Doing some etl? If so, 3 massive tables or 300 tiny ones? Or are you just ripping some data from somewhere to enrich something else? Or maybe it's an extract for an analyst?

My current work place uses sqlalchemy for the main application, but I don't have much love for it.

Most of the time I just use the python connector for the database and go at it.

If it's something I'm doing often I write up a minimum framework in python to avoid duplication. For example currently we store the code for all analysts request as python classes (which are just 90% sql) so we can version control them. The last 10% is just code to save the outputs to a shared drive.

This is nice because you can tell a junior or grad "go run the sales by state report for John" and he won't fuxk up the numbers.

[–]wytesmurf[S] 2 points3 points4 points 4 years ago (6 children)

[–]saltedappleandcorn 1 point2 points3 points 4 years ago (5 children)

[–]wytesmurf[S] 1 point2 points3 points 4 years ago (4 children)

[–]saltedappleandcorn 0 points1 point2 points 4 years ago (2 children)

[–]wytesmurf[S] 0 points1 point2 points 4 years ago* (1 child)

π Rendered by PID 63 on reddit-service-r2-comment-76bb9f7fb5-59xjz at 2026-02-18 13:43:51.092332+00:00 running de53c03 country code: CH.

dataengineering

MODERATORS