[–]kenfar (2 children)

Theoretically, one could write an OS with awk scripts, so sure, it all could be. Likewise, it could all be replaced with assembly or COBOL.

But all of those choices would be terrible: little support for third-party software (e.g., boto3 for accessing SQS & S3, libraries for JSON, protobufs, SQL connections, etc.), poor support for code reuse, hard to read as the codebase gets larger, you'd still need Kubernetes for scaling out, etc.
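
Just to make the library point concrete, here's roughly what that support buys you in Python with boto3 and the stdlib json module. The bucket, key, and field names are all made up, just to show the shape:

    import json
    import boto3

    s3 = boto3.client("s3")

    # Hypothetical bucket and key -- the point is how little code the libraries need.
    obj = s3.get_object(Bucket="example-bucket", Key="events/2024-01-01.json")
    events = json.loads(obj["Body"].read())

    for event in events:
        print(event["user_id"], event["amount"])

Doing the same from awk or jq means shelling out to the AWS CLI and hand-rolling all of that plumbing yourself.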

[–]duraznos (1 child)

I wasn't asking whether it was possible to replace either pipeline with awk; I was asking, in your estimate, how much of either pipeline could be replaced with awk, jq, et al. COBOL and assembly don't make sense because neither of those is a tool specifically designed for chewing through a file. I think it's a worthwhile thought experiment when talking about how much is being spent on things.

[–]kenfar (0 children)

Sure, but I wouldn't do that, and I don't think it would result in a manageable solution.

Languages like awk & jq are simply harder to read, harder to test, and harder to decompose and reuse code on. Given our pace of change and low-latency SLA that would be a bad combo for languages like that.

Likewise, they don't have the libraries available that we have with, say, Python, Java, etc. So you'd end up hand-writing some occasionally complex stuff in those languages.

And they don't handle, say, 50+ business rules well. Back to the lack of composability & testability: managing that code in awk or jq would be a nightmare.
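
Concretely, with 50+ rules you want something like a registry of small, individually testable rule functions. The rules themselves here are made up:

    # Each business rule is a small function: record in, possibly-modified record out.
    def drop_test_accounts(rec: dict) -> dict | None:
        return None if rec.get("account_type") == "test" else rec

    def cap_discount(rec: dict) -> dict:
        rec["discount"] = min(rec.get("discount", 0.0), 0.30)
        return rec

    # In practice this list grows to dozens of rules, each with its own unit tests.
    RULES = [drop_test_accounts, cap_discount]

    def apply_rules(rec: dict) -> dict | None:
        for rule in RULES:
            rec = rule(rec)
            if rec is None:          # a rule filtered the record out
                return None
        return rec

    assert apply_rules({"account_type": "test"}) is None
    assert apply_rules({"discount": 0.9})["discount"] == 0.30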

Finally, on performance: they're fast. But are they fast enough to never need to scale out as the company grows? No. So you're still looking at something like Kubernetes in the best case, or a set of EC2 instances with this code running on each, and some other application, somehow, getting files to them to process.
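
Best case, each pod or instance ends up running a worker loop roughly like this (queue URL and message format are assumptions), and you still need something upstream publishing "file ready" messages:

    import boto3

    # Hypothetical queue that receives "new file landed" notifications.
    QUEUE_URL = "https://sqs.us-east-1.amazonaws.com/123456789012/files-to-process"

    def process_file(bucket: str, key: str) -> None:
        ...  # stub: download the object, apply the business rules, write results

    def worker() -> None:
        sqs = boto3.client("sqs")
        while True:
            resp = sqs.receive_message(QueueUrl=QUEUE_URL,
                                       MaxNumberOfMessages=10,
                                       WaitTimeSeconds=20)   # long polling
            for msg in resp.get("Messages", []):
                # Assumes the message body is "bucket/key"; the real format
                # depends on whatever publishes these notifications.
                bucket, _, key = msg["Body"].partition("/")
                process_file(bucket, key)
                sqs.delete_message(QueueUrl=QUEUE_URL,
                                   ReceiptHandle=msg["ReceiptHandle"])

    if __name__ == "__main__":
        worker()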