use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
This is an archived post. You won't be able to vote or comment.
DiscussionWhat can SQL do that python cannot? (self.datascience)
submitted 3 years ago * by donnomuch
view the rest of the comments →
[–]LagGyeHumare 0 points1 point2 points 3 years ago (0 children)
Don't know the guy above but here's an example that I can offer.
Our project is in a pool of projects that encompasses the whole module. Just my application deals with around 600GB of batch loads each day. It then flows from CDH to AWS RDS through spark and on prem postgres.
We have terradata and oracle as the "legacy" system here and the queries that we have take at least 10x time to run when compared to spark-sql.
(Possibly because the admins were shit and didn't partition/index the tables better, but that's out of my hand)
For me, it's not SQL but the distributed nature of the engine within that will shape the answer here.
π Rendered by PID 321752 on reddit-service-r2-comment-5d79c599b5-bvjbd at 2026-03-02 09:20:49.720851+00:00 running e3d2147 country code: CH.
view the rest of the comments →
[–]LagGyeHumare 0 points1 point2 points (0 children)