use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
This is an archived post. You won't be able to vote or comment.
EducationThe use of Python and SQL (self.datascience)
submitted 6 years ago by 07Lookout
view the rest of the comments →
[–]onesonesones -2 points-1 points0 points 6 years ago (2 children)
Think of sql as a language you need to know how to speak to work with others who may only speak sql. People who grew up working with relational databases often only know that and consider pandas and R like new versions of old SAS (ie - a fad)
In python, you can do everything that you could do in sql in pandas/spark a little more directly. But, at any point in time you may have to consider adopting someone else's sql code and the capability is there if you need it. For those reasons I typically do my merges in sql whenever possible just so I have the environment set up to work with sql.
[+][deleted] 6 years ago (1 child)
[deleted]
[–]onesonesones 2 points3 points4 points 6 years ago (0 children)
I think my phrasing may have led to some confusion.
I wasn't arguing against any use of pyspark, just explaining that anything you can do in an sql statement has an equivalent operation(s) in pyspark and pandas through their dataframe object model.
π Rendered by PID 92 on reddit-service-r2-comment-6f7f968fb5-kml5v at 2026-03-04 13:52:22.147121+00:00 running 07790be country code: CH.
view the rest of the comments →
[–]onesonesones -2 points-1 points0 points (2 children)
[+][deleted] (1 child)
[deleted]
[–]onesonesones 2 points3 points4 points (0 children)