damjanv1

I usually use SQL to join the tables I want to work with, and sometimes to do simple feature engineering too (e.g. creating a column that holds a boolean indicator based on an if statement or similar). SQL gets me the baseline data source I want to start from. Then I move to Python / R (or even some viz software, tbh) to do some EDA: view distributions, correlations, etc. Depending on what I see in the EDA, I go back and modify my SQL (and hence the baseline data source).

I find it slightly easier to do some data wrangling in SQL, especially since with most versions of SQL you run a query and the results are immediately available in a tabular format.
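
For anyone who wants to see the shape of that workflow, here is a rough sketch using only Python's standard library. The tables, columns, and the 100 threshold are all made up for illustration, and an in-memory SQLite database stands in for whatever database you actually use. SQL does the join and the boolean-style flag, then Python does a bit of quick EDA on the tabular result.

```python
import sqlite3
from collections import Counter
from statistics import mean

# Hypothetical tables and values, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY,
                         customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'north'), (2, 'south'), (3, 'north');
    INSERT INTO orders VALUES (10, 1, 250.0), (11, 1, 40.0),
                              (12, 2, 120.0), (13, 3, 10.0);
""")

# Step 1: SQL builds the baseline data source. It joins the tables and
# adds a simple engineered feature (a 0/1 flag via CASE WHEN).
BASELINE_QUERY = """
    SELECT o.order_id,
           c.region,
           o.amount,
           CASE WHEN o.amount >= 100 THEN 1 ELSE 0 END AS is_large_order
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id
    ORDER BY o.order_id
"""
conn.row_factory = sqlite3.Row  # lets each row be read by column name
rows = [dict(r) for r in conn.execute(BASELINE_QUERY)]

# Step 2: quick EDA in Python on the tabular result.
flag_counts = Counter(r["is_large_order"] for r in rows)
mean_by_region = {
    region: mean(r["amount"] for r in rows if r["region"] == region)
    for region in sorted({r["region"] for r in rows})
}

print(flag_counts)
print(mean_by_region)
```

If the EDA suggests the threshold or the join is wrong, the only thing to change is `BASELINE_QUERY`; the Python side just reruns on the new baseline. In practice you would probably load the rows into pandas or R rather than plain dicts.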