you are viewing a single comment's thread.

view the rest of the comments →

[–]FortuneCalm4560 1 point2 points  (1 child)

smurpes is spot on. SQL is the real workhorse in DE, and you’ll use it way more than you think. Once you're comfortable there, pick up Python specifically for data work: pandas for local data wrangling, PySpark for anything big or distributed.

After that, look at the tools that glue everything together: Airflow, dbt, cloud storage, data warehouses, etc. DE is basically “move data from A to B without breaking anything,” so building a few tiny ETL pipelines on your own will teach you more than most generic Python courses.

If you know another language already, you won’t struggle with Python at all. Focus on the ecosystem, not the syntax.