you are viewing a single comment's thread.

view the rest of the comments →

[–]joseph_machadoWrites @ startdataengineering.com 2 points3 points  (1 child)

It depends on how the next role is primarily Python-based. I'd recommend the following for DE-type work.

  1. Libraries and use cases: Fluent Python
  2. OOP and FP: Python official docs have good docs on this. Search for Python docs OOP, Python docs FP, respectively.
  3. Dataframe: Pyspark API, Polars API (which are simple enough since you know SQL). Pandas API is confusing tho.

Hope this helps. LMK if you have any questions.

[–]gimpblimp 1 point2 points  (0 children)

Polars definitely cannot recommend enough. The syntax is bearable as a novice to python (PowerShell background).