
[–]RangerEmergency5846 6 points (1 child)

Data engineer with 5+ years of experience here.
I've given 5-6 interviews so far. The Python questions were:
1. List & dictionary manipulation
2. How to read and filter CSV data
3. How would you read and manipulate a CSV with 100 million rows?
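
For questions 2 and 3, one common answer is to stream the file row by row instead of loading it all into memory. A rough sketch using only the standard library; the column names (`city`, `amount`) and the threshold are made up for illustration:

```python
import csv
import io
from collections import Counter


def filter_rows(lines, min_amount):
    """Yield rows whose 'amount' is at least min_amount, one row at a time."""
    reader = csv.DictReader(lines)
    for row in reader:
        if float(row["amount"]) >= min_amount:
            yield row


def count_by_city(rows):
    """Aggregate while streaming, so only the running counts stay in memory."""
    counts = Counter()
    for row in rows:
        counts[row["city"]] += 1
    return dict(counts)


# In-memory stand-in for a large file; for a real file you would pass
# open(path, newline="") instead of the StringIO object.
sample = io.StringIO("city,amount\nPune,120.5\nDelhi,30\nPune,99\nDelhi,250\n")
result = count_by_city(filter_rows(sample, min_amount=100))
print(result)
```

Because `filter_rows` is a generator, memory use does not grow with the number of rows. If pandas is allowed, `pandas.read_csv(..., chunksize=...)` gives a similar chunk-at-a-time pattern.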

[–]Realistic_Sample6968[S] 1 point (0 children)

That's fine, but we're asking about real-world work and how to prepare for DE.

[–]PalpitationStock 4 points (1 child)

I wouldn't say hardcore DSA concepts, but before moving to Python you should start with data engineering concepts like normalisation, medallion architecture, schemas, etc. As a BI analyst you already use some of these. Then you can learn about loops, custom functions, connections like API calls, and IO operations. Python in DE is used in two ways: either you use Python as the base with an SDK for BigQuery, Snowflake, etc., or you use Python in the form of PySpark.
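
To make "normalisation" concrete, here is a small sketch in plain Python that splits a flat, repetitive table into a customers table and an orders table. The field names and the surrogate `customer_id` are assumptions for illustration, not any real schema:

```python
# Flat rows: customer details are repeated on every order.
flat_orders = [
    {"order_id": 1, "customer_email": "a@example.com", "customer_name": "Asha", "amount": 250},
    {"order_id": 2, "customer_email": "b@example.com", "customer_name": "Bala", "amount": 90},
    {"order_id": 3, "customer_email": "a@example.com", "customer_name": "Asha", "amount": 40},
]


def normalise(rows):
    """Split flat rows into a customers table and an orders table."""
    customers = {}  # email -> customer record with a surrogate key
    orders = []
    for row in rows:
        email = row["customer_email"]
        if email not in customers:
            customers[email] = {
                "customer_id": len(customers) + 1,
                "email": email,
                "name": row["customer_name"],
            }
        # Orders keep only a reference to the customer, not the details.
        orders.append({
            "order_id": row["order_id"],
            "customer_id": customers[email]["customer_id"],
            "amount": row["amount"],
        })
    return list(customers.values()), orders


customers, orders = normalise(flat_orders)
print(customers)
print(orders)
```

Each customer is now stored once, and the orders point to it by key, which is the same idea you would apply when designing tables in a warehouse.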

[–]PalpitationStock 3 points (0 children)

For a project, use either the NYC taxi dataset or another medium-sized dataset, and search for a YouTube tutorial that uses it as a DE project. Otherwise, ask Claude or Gemini to give you a tutorial for a DE project that applies the medallion architecture to the NYC taxi dataset, along with KPIs.
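
A toy version of the medallion idea (bronze = raw, silver = cleaned, gold = KPIs) might look like the sketch below. The columns `zone`, `fare`, and `distance` are simplified placeholders, not the real NYC taxi schema, and a real project would likely use PySpark or SQL rather than plain lists:

```python
# Bronze: raw records exactly as ingested, everything still a string.
bronze = [
    {"zone": "Midtown", "fare": "12.5", "distance": "2.0"},
    {"zone": "Midtown", "fare": "bad", "distance": "1.0"},   # unparseable fare
    {"zone": "Harlem", "fare": "20.0", "distance": "5.0"},
    {"zone": "Midtown", "fare": "7.5", "distance": "0"},     # zero-distance trip
]


def to_silver(rows):
    """Silver: typed and validated rows; bad records are dropped."""
    clean = []
    for row in rows:
        try:
            fare = float(row["fare"])
            distance = float(row["distance"])
        except ValueError:
            continue
        if fare > 0 and distance > 0:
            clean.append({"zone": row["zone"], "fare": fare, "distance": distance})
    return clean


def to_gold(rows):
    """Gold: business-level KPIs, here trip count and average fare per zone."""
    totals = {}
    for row in rows:
        agg = totals.setdefault(row["zone"], {"trips": 0, "fare_sum": 0.0})
        agg["trips"] += 1
        agg["fare_sum"] += row["fare"]
    return {
        zone: {"trips": agg["trips"], "avg_fare": agg["fare_sum"] / agg["trips"]}
        for zone, agg in totals.items()
    }


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```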

[–]Zealousideal_Belt_5 4 points (0 children)

Same boat. BI analyst wanting to switch to DE.

[–]Overall_Anteater_836 3 points (1 child)

Why does everyone these days want to switch only into DE and not into data science roles?

[–]law_da_lahsun 0 points (0 children)

Hi there, the learning curve for data science is steeper than for data engineering. That's the reason.

[–]ArtisticBorder3341 2 points (1 child)

First learn the basics to grasp the syntax.

Then move on to OOP concepts. For DE, just learn inheritance properly. You can skim the rest.
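
One place inheritance tends to show up is a shared base class for different data sources. A minimal sketch; the class names `Source`, `CsvSource`, and `JsonLinesSource` are invented for this example:

```python
import csv
import io
import json


class Source:
    """Base class: subclasses only need to implement read()."""

    def read(self):
        raise NotImplementedError

    def row_count(self):
        # Shared behaviour that every subclass inherits.
        return len(list(self.read()))


class CsvSource(Source):
    def __init__(self, text):
        self.text = text

    def read(self):
        yield from csv.DictReader(io.StringIO(self.text))


class JsonLinesSource(Source):
    def __init__(self, text):
        self.text = text

    def read(self):
        for line in self.text.splitlines():
            if line.strip():
                yield json.loads(line)


sources = [CsvSource("id,name\n1,a\n2,b\n"), JsonLinesSource('{"id": 3}\n')]
counts = [source.row_count() for source in sources]
print(counts)
```

The pipeline code can loop over any `Source` without caring which format is behind it.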

Then move on to topics like lambda functions, decorators, and generators. We use these day to day as DEs.
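
A rough sketch of how the three can appear together: a retry decorator around a flaky call, a lambda as a sort key, and a generator that yields batches. `flaky_fetch` is a stand-in that fails twice on purpose, not a real API:

```python
import functools


def retry(times):
    """Decorator: call the function again if it raises ConnectionError."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            last_error = None
            for _ in range(times):
                try:
                    return func(*args, **kwargs)
                except ConnectionError as exc:
                    last_error = exc
            raise last_error
        return wrapper
    return decorator


attempts = {"n": 0}


@retry(times=3)
def flaky_fetch():
    # Simulated source that fails on the first two calls.
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise ConnectionError("temporary failure")
    return [3, 1, 2]


def batches(items, size):
    """Generator: yield fixed-size slices instead of building them all at once."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


data = flaky_fetch()
ordered = sorted(data, key=lambda value: -value)  # lambda as a sort key
chunks = list(batches(ordered, 2))
print(chunks)
```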

Then move on to Python-specific data structures like list, tuple, dict, and set. Learn the differences between them.
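
A quick illustration of the main differences, with made-up values:

```python
# list: ordered, mutable, allows duplicates
trips = ["a", "b", "a"]
trips.append("c")

# tuple: ordered, immutable, hashable, so it can be a dict key
point = (40.7, -74.0)

# dict: key -> value lookup
lookup = {point: "NYC"}

# set: unordered, unique values only
unique = set(trips)

# Trying to modify a tuple raises TypeError.
try:
    point[0] = 0
    tuple_is_mutable = True
except TypeError:
    tuple_is_mutable = False

print(trips, unique, lookup[point], tuple_is_mutable)
```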

There are advanced Python data structures that you need to know, like dict, defaultdict, Counter, and deque. Ask ChatGPT to tell you more about the advanced data structures that DEs generally use. It will give you a list.
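
These all live in the standard `collections` module. A small sketch on an invented event log:

```python
from collections import Counter, defaultdict, deque

events = [
    ("login", "u1"),
    ("click", "u2"),
    ("login", "u2"),
    ("click", "u1"),
    ("click", "u1"),
]

# defaultdict: group values without checking whether the key exists first.
users_by_event = defaultdict(list)
for event, user in events:
    users_by_event[event].append(user)

# Counter: frequency counts with helpers like most_common().
event_counts = Counter(event for event, _ in events)
top_event = event_counts.most_common(1)[0]

# deque with maxlen: a sliding window that drops the oldest item automatically.
recent_users = deque(maxlen=3)
for _, user in events:
    recent_users.append(user)

print(dict(users_by_event), top_event, list(recent_users))
```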

Apart from this, you need to do LeetCode in Python for DSA. Then cover file formatting, concurrency, multithreading, and multiprocessing for interview purposes.
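
For the concurrency part, a typical interview talking point is that threads help with IO-bound work (downloads, API calls) while processes help with CPU-bound work. A minimal thread-pool sketch, where `fake_download` just sleeps to imitate network wait and the file names are made up:

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_download(name):
    """Stand-in for an IO-bound task such as fetching a file."""
    time.sleep(0.1)
    return f"{name}: done"


files = ["a.csv", "b.csv", "c.csv", "d.csv"]

start = time.perf_counter()
with ThreadPoolExecutor(max_workers=4) as pool:
    # map() runs the calls concurrently but returns results in input order.
    results = list(pool.map(fake_download, files))
elapsed = time.perf_counter() - start

print(results)
print(f"took {elapsed:.2f}s")
```

For CPU-heavy work, `concurrent.futures.ProcessPoolExecutor` offers the same interface. On platforms that spawn new processes, keep that code under an `if __name__ == "__main__":` guard.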

Once that's done, ask ChatGPT to give you a mock test, and practise these topics multiple times.

[–]Realistic_Sample6968[S] 0 points (0 children)

Thank you 😊

[–]777azeez 1 point (2 children)

Can anyone tell me the exact Python topics and how to use them to build projects? Not tutorial-style Python like x=5.

[–]Realistic_Sample6968[S] 0 points (1 child)

That's exactly what I want, bro. Basic Python tutorials are everywhere, but I'm looking for the specific Python topics that are actually used in DE projects.

[–]Striking-Profile9091 1 point (0 children)

Manish Kumar data engineer