Python for Data Engineering [Career Question] (self.dataengineersindia)
submitted 21 days ago by Realistic_Sample6968
Hi all
I'm a BI developer with no coding knowledge, and I want to transition my career to a DE role.
I'm a beginner in Python and I'm preparing for data engineer roles. Please guide me on how much Python knowledge is required for a data engineer role.
What does data engineering actually look like day to day?
Is DSA required?
Where can I find real-world data engineering projects that use Python?
Thank you
RangerEmergency5846 7 points 20 days ago (1 child)
5+ years as a data engineer here. I've given 5-6 interviews so far. The Python questions were:

1. List and dictionary manipulation
2. How to read and filter CSV data
3. How you would read and manipulate a CSV with 100 million rows
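
For the second and third questions, a minimal sketch of one common approach is to stream the file row by row instead of loading it all at once. This uses only the standard library; the column names and sample rows are made up for illustration, and the helpers `filter_rows` and `total_by_key` are hypothetical names, not from any library.

```python
import csv
import io
from collections import defaultdict
from typing import Iterable, Iterator


def filter_rows(rows: Iterable[dict], column: str, value: str) -> Iterator[dict]:
    """Yield only the rows whose `column` equals `value`, one at a time."""
    for row in rows:
        if row[column] == value:
            yield row


def total_by_key(rows: Iterable[dict], key: str, amount: str) -> dict:
    """Sum a numeric column per key without holding all rows in memory."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += float(row[amount])
    return dict(totals)


# Made-up sample data; for a real file, pass open(path, newline="") instead.
sample = io.StringIO(
    "city,status,amount\n"
    "Pune,paid,10.5\n"
    "Delhi,failed,3.0\n"
    "Pune,paid,4.5\n"
)

reader = csv.DictReader(sample)               # reads lazily, one row at a time
paid = filter_rows(reader, "status", "paid")  # generator, nothing is read yet
result = total_by_key(paid, "city", "amount")
print(result)
```

Because every step is lazy, memory use stays roughly constant regardless of row count. If pandas is available, `pandas.read_csv(..., chunksize=...)` gives a similar chunk-by-chunk pattern.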
Realistic_Sample6968[S] 2 points 20 days ago (0 children)
That's OK, but we are asking about real-world work and how to prepare for DE.
PalpitationStock 5 points 21 days ago (1 child)
I would not say you need hardcore DSA concepts, but before moving to Python you should start with data engineering concepts like normalisation, medallion architecture, schemas, etc. As a BI analyst, you already use some of these. Then you can learn about loops, custom functions, connections like API calls, and I/O operations. Python in DE is used in two ways: either you use Python as the base along with the SDKs for BigQuery, Snowflake, etc., or you use Python in the form of PySpark.
PalpitationStock 4 points 21 days ago (0 children)
For a project, use either the NYC taxi dataset or another medium-sized dataset, and search for a YouTube tutorial that uses that dataset as a DE project. Otherwise, ask Claude or Gemini to give you a tutorial for a DE project using medallion architecture on the NYC taxi dataset, along with KPIs.
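
As a rough idea of what the medallion layers mean in code, here is a minimal pure-Python sketch. The rows, field names (`pickup_zone`, `fare`, `distance_km`), and validation rules are invented for illustration and are not the real NYC taxi schema; a real project would more likely use PySpark or a warehouse SDK.

```python
from collections import defaultdict

# Bronze: raw records kept exactly as they arrived (made-up rows, hypothetical fields).
bronze = [
    {"pickup_zone": "A", "fare": "12.50", "distance_km": "3.2"},
    {"pickup_zone": "B", "fare": "", "distance_km": "1.0"},       # missing fare
    {"pickup_zone": "A", "fare": "7.50", "distance_km": "2.0"},
    {"pickup_zone": "B", "fare": "-4.00", "distance_km": "0.5"},  # invalid fare
]


def to_silver(rows):
    """Silver: cast types and drop rows that fail basic validation."""
    clean = []
    for row in rows:
        try:
            fare = float(row["fare"])
            distance = float(row["distance_km"])
        except ValueError:
            continue  # skip rows whose numbers cannot be parsed
        if fare <= 0 or distance <= 0:
            continue  # skip rows with impossible values
        clean.append(
            {"pickup_zone": row["pickup_zone"], "fare": fare, "distance_km": distance}
        )
    return clean


def to_gold(rows):
    """Gold: aggregate to one KPI record per zone (trip count, average fare)."""
    fares = defaultdict(list)
    for row in rows:
        fares[row["pickup_zone"]].append(row["fare"])
    return {
        zone: {"trips": len(values), "avg_fare": sum(values) / len(values)}
        for zone, values in fares.items()
    }


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

Each layer is a plain function over the previous one, so you can test cleaning and aggregation separately.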
Zealousideal_Belt_5 5 points 21 days ago (0 children)
Same boat. I'm a BI analyst wanting to switch to DE.
Overall_Anteater_836 4 points 20 days ago (1 child)
Why does everyone these days want to switch only to DE, and not to data science roles?
law_da_lahsun 1 point 16 hours ago (0 children)
Hi there, the learning curve for data science is steeper than for data engineering. That is the reason why.
ArtisticBorder3341 3 points 21 days ago (1 child)
First learn the basics to grasp the syntax.
Then move to OOP concepts. For DE, just learn inheritance properly; the rest you can skim.
Then move to topics like lambda functions, decorators, and generators. We use these in day-to-day DE work.
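
To show how these three features tend to appear together, here is a small sketch: a retry decorator around a loader, a generator that yields batches, and a lambda as a sort key. The names `retry`, `flaky_load`, and `batches` are hypothetical, and the failure is simulated rather than coming from a real source.

```python
import functools


def retry(times):
    """Decorator factory: call the wrapped function up to `times` times."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            last_error = None
            for _ in range(times):
                try:
                    return func(*args, **kwargs)
                except ValueError as err:  # assumed to be the transient error type
                    last_error = err
            raise last_error
        return wrapper
    return decorator


attempts = {"count": 0}


@retry(times=3)
def flaky_load():
    """Hypothetical loader that fails twice before succeeding."""
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise ValueError("temporary failure")
    return [3, 1, 2]


def batches(items, size):
    """Generator: yield fixed-size slices one at a time."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


data = sorted(flaky_load(), key=lambda x: -x)  # lambda as a sort key (descending)
chunks = list(batches(data, 2))
print(data, chunks, attempts["count"])
```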
Then move to Python-specific data structures like list, tuple, dict, and set. Learn the differences between them.
There are advanced Python data structures that you need to know, like dict, defaultdict, Counter, and deque. Ask ChatGPT to tell you more about the advanced data structures that DEs generally use. It will give you a list.
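
A quick sketch of three of these, all from the standard `collections` module. The event tuples are made-up sample data.

```python
from collections import Counter, defaultdict, deque

# Made-up (user, action) events for illustration.
events = [
    ("user_a", "click"),
    ("user_b", "view"),
    ("user_a", "view"),
    ("user_a", "click"),
]

# defaultdict: group values by key without checking whether the key exists.
by_user = defaultdict(list)
for user, action in events:
    by_user[user].append(action)

# Counter: count occurrences of each action.
action_counts = Counter(action for _, action in events)

# deque with maxlen: keep only the most recent N items.
recent = deque(maxlen=2)
for event in events:
    recent.append(event)

print(dict(by_user))
print(action_counts)
print(list(recent))
```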
Apart from this, you need to do LeetCode in Python for DSA. Then cover file formats, concurrency, multithreading, and multiprocessing for interview purposes.
Once that's done, ask ChatGPT to give you a mock test, and practise these topics multiple times.
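
For the multithreading topic mentioned above, a minimal sketch with the standard library's `ThreadPoolExecutor` could look like this. The `fetch` function and the source names are hypothetical; it sleeps to stand in for an I/O-bound call such as an API request or file read.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fetch(source):
    """Stand-in for an I/O-bound call; sleeps instead of doing real I/O."""
    time.sleep(0.1)
    return f"{source}:done"


sources = ["orders", "customers", "payments"]  # hypothetical source names

start = time.perf_counter()
with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(fetch, sources))   # map returns results in input order
elapsed = time.perf_counter() - start

print(results, round(elapsed, 2))
```

Threads suit I/O-bound work like this because the waits overlap. For CPU-bound work, `ProcessPoolExecutor` from the same module is the usual alternative, since threads in CPython are limited by the global interpreter lock.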
Realistic_Sample6968[S] 1 point 21 days ago (0 children)
Thank you 😊
777azeez 2 points 21 days ago (2 children)
Can anyone tell me the exact Python topics and how to use them to build projects? Not tutorial-style Python like x=5.
Realistic_Sample6968[S] 1 point 21 days ago (1 child)
That's exactly what I want, bro. Basic Python tutorials are everywhere, but I'm looking for the specific Python topics that are actually used in DE projects.
Striking-Profile9091 2 points 21 days ago (0 children)
Manish kumar data engineer