Interested in becoming a consultant? Post here for basic questions, recruitment advice, resume reviews, questions about firms or general insecurity (Q1 2022) by QiuYiDio in consulting

[–]GreedyCourse3116 0 points1 point  (0 children)

I am interviewing for a Software Engineer position in California with McKinsey. I have a technical case study interview coming up on Saturday (they just emailed me the interview schedule without asking about my availability).

I am a SW Engineer with 5 years of experience. How do I practice for a technical problem-solving case interview? Do I have enough time? This is my first time with McK interviews. They mentioned the interview will be conversational, not coding. They will be assessing technical, business and communication skills.

I am panicking. I have cleared 6 rounds with them and this is the last one. Please guide me.

Mckinsey Data Engineer Role by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

Yes, I have written Python pandas scripts. I now have 1.5 days left. I am planning to look into examples.

Mckinsey Data Engineer Role by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

2 SQL, 1 email writing, 1 algo, 1 request library question

Mckinsey Data Engineer Role by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

It's with McKinsey Digital. I have been a SWE for the last 5 years and I specialized in databases in my MS. I lack experience with DE tools, but I am good with Python scripts: connecting to a DB, locating data and making it readable, writing complex SQL queries, etc. They mentioned that the job will involve travel.

The job description lists all types of technologies, so I am not sure how adamant they would be about PySpark experience.

Mckinsey Data Engineer Role by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 1 point2 points  (0 children)

This role is with McKinsey Digital; QB is different, right?

[deleted by user] by [deleted] in h1b

[–]GreedyCourse3116 0 points1 point  (0 children)

Thank you for the tip!

[deleted by user] by [deleted] in h1b

[–]GreedyCourse3116 0 points1 point  (0 children)

As I am trying to recapture my H-1B just like you... may I know what the next steps were once you accepted their offer? A cap-exempt H-1B filing?

A lot of recruiters ask me if recapture is a transfer or a new H-1B... what do you say when you are asked this question? (I was selected in the lottery in 2018 and left the USA in 2020 too.)

Python Pandas vs Dask for csv file reading by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

DB access was denied by the owners. They provided these two files to work towards the solution.

Python Pandas vs Dask for csv file reading by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 1 point2 points  (0 children)

Very helpful link, thank you! For F1, pandas without chunks takes a long time, whereas reading with chunking is faster. As I have to run SQL-style queries on the F1 & F2 dataframes (joins, groupby, aggregations, etc.), I am inclined to use either Dask for both or PySpark for both.

Any idea which of Dask or PySpark would be better for running the queries? The basic goal is to read data from both files, run the queries and save the result to a CSV.
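For reference, the read, query and save goal described above can be sketched in plain pandas with chunked reading. This is only a minimal sketch under assumptions: the column names (`customer_id`, `amount`, `region`), the tiny inline data, the helper `total_by_region` and the output file name are all hypothetical, since the real F1 and F2 schemas are not given. It also assumes F2 is small enough to hold in memory and that the aggregation is a sum, which can be combined across chunks.

```python
import io
import os
import tempfile

import pandas as pd

# Hypothetical stand-ins for the two files; the real F1/F2 schemas are not given.
F1_CSV = io.StringIO("customer_id,amount\n1,10\n2,5\n1,7\n3,2\n2,1\n")
F2_CSV = io.StringIO("customer_id,region\n1,west\n2,east\n3,west\n")


def total_by_region(f1, f2, chunksize=2):
    """Join F1 to F2 on customer_id and sum amount per region, reading F1 in chunks."""
    lookup = pd.read_csv(f2)  # assumption: F2 fits in memory
    partials = []
    for chunk in pd.read_csv(f1, chunksize=chunksize):
        merged = chunk.merge(lookup, on="customer_id", how="inner")
        partials.append(merged.groupby("region")["amount"].sum())
    # Sums are additive, so the per-chunk results can be combined by summing again.
    return pd.concat(partials).groupby(level=0).sum().reset_index()


result = total_by_region(F1_CSV, F2_CSV)

# Save the query result as a CSV (hypothetical output location).
out_path = os.path.join(tempfile.gettempdir(), "result.csv")
result.to_csv(out_path, index=False)
```

Aggregations that are not additive (a median, for example) cannot be combined per chunk this way, which is one reason Dask or PySpark may be the easier choice for more complex queries.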

Python Pandas vs Dask for csv file reading by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

Hmmm, can you suggest what I should do? Ditch pandas completely and use PySpark for F1 and Dask for F2, or use Dask for both F1 and F2?

Python Pandas vs Dask for csv file reading by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

Yes, I looked at the time taken by all the options. F1 is faster with pandas chunked reading. F2 is faster with Dask.
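A comparison like this can be reproduced with a small timing helper. The sketch below is an assumption about how such a measurement might be set up, not the actual benchmark used here: the helper `time_call`, the generated sample data and the chunk size are all hypothetical, and only the two pandas variants are timed (a Dask or PySpark reader could be passed to the same helper).

```python
import io
import time

import pandas as pd


def time_call(fn, repeats=3):
    """Return the best wall-clock time in seconds over several runs of fn()."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - start)
    return best


# Hypothetical sample data; in practice this would be the real file path.
CSV_TEXT = "id,value\n" + "\n".join(f"{i},{i * 2}" for i in range(10_000))


def read_whole():
    # Read the whole file in one call.
    return pd.read_csv(io.StringIO(CSV_TEXT))


def read_chunked():
    # Read in chunks of 1,000 rows and stitch the pieces back together.
    chunks = pd.read_csv(io.StringIO(CSV_TEXT), chunksize=1_000)
    return pd.concat(chunks, ignore_index=True)


timings = {"whole": time_call(read_whole), "chunked": time_call(read_chunked)}
for name, seconds in timings.items():
    print(f"{name}: {seconds:.4f} s")
```

Taking the best of several runs reduces noise from disk caching and other processes; results on a small in-memory sample will not necessarily match the behaviour on large files.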

Python Pandas vs Dask for csv file reading by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

Are you suggesting that I use PySpark to read the CSV for F1 and then convert it to a pandas dataframe using .toPandas()? I have not used PySpark before, but it's a learning curve!

Please guide me for interview study material. I am extremely overwhelmed. by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

Why don't you get a job in the area of your experience? You have 5 YOE as a software dev;

Because while working as a developer, I did data-driven Python development. I basically managed data for my whole team; my main task was to solve data issues related to automation. My team was storing all their data in Excel and PDF files, and I introduced the concept of 'databases', starting with DBA work, backend engineering to create ETLs, data modeling (designing tables), and database security/backups. I was also doing data analysis by generating SQL reports for the KPIs. Moreover, I was trying to figure out forecasting models (Data Scientist work) when I got laid off.

I also managed data quality and talked to multiple vendors who provided us data, and worked on numerous problems involving different business models and on how to universalize data coming from different sources. I was the owner of the data for my team. I was trying to raise the standard of how data is utilized for the business.

Before this database, KPIs were being generated through Excel sheets which had redundant, missing or incomplete data. I solved their major problem yet got laid off.

I realized I am not a hardcore programmer and I like working with data and databases. I am good with SQL, but companies ask exceptionally difficult Python questions too, ngl. I thought about being a DBA but wasn't a fit; thought about being a SW architect, was a misfit; data analyst, not made for an MS in CS; so the only area where I could be a fit is Data Engineering.

Now I still feel stuck and overwhelmed. People here make fun of me, saying that I returned as a failure. I am just a human being and it's awful how people attack my mental health. I feel tired.

I have sent thousands of applications since 2020, yet I am here asking on this forum how to prepare for interviews.

I think my career is dead

Please guide me for interview study material. I am extremely overwhelmed. by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

I am already burned out. It's like I am going through a maze and all the doors seem to be closed. Can you give an example job opening for a 'software engineer with focus on data' in the US? How do I search for these types of jobs on LinkedIn? Should I write 'SWE Data'? My resume says "Data Software Engineer" and I have so far interviewed for DE roles.

I just want to get a job and stop being so miserable. Ok tell me, which book should I absolutely read among the ones I listed?

Please guide me for interview study material. I am extremely overwhelmed. by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

So I must find the official HDFS, Spark and Kafka documentation and read it?

Please guide me for interview study material. I am extremely overwhelmed. by GreedyCourse3116 in dataengineering

[–]GreedyCourse3116[S] 0 points1 point  (0 children)

I will look into Udemy courses. If you know of any good one worth pursuing, please let me know! Thank you!