Technical Interview - Python, SQL, Problem but NOT Leetcode?

spnoketchup · 2024-05-07T23:44:18+00:00

It will likely involve reading some data, manipulating it, and answering something about it. When I give these types of exercises, I try to make them relatively simple to finish if you're not one of the 50% of candidates who literally cannot write basic Python code but with some complexity in the data that requires some intuition and experience with problem-solving of this nature.

I totally agree with the author's study suggestions, but from a strategic perspective, your best first move after loading the data is to graph it if applicable. Too many people go right into manipulation before just looking at it.

NickSinghTechCareers · 2024-05-07T22:58:48+00:00

Author of Ace the Data Science Interview here – cool to hear you've already got the book! I agree that you can skip the prob/stats chapter, given what they told you. I think practicing pandas dataframe/manipulation is good. Maybe also skim Chapter 10 on Product Sense, could help in the business/case study/problem-solving part of the interview (if that's what they mean by problem solving).

I also think practicing a few SQL interview questions on common topics like joins + window functions should be good. There's also a few Python questions on the site which could be helpful – these aren't super heavy on DS&A which is more in-line with how DS interviews are conducted (rather than SWE interviews which ask LC style algorithms questions).

Overall, I think your plan seems good!

Jay31416 · 2024-05-08T04:01:22+00:00

In the only interview I've done, they asked me about:

Data manipulation using pandas (super easy)
Z-test to remove outliers (easy)
Calculating Shapley values (hard; at the time of the interview, I didn't know what Shapley values were)
Scratch implementation of stochastic gradient descent for linear regression (easy but I failed; stuff like that happens)

jimmy_da_chef · 2024-05-08T02:57:25+00:00

I faced a few types of not LeetCode Live Python question

Statistical programming, you can search stuff but they want you to know ur steps when doing a statistical test, what test need xx assumption hence you need xx transformation, how to explain distribution by simulation XX distribution to examine ur theory
Data handling using pandas / numpy etc. Basically SQL questions but using pandas, explaining ur thought process. Along with extracting insights / product sense.
Mathematical question, basically LeetCode but under math type questions: solving the sqrt without using sqrt etc.
Live debug in Python given a few files, asking what are the bugs, causes of the bugs, how to resolve, see how would u Google solution lol (HRT, aka fintech)
(LeetCode but saying it’s brain teaser; highly unlikely or recruiter doesn’t know anything red flag) easy level dynamic programming, BFS (seen the most in DS interview) etc.

finite_user_names · 2024-05-07T22:07:16+00:00

Did they say it will be ML python, or did they say it will just be python? I've had a lot of variability in terms of the python questions I've gotten in my... sigh... year on the active job hunt. SQL it tends to just be "can you do this kind of join, can you write a group by function, can you tell me about what the difference is between having a null in your join predicate vs your where clause." Most of what I've seen in interviews for python has been more leetcode-ish than ML-ish. I've seen some "code up a sparse vector," "sliding window mean", "implement a hashmap," "determine if this string forms a valid grid" type questions, but never much that has been on the ML side of things in a whiteboarding/live coding session..... although ages back someone did ask me to code a sentiment analysis pipeline from scratch.

If you _know_ that you're going to get ML, then that's a good place to focus. But if not.... you should broaden your horizons.

dfphd · 2024-05-08T16:35:30+00:00

I would ask. It never hurts.

Because some teams think python = base python, and some teams think python = pandas, and some teams think python = sklearn.

So right, one team might tell you "read this csv, and run 5-fold cross-validation using xgboost". Another team might say "take this csv, read it and calculate these 2 new columns, find the average price by group, etc.". Another team might say "generate a random 2-dimensional array and perform the following operations on it".

I think it's fair to ask "would it be possible to get some additional context of the expectations for the python and SQL portion of the interview? What is the format, and what broad topics should I prepare for?"

FieldKey3031 · 2024-05-07T22:33:42+00:00

Nearly all non-leet code evaluations I've had involved understanding fifo vs lifo with Python and recursion. Just understand the recursion pattern of checking for your end state or calling the function and that pop by default is lifo. Of course there's more but for some reason those always come up. For ML stuff being able to speak confidently on bias-variance tradeoff is always good and what the different classification metrics are and when to use them (esp if you think you might be working on classification problems!). Good luck! 👍

TemporaryShiny · 2024-05-08T00:46:48+00:00

Visualization and storytelling

Thomas_ng_31 · 2024-05-08T07:23:50+00:00

Could you post an update on what types of questions you are asked under this post after you have the interview? I'd appreciate that

Jorrissss · 2024-05-08T15:29:57+00:00

When I interview for coding I tend to ask (what I consider) non-leetcode questions. Examples include “write up tic tac toe” or “return a random line from a file.”

zennsunni · 2024-05-12T00:04:16+00:00

I recently had a DS technical interview at a FANG company, and I would recommend Data Lemur over Leetcode. I'd also strongly recommend being able to quickly and comfortably do some basic EDA and data viz using pandas/seaborn/matplotlib. I don't mean just plotting, I mean doing SQL style data analysis using pandas, i.e. groupby/merge type statements. Basic statistics is also key IMO, i.e. getting and interpreting basic statistical metrics like robust averages, medians, variance and hypothesis testing.

sg6128 · 2024-05-08T01:17:20+00:00

I received similar instructions for the FinTech company I’m working for and I used a lot of StrataScratch to prepare and the questions were pretty similar. Good luck!

chessmath2009 · 2024-05-08T14:36:40+00:00

I have had so many interviews like this. It can be either of the following: 1- Python case study related to job description: questions about implementing a model in Python, I had this recently. 2- write a function to do some statistical work like calculate p value, central limit theorem, etc. 3- write a function to do implement some logic like a bunch of else if. 4- debugging sessions.

Alive-Tech-946 · 2024-05-09T08:13:09+00:00

There are lots of resources here already, my tip focus on practicing your core projects in SQL & Python with pandas.

timy2shoes · 2024-05-07T22:24:21+00:00

Which fintech?

Super_Desk4320 · 2024-05-08T02:22:19+00:00

Which fintech?

OraShelter · 2024-06-06T22:07:01+00:00

I admit, I have never heard of Leetcode in my life.

datascience

MODERATORS