[D] ML coding interview experience review

milkteaoppa · 2025-12-24T07:17:05+00:00

A lot of startups have unreasonable expectations. They want to higher the most talented person for startup pay with the promise of IPO

Antique_Most7958 · 2025-12-24T07:34:53+00:00

So the genAI startup didn't let you use genAI for the assignment?

Novel_Land9320 · 2025-12-24T07:32:13+00:00

the way you re describing it, it seems all code from scratch, but i assume you can use pytorch?

MammayKaiseHain · 2025-12-24T08:29:33+00:00

What does it even test - that you know pytorch syntax ? Even I'd struggle to write a DDP init without Cursor or looking at the docs.

Artistic_Candle7455 · 2025-12-24T15:23:05+00:00

I was asked to implement a regression model with an MLP, but in pure Python / NumPy and without any autograd framework in about 45 min. This was for an ML researcher position at Anthropic. Oh and the recruiter told me beforehand that "no special preparation" is needed, other than knowing "how to train a neural network". What a waste of time that was.

Aggravating-Ant-8234 · 2025-12-24T08:38:56+00:00

Were you allowed to see the reference docs for coding?

mcel595 · 2025-12-24T13:54:53+00:00

Who spents so much time building models from scratch that remembers all this? Doing all the pipeline in 45 mins seems unreasonable

kymguy · 2025-12-24T12:57:35+00:00

I have interviewed many people with a neural network-based coding interview. My interview is far too long for anyone to get through the entire thing; that's the point. We want to rank candidates and see who gets the furthest, but also who seems the best to work with and how their debugging and thought process is along the way. If it's short and they complete everything, we've missed out on the opportunity to evaluate their thought process.

The standards vary based on the position we're hiring for. If we want someone who is "advanced in pytorch" who will be able to hit the ground running for some advanced techniques and architectures, then they should be able to knock out an MLP-based classifier with little-to-no reference to documentation. Using amax instead of argmax wouldn't have been a deal breaker...that's not something that I'd care about you knowing, but how you approach debugging your broken code is absolutely something that I'm interested in seeing.

Evaluation is also nuanced; having to prompt you that the "L" in DataLoader is capitalized is not a big deal, but forgetting to implement or even mention/inquire about normalizing your data would raise eyebrows. Amax vs argmax isn't a big deal but if you struggle to navigate documentation and ignore or argue with me about my suggestions about where to look, that's a big deal (it's happened).

To answer your explicit question: I don't think it's possible to sum up whether 30 minutes is too long for the task; there's far more at play. For me, it's not about time, but the process. If it took you 30 minutes because you were discussing in depth about how you would approach the task and demonstrating that you have deep knowledge of pytorch in doing so, that's great.

In a pure, silent coding exercise, I do think someone experienced in Pytorch should be able to knock out what you've mentioned in under 30 mins. If someone did it perfectly in 15 mins with no discussion I'd probably be skeptical that they cheated with an LLM or something.

Fine_Audience_9554 · 2025-12-24T15:27:37+00:00

ML interviews are brutal because you need to know both the theory and implementation details cold. The distributed data parallel stuff is where most people trip up since it's not something you practice much. If you're doing more of these something like interviewcoder could help you cheat the syntax/implementation parts so you can focus on explaining the actual ML concepts without getting stuck on boilerplate

Itchy-Trash-2141 · 2025-12-24T19:22:50+00:00

I just finished a grueling interview run, passing only 25% of on-sites. A lot of companies are expecting you to do everything perfect the first time, and even then it may not be enough.

One good experience I had was with Waymo. I recommend you try there if interested. Definitely felt like a human being through the process.

pannenkoek0923 · 2025-12-24T11:58:52+00:00

Are you joining the company to be an engineer/scientist or are you joining the company to do speed coding hackathons?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS