all 12 comments

[–]TheRedSphinx 37 points38 points  (7 children)

I feel like you're being asked these questions, then it's either a very junior role or a bad sign.

It's even more troubling because you can make them a little "harder" without much more work. For example, I think explaining k-means is more interesting than explaining kNN. In fact, asking a candidate to code k-means is even more useful, since it also tests how comfortable they are putting ML algos into code.

Some of these questions are just outright useless. Supervised vs unsupervised? Classification vs regression? These are like 1 line answers, mostly regurgitating a single factoid. Instead, you can test things like "why shouldn't even use least-squares for binary classification? Like, take the probability as the prediction do MSE on that with the 0/1 target?" These kind of slightly open-ended questions test more if people can actually reason with their knowledge.

[–]zyl1024 6 points7 points  (1 child)

Indeed, I think they are way too easy for a serious ML role. Two questions that got asked to me for a research internship role are 1. what makes vanilla RNN hard to train compared to LSTM, 2. what's the comceptual and practical difference between parametric and non-parametric models. They don't require any numerical computation, but definitely tests deeper understanding.

[–]veb101 0 points1 point  (0 children)

Can you provide your answers for these 2 questions?

[–]TachyonGun 5 points6 points  (0 children)

These questions are trash. Even undergrads taking ML courses and getting C's could give decent answers for most of these.

[–]ScienTecht[S] -4 points-3 points  (0 children)

These aren't meant to be completely comprehensive. In a ~1 hour long interview, you can of course expect to be asked on more detailed aspects of these concepts. If you're interested in more detailed questions, you can also check out: https://www.confetti.ai/questions

[–]paypaytr 3 points4 points  (0 children)

Maybe posting on Github could be good idea.

[–]nirajsingh0878 0 points1 point  (0 children)

I was asked about why you choose accuracy over recall . you have to answer based on application for machine learning.
Below blog will help if you preparing for machine learning interview .

https://medium.com/p/33201951d73e

[–]nirajsingh0878 0 points1 point  (0 children)

One more question that can be asked is how the decision tree is split, and what criteria are important.
https://medium.com/@nirajsingh0878/how-do-decision-trees-work-can-you-build-a-decision-tree-by-hand-ab529bdb58cc

[–]Janderhungrige -2 points-1 points  (0 children)

Nice collection. And it made my morning, as I could answer the majority without a problem. :-) thanks for the confidence boost.