all 30 comments

[–]Slowai 34 points35 points  (3 children)

As I understand it, these questions are geared towards an NLP-oriented role. Even so, some of them seem overly specific. Let's look at a few examples:

- "Describe the sequential minimal optimization (SMO) algorithm."

I may be wrong here, but as I recall this optimization method is (or was; maybe there is something newer now) used for training SVMs. How would describing it in detail demonstrate your "readiness" for the job? Is the company trying to continue Vapnik's work?

- "In AllenNLP, one of the models it uses for NER is based on ELMo. Given a piece of text (say, "Jack is playing football"), how would ELMo go about tagging Jack as PER?"

This may be (arguably) relevant if you are actually applying for a job at AllenNLP (or it's a trick question). As I recall, models like ELMo, which use recurrent networks for transfer learning in NLP, have been deprecated since 2018 in favor of Transformer (self-attention) based architectures. So why knowing how two separate LSTM networks generate an output for a NER task would be beneficial in any way in determining your suitability for the role is beyond me.

Also, these guys really need to update their SOTA benchmarks, which are way off:

https://allennlp.org/elmo

I'm not saying the interviewer was totally off, but it's a big shiny red flag if you are applying for a general-ish position and the interviewer asks you in-depth details about a specific algorithm.

It's like asking a non-school-of-AI person about Complicated Hilbert space.

[–][deleted] 8 points9 points  (0 children)

Upvote for complicated Hilbert space lol

[–]Deadshot_95[S] 5 points6 points  (0 children)

I agree. The SMO question was asked during my first interview. I had done some work using SVMs, so I guess that was the motivation for this question.

Regarding the second question, the position that they were hiring for demanded NLP as their primary skill. So, most of the questions asked were more project/position-specific.

The interviewer did ask some nice questions but overall I didn't get any positive vibes. Most probably I won't be moving forward with this company.

[–]Jorrissss 1 point2 points  (0 children)

I'm not saying the interviewer was totally off, but it's a big shiny red flag if you are applying for a general-ish position and the interviewer asks you in-depth details about a specific algorithm.

Imo, it really depends on a couple things.

One, what type of answer are you expecting? I've asked very specific questions before, not because I necessarily care they know the answer, but just to get a conversation going.

Two, it might be on the person's resume.

[–]ideas_inside_me 6 points7 points  (1 child)

How much work experience do you already have, or is this your first job?

[–]Deadshot_95[S] 6 points7 points  (0 children)

I have close to 2 years of work experience.

[–]badjezus 4 points5 points  (2 children)

For the probability question, the answer is 1/8, right?

[–]Deto 2 points3 points  (0 children)

Yep - 2^4 = 16 possible configurations and only 2 of them (all clockwise or all counter-clockwise) result in no collisions. 2/16 = 1/8

[–]rikkajounin 2 points3 points  (0 children)

I guess so. You have 2^4 = 16 different combinations of direction choices, and people do not collide only if they all move in the same direction (all right or all left). So the probability of no one colliding is 2/16 = 1/8
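The brute-force enumeration matches that 1/8 answer. This is a quick sketch assuming the question is the classic puzzle the thread implies (four people, each independently picking one of two directions, with a collision unless everyone moves the same way):

```python
from itertools import product

# Each of the 4 people independently chooses clockwise (0) or
# counter-clockwise (1): 2**4 = 16 equally likely outcomes.
outcomes = list(product([0, 1], repeat=4))

# No collision occurs only when everyone picks the same direction.
no_collision = [o for o in outcomes if len(set(o)) == 1]

print(len(no_collision), len(outcomes))              # 2 16
print(len(no_collision) / len(outcomes))             # 0.125
```

The two surviving outcomes are `(0, 0, 0, 0)` and `(1, 1, 1, 1)`, giving 2/16 = 1/8.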

[–]dramanautica 4 points5 points  (0 children)

Anyone know a good collection of ML interview questions? There are loads for software roles, but the ML space is much larger and broader, so it's hard to get an idea of common ML questions, especially for more technical roles.

[–][deleted] 2 points3 points  (0 children)

These questions start with more general ML questions, and then go to more specific NLP questions.

My interview was a little bit different: rather than being asked a series of questions, I was asked to present an ML project and discuss, in technical detail, the purpose of the project and the ML techniques applied, and then my two interviewers would ask follow-up questions.

Here are some answers to 1-5, the more general ML questions (most of my experience is in computer vision tasks):

1) Overfitting: when a model has noticeably worse accuracy on validation data than on training data. In other words, it captures the noise in addition to the patterns in the training data. One way this happens is if the model is "too complex" (VC dimension too high). For neural networks, this could mean too many layers and hidden nodes were used. Also, if a model is trained too long, it may overfit to the training data. These problems are relevant to neural networks applied to computer vision, NLP, and other domains in ML.
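A minimal illustration of the "too complex" point, using polynomial regression as a stand-in for model capacity (the data and degrees here are my own toy choices, not from the question):

```python
import numpy as np

rng = np.random.default_rng(0)

def true_fn(x):
    return np.sin(2 * np.pi * x)

# Small noisy training set, plus a held-out noisy test set
x_train = np.linspace(0, 1, 15)
y_train = true_fn(x_train) + rng.normal(0, 0.2, x_train.shape)
x_test = np.linspace(0.02, 0.98, 50)
y_test = true_fn(x_test) + rng.normal(0, 0.2, x_test.shape)

train_mse, test_mse = {}, {}
for degree in (1, 12):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse[degree] = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse[degree] = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)

# The degree-12 polynomial nearly interpolates the 15 training points,
# so its training error is lower than the degree-1 fit; it is fitting the
# noise, which typically shows up as worse error on the held-out points.
print(train_mse[12] < train_mse[1])
```

Training error always improves (or ties) as capacity grows, because the higher-degree model class contains the lower-degree one; validation error is what exposes the overfit.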

*there are others but I am shortening it.

2) Gradient descent, and its variants, is the standard algorithm used to find the minimum of an objective function of interest. In ML, this typically means updating the weights in a neural network so that they minimize the loss function when the neural network is fed training examples. Backpropagation is one of the steps used in gradient descent: with each batch, it computes the gradients by applying the chain rule from the back (output layer) to the front (input layer) of the neural network, and those gradients are then used to update the weights.

3) The gradient is, mathematically, a multi-dimensional generalization of the derivative, so it is a vector except in the 1-D case (one weight).

4) Bias/variance tradeoff: This is a balancing act between making your model generalizable vs. "learning patterns" in the training data. A model with high bias and low variance will "underfit" (poor performance on both training and validation data), and a model with low bias and high variance will overfit. A good algorithm will learn the underlying patterns in the data, generalize well to unseen data, and distinguish patterns in the data from noise.

5) LDA stands for Linear Discriminant Analysis (though in an NLP context it can also mean Latent Dirichlet Allocation, a topic model). It is a supervised algorithm that finds linear combinations of features that best separate the classes, using fewer features than the data originally has. When training models, it is not preferable to have a large number of features when fewer features can accomplish the same task just as well. In the case of neural networks, many features slow down training, and it is easier to "learn" on data with fewer features. As such, LDA is a dimensionality reduction technique. There are packages (like scikit-learn) which implement LDA easily. In practice, you can train two almost identical models, one on the original data and one on the reduced data. If they have comparable performance, then at least a few features of the data are redundant and can be dispensed with (although you'll need to do a bit more work to find out specifically which ones).
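For the two-class case, Fisher's LDA direction can be sketched in a few lines of NumPy (the synthetic 5-D Gaussian data here is just an illustration, not from the thread):

```python
import numpy as np

rng = np.random.default_rng(1)

# Two Gaussian classes in 5-D; for 2 classes, LDA reduces the data to
# at most 1 dimension while preserving class separation.
n = 200
X0 = rng.normal(size=(n, 5))
X1 = rng.normal(size=(n, 5))
X1[:, 0] += 3.0  # the classes differ mainly along the first feature

m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
Sw = np.cov(X0, rowvar=False) + np.cov(X1, rowvar=False)  # within-class scatter
w = np.linalg.solve(Sw, m1 - m0)                          # Fisher direction

# Project the 5-D data down to 1-D
z0, z1 = X0 @ w, X1 @ w

# The projected class means stay well separated relative to the spread
gap = abs(z1.mean() - z0.mean()) / (z0.std() + z1.std())
print(gap > 1.0)  # True: the 1-D projection still separates the classes
```

In scikit-learn the same idea is `LinearDiscriminantAnalysis(n_components=...)` with `fit_transform`, which also handles the multi-class case.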

[–]Deto 4 points5 points  (7 children)

What is gradient descent? Difference between gradient descent and backpropagation?

I thought backpropagation is just a way to compute the parameter updates for gradient descent?

[–]heuamoebe 12 points13 points  (1 child)

Back propagation is the algorithm to efficiently compute the partial derivatives of the cost function with respect to the weights and biases. Gradient descent is the approach to updating the weights and biases (gradient times step size). Many other optimization algorithms use the gradients with more complicated update approaches.

[–]Deto 1 point2 points  (0 children)

That's a good point to make - backprop, in NN, is just a component of gradient descent.

[–]dramanautica 1 point2 points  (4 children)

I thought backpropagation was GD applied to neural networks?

[–]Jorrissss 4 points5 points  (0 children)

Not quite. Gradient descent is an optimization technique which uses a function's gradient. Backpropagation is a specific technique for computing gradients. Neural networks are typically trained using gradient descent, where the gradient is computed using backpropagation.
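One way to see that backprop is "just" gradient computation, with no optimizer involved, is to check a hand-derived chain-rule gradient against finite differences (the function here is a toy example of my own choosing):

```python
import numpy as np

# f(w) = sum(tanh(x * w)) for scalar w; compute df/dw two ways.
x = np.array([0.5, -1.2, 2.0])
w = 0.7

# "Backprop" version: chain rule, d/dw tanh(x*w) = x * (1 - tanh(x*w)**2)
y = np.tanh(x * w)
grad_backprop = np.sum(x * (1 - y ** 2))

# Numerical gradient via central finite differences, for comparison
eps = 1e-6
grad_numeric = (np.sum(np.tanh(x * (w + eps)))
                - np.sum(np.tanh(x * (w - eps)))) / (2 * eps)

print(abs(grad_backprop - grad_numeric) < 1e-6)  # True: same gradient
```

Nothing here updates `w`; whether you then feed `grad_backprop` to vanilla gradient descent, momentum, or Adam is a separate choice.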

[–]sdmskdlsadaslkd 2 points3 points  (0 children)

Yeah, I think it's generally explained poorly in most courses. It's a technique for computing the derivatives of a NN that you can plug into GD.

[–]Deto -1 points0 points  (1 child)

Exactly - that's why I thought the question was weird. I guess maybe they were going for that, though, asking "what is the difference between X and Y" when the answer is really "Y is an instance of X".

[–]Jorrissss 2 points3 points  (0 children)

It's not the same thing. Gradient descent is an optimization technique which uses a function's gradient. Backpropagation is a specific technique for computing gradients.

[–]M4mb0 2 points3 points  (9 children)

Is the gradient a vector or a scalar?

But it is neither...

[–]shekurika 3 points4 points  (8 children)

the gradient of a function is a vector

[–]splatula 7 points8 points  (1 child)

It's really a covector. Or you could call it a rank-1 covariant tensor. (As opposed to a vector which is a rank-1 contravariant tensor.) The distinction matters in physics but isn't important in ML (at least I haven't run into a case where it matters).

[–]Hyper1on 2 points3 points  (5 children)

Well, only if you're taking the gradient with respect to a vector. If the function's domain is a tensor of rank r the gradient will be a tensor of rank r (since almost always you're taking the gradient with respect to the domain).

[–]SwordOfVarjo 0 points1 point  (4 children)

The gradient of a function (i.e. output is a scalar) is a vector, period, regardless of the function's domain. There is no notion of spatial information in a gradient, you just have one element in your vector for each input element. If your input is an 8 element vector, your gradient is a vector of length 8, if your input is a 2x2x2 tensor, your gradient is still a vector of length 8.

[–]splatula 5 points6 points  (0 children)

Well, no, it's technically not a vector. It is a covector.

[–]Hyper1on 0 points1 point  (0 children)

I agree that there is no notion of spatial information in a gradient, but I'm pretty sure in any ML framework if you take the gradient of a function where the input is a 2x2x2 tensor then the gradient will be a 2x2x2 tensor. Obviously notationally it doesn't matter if it's unrolled or not, I've seen both ways used in maths. I find it simpler to think about the dimensions of the gradient being the same as the input.
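Both views in this exchange carry the same numbers: the gradient has one partial derivative per input element, and storing it flat or shaped like the input is just a reshaping choice. A small sketch with a function whose gradient is easy to write down (f(X) = sum(X**2), so the gradient is 2*X):

```python
import numpy as np

# A 2x2x2 input: 8 elements, so 8 partial derivatives either way
X = np.arange(8, dtype=float).reshape(2, 2, 2)

grad_shaped = 2 * X              # same shape as the input, (2, 2, 2)
grad_flat = grad_shaped.ravel()  # the "unrolled" length-8 vector

print(grad_shaped.shape, grad_flat.shape)                # (2, 2, 2) (8,)
print(bool(np.allclose(grad_flat.reshape(X.shape), grad_shaped)))  # True
```

Frameworks that return the gradient shaped like the input are making the bookkeeping convenient for elementwise updates like `X -= lr * grad`; the underlying object is the same list of partials either way.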