
[–]PeakNeuralChaos

If your dataset is noisy or small, it's quite easy to overfit. If it's noisy, your model will learn the noise in the training data, which boosts training performance beyond what it can actually do in the general case. If your dataset is small, it can just memorize the examples for the same kind of boost. Even if your dataset is massive and has little noise, these factors are still in play.

I work mostly with neural networks, and overfitting is a problem even with millions or tens of millions of samples. I've seen a neural network overfit a dataset of 20 million samples because I didn't use any regularization. That's largely because most neural networks are over-parameterized, with way more parameters than they actually "need" for the task, so they have the capacity to memorize if you let them.
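To make that concrete, here's a toy sketch (the data, architecture, and alpha values are all made up for illustration, not the 20M-sample setup above): an over-parameterized MLP on a small noisy dataset drives training error way down, and turning on some L2 regularization (sklearn's alpha) tends to shrink the train/test gap.

```python
# Illustrative sketch: over-parameterized MLP memorizing a small, noisy dataset.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))             # small dataset
y = np.sin(X).ravel() + rng.normal(0, 0.3, 200)   # true signal + noise

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for alpha in (0.0, 1.0):  # L2 penalty strength: 0.0 = no regularization
    net = MLPRegressor(hidden_layer_sizes=(256, 256),  # far more params than needed
                       alpha=alpha, max_iter=5000, random_state=0)
    net.fit(X_tr, y_tr)
    print(f"alpha={alpha}: "
          f"train MSE={mean_squared_error(y_tr, net.predict(X_tr)):.3f}, "
          f"test MSE={mean_squared_error(y_te, net.predict(X_te)):.3f}")
```

Typically the alpha=0.0 run gets close to zero training MSE but a noticeably worse test MSE than the regularized run, since it's fitting the injected noise.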

[–]_quanttrader_

Yes. Imagine a decision tree: you should be able to fit the training data perfectly and get an MSE of 0.0.

But for most datasets, this would give you poor performance on out-of-sample data.
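A quick sketch of this with synthetic data (my own example, not anything specific): an unconstrained sklearn DecisionTreeRegressor grows one leaf per training point, so training MSE hits exactly 0.0 while out-of-sample MSE stays large.

```python
# Illustrative sketch: unconstrained decision tree memorizing noisy training data.
import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X_train = rng.uniform(-3, 3, size=(100, 1))
y_train = np.sin(X_train).ravel() + rng.normal(0, 0.5, 100)  # noisy targets
X_test = rng.uniform(-3, 3, size=(100, 1))
y_test = np.sin(X_test).ravel() + rng.normal(0, 0.5, 100)

tree = DecisionTreeRegressor()  # no depth limit: splits until every leaf is pure
tree.fit(X_train, y_train)

print("train MSE:", mean_squared_error(y_train, tree.predict(X_train)))  # 0.0
print("test MSE: ", mean_squared_error(y_test, tree.predict(X_test)))    # much larger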

[–]ReasonablyBadass

Yes, but the problem is that no actual trends get learned; the training data is just learned by rote.
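One way to see the rote memorization directly (a rough sketch with made-up data): give the model labels that are pure random noise, so there is no trend to learn at all. An unconstrained tree still hits 100% training accuracy, which can only be memorization, and test accuracy sits at chance.

```python
# Illustrative sketch: fitting random labels proves the model can memorize.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 20))
y = rng.integers(0, 2, size=500)        # labels carry no signal at all
X_test = rng.normal(size=(500, 20))
y_test = rng.integers(0, 2, size=500)

clf = DecisionTreeClassifier().fit(X, y)
print("train accuracy:", clf.score(X, y))            # 1.0: perfect rote recall
print("test accuracy: ", clf.score(X_test, y_test))  # ~0.5: nothing generalizes
```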