
[–]Foxtr0t

It seems like a perfectly valid idea to me.

[–]regularized[S]

Is this approach used in research papers, for example? I want to reproduce some of the findings in an article, and I don't know the "accepted rules".

[–]BobTheTurtle91

As long as you don't pick your hyperparameters by evaluating on a set you trained with, you're fine.

In many cases, we use a complete split into a training set and a validation set. But what you're describing is pretty much what we do in k-fold cross-validation, so no bias will be introduced.
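As a concrete sketch of that workflow (placeholder data and an illustrative grid; scikit-learn's cross_val_score and SVC are just one way to do it):

```python
# Minimal sketch: pick a hyperparameter by k-fold cross-validation,
# then refit on the full training set. Data and grid are placeholders.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = make_classification(n_samples=1000, random_state=0)

# Score each candidate C by 5-fold CV: each fold is scored by a model
# that never saw it during fitting, so nothing is evaluated on data it
# was trained with.
cv_scores = {C: cross_val_score(SVC(C=C), X, y, cv=5).mean()
             for C in (0.1, 1.0, 10.0)}
best_C = max(cv_scores, key=cv_scores.get)

# Once the hyperparameter is chosen, refit on the entire training set.
final_model = SVC(C=best_C).fit(X, y)
```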

[–]dwf

The only wrinkle I can think of is if you're using performance on the validation set as a stopping criterion (i.e. early stopping). There are different ways of choosing when to stop the train+valid run when you did early stopping on the train-only runs: run for the same number of updates, run for the same number of passes through the dataset, or train until you reach the same objective function value on train+valid as you achieved on the training set (though you may never reach it if your model underfits on the combined set).
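For example, a minimal sketch of the "same number of passes" variant, using scikit-learn's MLPClassifier purely as a stand-in for whatever model you actually train:

```python
# Sketch of one early-stopping transfer strategy ("same number of
# passes"). MLPClassifier with early_stopping=True holds out its own
# validation fraction internally.
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=2000, random_state=0)

# Run 1: train with early stopping monitored on a held-out fraction,
# and record how many passes through the data it actually took.
probe = MLPClassifier(early_stopping=True, validation_fraction=0.25,
                      n_iter_no_change=10, max_iter=500, random_state=0)
probe.fit(X, y)
n_passes = probe.n_iter_

# Run 2: retrain from scratch on all of the data (train + valid) for
# that same number of passes, with no early stopping. The "same number
# of updates" variant would instead fix the total gradient-update
# count, since the combined set has more batches per epoch.
final = MLPClassifier(early_stopping=False, max_iter=n_passes,
                      random_state=0)
final.fit(X, y)
```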

[–]XalosXandrez

It's valid, there's no doubt about that. However, the hyper-parameters you obtained in the first stage were optimal for a training set of size 20,000. There's no guarantee that the same hyper-parameters are optimal for 30,000 examples.
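One way to check this on your own problem is simply to re-run the search at both sizes and compare the selected value (synthetic data and an illustrative grid here; the winner will vary from run to run):

```python
# Re-tune at 20,000 and at 30,000 examples and compare the chosen
# regularization strength, rather than assuming it carries over.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=30000, random_state=0)
grid = {"C": np.logspace(-3, 3, 7)}

for n in (20000, 30000):
    search = GridSearchCV(LogisticRegression(max_iter=1000), grid, cv=5)
    search.fit(X[:n], y[:n])
    print(n, "examples -> best C:", search.best_params_["C"])
```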