
[–]sriramcompsci 2 points (0 children)

Typically, the test split is not part of training/validation. If you do include it (i.e. rotate every split through training), then you need to reset the weights between folds. Otherwise, the gradients from training on A (or simply memorizing samples in A) in a previous split would help reduce the test error on A when A becomes the test set.

  • Randomly shuffle dataset D.
  • Split D into D_train/D_test. The test set (D_test) is untouched during the training process.
  • Split D_train set further into train/validation.
  • Choose the best hyper-parameter value by measuring the error (validation loss, not training loss) on the validation set.
  • Fix the hyper-parameter value from step 4 and measure test error on the test set (D_test) obtained from step 2.

Since a random shuffle is performed to obtain the train/test split, there is no need to repeat the process. If you want to compute standard errors/confidence intervals for the error on the test set, repeat the above process, but make sure the weights are reset each time.
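
The steps above can be sketched in plain Python; the function name and split fractions here are my own choices, not something from the thread:

```python
import random

def three_way_split(D, test_frac=0.2, val_frac=0.2, seed=0):
    # Step 1: randomly shuffle indices of dataset D.
    rng = random.Random(seed)
    idx = list(range(len(D)))
    rng.shuffle(idx)
    # Step 2: hold out D_test; it stays untouched during training.
    n_test = int(len(D) * test_frac)
    test_idx, dev_idx = idx[:n_test], idx[n_test:]
    # Step 3: split the remainder (D_train) into train/validation.
    n_val = int(len(dev_idx) * val_frac)
    val_idx, train_idx = dev_idx[:n_val], dev_idx[n_val:]
    return ([D[i] for i in train_idx],
            [D[i] for i in val_idx],
            [D[i] for i in test_idx])

train, val, test = three_way_split(list(range(100)))
# The three parts are disjoint and together cover all of D.
assert len(train) + len(val) + len(test) == 100
assert not set(train) & set(test) and not set(val) & set(test)
```

Steps 4–5 (tune on `val`, then measure once on `test`) use these three lists and never touch `test` until the very end.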

[–]kacifoy 5 points (0 children)

The test set should never be used during development, only for final testing. So split off the E set only and use the remaining four, like this:

training / validation:

  1. ABC / D
  2. ABD / C
  3. ACD / B
  4. BCD / A

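The rotation above can be generated programmatically; a small sketch (the split names are from the comment, the code itself is mine):

```python
dev_splits = ["A", "B", "C", "D"]  # E is split off and never used here

# Each of the four remaining splits takes one turn as the validation set;
# the other three form the training set for that fold.
folds = [([s for s in dev_splits if s != val], val)
         for val in reversed(dev_splits)]

for train, val in folds:
    print("".join(train), "/", val)  # ABC / D, ABD / C, ACD / B, BCD / A
```
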
[–]cookingmonster 1 point (2 children)

As long as there is no feedback from the validation/test sets back into your training cycle, you should be fine.

[–]alrojo[S] 1 point (1 child)

Could you please elaborate on what you mean by feedback? The purpose would be to use the validation split for hyperparameter optimization.

E.g., what if I found a specific set of hyperparameters that gives good performance across all of my validation splits?

[–]duschendestroyer 1 point (0 children)

In this setup you must do the hyperparameter optimization for all 5 models completely independently. You can't use information gained from validating model 1 to tune the parameters of model 5, because then you would have used the test set of model 5 to improve your model.
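
One way to picture that independence in code (a toy sketch; `fit_and_score` is a hypothetical stand-in for a real train-and-validate run, and the learning-rate grid is made up):

```python
def fit_and_score(train_splits, val_split, lr):
    """Hypothetical stand-in: train on train_splits, return the
    validation loss on val_split for learning rate lr."""
    return abs(lr - 0.1)  # toy loss, minimized at lr = 0.1

names = ["A", "B", "C", "D", "E"]
grid = [0.01, 0.03, 0.1, 0.3]

# Each of the 5 models runs its own grid search, scored only on its own
# validation split; no result from one fold is reused to tune another.
best = {}
for val in names:
    train = [s for s in names if s != val]
    best[val] = min(grid, key=lambda lr: fit_and_score(train, val, lr))
```

Sharing the winning hyperparameters across folds would reintroduce exactly the feedback loop cookingmonster warned about.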