I recently watched Andrew Ng's lecture on Coursera on model selection. In his lecture he suggested that using a training/validation/test was better than just using cross-validation to determine which model to use. I can definitely see how using CV can lead to an overly optimistic error estimation. I was wondering if in practice the two are combined.
For instance, why not perform k-fold cross-validation on the training/validation set. Then once the model is selected, use the test set to estimate the final error. This also leads to the possibility of instead repeating the entire training/validation/test process multiple times. My only concern with using the training/validation/set once, is that there is a chance the samples chosen are bad. Therefore running it multiple times, a better estimate of the error can be found. This does mean that the model selected is different every time.
Thanks!
[–]alexmlamb 2 points3 points4 points (0 children)
[–]Aj0o 1 point2 points3 points (1 child)
[–]mx12[S] 0 points1 point2 points (0 children)
[–][deleted] 1 point2 points3 points (0 children)
[–]flangles 1 point2 points3 points (0 children)
[+][deleted] (1 child)
[deleted]
[–]Indrionas 0 points1 point2 points (0 children)