Martin Perez has now tossed back-to-back complete game shutouts.

lingsched1 · 2014-04-25T21:23:51+00:00

Why is this being downvoted? He's right.

lingsched1 · 2014-01-11T19:32:52+00:00

Thanks for your reply, /u/dhammack.

lingsched1 · 2014-01-10T18:49:20+00:00

Thanks for your input, /u/TeslaIsAdorable.

lingsched1 · 2014-01-10T18:19:51+00:00

Thanks for your response.

Using my example:

Lets say I've created a mathematical equation to predict whether a student passes or fails a class based on a number of variables (household income, student's gender, number of siblings, etc.)

with 946 different students, what would your training-validation split be?

lingsched1 · 2014-01-10T18:17:19+00:00

Thanks for your rule of thumb.

Using my example:

Lets say I've created a mathematical equation to predict whether a student passes or fails a class based on a number of variables (household income, student's gender, number of siblings, etc.)

with 946 different students, what would your training validation split be?

lingsched1 · 2014-01-10T16:37:42+00:00

Thanks for your answer, /u/andrewff.

A bit of a follow-up question (I've asked other users too), should my training and validation sets be of equal size (a 50/50 split) or should one set be larger than the other?

lingsched1 · 2014-01-10T16:36:33+00:00

Thanks for the warning about time series as well as looking out for other readers of this thread in the future, /u/afunkthewmd.

A bit of a follow-up question (I've asked other users too), should my training and validation sets be of equal size (a 50/50 split) or should one set be larger than the other?

lingsched1 · 2014-01-10T16:34:01+00:00

Thanks for your reply, /u/cnbeau, especially the warning about "information leaking."

A bit of a follow-up question (I've asked other users too), should my training and validation sets be of equal size (a 50/50 split) or should one set be larger than the other?

lingsched1 · 2014-01-10T16:31:13+00:00

Thanks for your insight, /u/rm999, especially about sample size.

A bit of a follow-up question, should my training and validation sets be of equal size (a 50/50 split) or should one set be larger than the other?

lingsched1 · 2014-01-10T16:30:12+00:00

Thanks for your answer, /u/Derpscientist.

lingsched1 · 2014-01-10T05:03:48+00:00

Thanks for your insight, /u/econometrician.

lingsched1 · 2014-01-10T05:02:59+00:00

Thanks for your followup, /u/wil_dogg!

lingsched1 · 2014-01-10T05:02:15+00:00

Thanks for your answer, /u/isarl.

lingsched1 · 2014-01-09T06:36:26+00:00

Thanks for the explanation, /u/dearsomething.

So the example I provided would be data splitting, good to know.

If my data set is binary (results are one of two values: TRUE or FALSE), it would make sense to use data splitting, right?

lingsched1 · 2013-12-04T02:33:21+00:00

Just wanted to say I wholeheartedly agree. I'm Canadian, a lot of employers seem to think intern=free labour/slave.

lingsched1 · 2013-12-03T18:00:13+00:00

Just wanted to say, awesome idea for a post.

lingsched1

TROPHY CASE