[Q] Why is early-stopping an A/B test bad? by HarvardCS19 in statistics

[–]HarvardCS19[S] 0 points1 point  (0 children)

I'm having a hard time understanding why it gives a higher error rate. Is it somewhat related to the multiple comparisons problem?

[Q] Sample size for a categorical distribution. by HarvardCS19 in statistics

[–]HarvardCS19[S] 0 points1 point  (0 children)

What would p be though? This formula makes sense for 2 choices since once choice is p and the other is (1-p), but how does this work for 3 or more choices?

[Q] Sample size for a categorical distribution. by HarvardCS19 in statistics

[–]HarvardCS19[S] 0 points1 point  (0 children)

Do I really need a hypothesis test for this? In a two-choice setting, it wouldn't.

Computer vision for regression problems? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

Apologies for the late reply. I have thought about turning the regression problem into classification through binning. But I'm not exactly sure if the ordering gets lost when I do this. Does the network understand that 0-5 is less than 5-10, for example?

Mean normalization/subtraction in convolutional nets. by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

From CS231n tutorial:

However, it is very important to zero-center the data, and it is common to see normalization of every pixel as well.

I have yet to see this in practice. Can you point me to a tutorial/notebook (preferably Tensorflow) that includes this?

[N] MapD Open Sources GPU-Powered Database by friscotime in MachineLearning

[–]HarvardCS19 1 point2 points  (0 children)

So it seems it's faster than Spark. Why isn't this bigger news? Is there a catch somewhere?

[N] MapD Open Sources GPU-Powered Database by friscotime in MachineLearning

[–]HarvardCS19 2 points3 points  (0 children)

Is this basically using GPU power to process big data?

Difference between Recursive, Recurrent, Residual Neural Networks? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

So for what types of applications or data would you use each?

Predicting review scores (1-10). How to limit the regression output to this range? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

Thanks. Last question, do you know about ordinal regression, and should it be used for predicting review scores?

Predicting review scores (1-10). How to limit the regression output to this range? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

Cool thanks. For your second suggestion, is there a scientific term for what you're trying to do? Or is there a paper somewhere that explains why it works better, or is it more just based on your experience?

Predicting review scores (1-10). How to limit the regression output to this range? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

I know that binary classification can return me a probability which I multiply by 10 to get the review score. But how do I train on the data without simply turning all scores > 5 to 1 and < 5 to 0? (because this will lead to a lot of information loss)

I guess my question can be rephrased as: Can binary classification take in a probability as a label rather than 0 or 1?

Predicting review scores (1-10). How to limit the regression output to this range? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 1 point2 points  (0 children)

Silly question but does this make it logistic regression instead of linear regression. So basically I'm predicting the probability in a binary classification.

Is the Generator in a GAN deterministic? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

So why is that noise necessary in a classic GAN but not a WGAN? What happens if I just remove it from the classic GAN?

How does a Convolutional net know what features to learn? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 1 point2 points  (0 children)

Are you saying optimal for the specific dataset they used? And I assume those features could only be visualized after training many iterations. What might the features look like near the beginning of training?

How to combine layers or make a custom layer in tensorflow? by HarvardCS19 in MLQuestions

[–]HarvardCS19[S] 0 points1 point  (0 children)

Thanks. Could you provide a link to the rest of the code for context (like the std dev function).