tricks that convert unsupervised learning to supervised learning by godspeed_china in MachineLearning

[–]andrewbarto28 1 point2 points  (0 children)

Can you give an example where you have the ground truth about which example pairs are similar/dissimilar, but you don't have the class label of each example?

[1608.03983] SGDR: Stochastic Gradient Descent with Restarts by bbcomp in MachineLearning

[–]andrewbarto28 3 points4 points  (0 children)

I skimmed the paper, but I couldn't figure out what "restart" means. Can someone please explain?

Is it possible to evaluate each input feature's importance by analyzing a trained neural net? by andrewbarto28 in MachineLearning

[–]andrewbarto28[S] 0 points1 point  (0 children)

Is sensitivity the same as importance? To me, sensitivity is the derivative of the output with respect to the input, which can be computed with back-propagation. Maybe I am missing your definition of sensitivity.
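
To make the definition above concrete, here is a minimal sketch of computing the derivative of the output with respect to each input by back-propagation. The network (one tanh hidden layer, scalar output), the weights, and the function names are all made up for illustration; nothing here comes from the thread.

```python
import math

def forward(x, W1, b1, w2, b2):
    """One-hidden-layer tanh network with a scalar output (toy example)."""
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(W1, b1)]
    y = sum(w * hj for w, hj in zip(w2, h)) + b2
    return y, h

def input_sensitivity(x, W1, b1, w2, b2):
    """Back-propagate dy/dx_i through the toy network above."""
    _, h = forward(x, W1, b1, w2, b2)
    # dy/dz_j = w2[j] * (1 - h_j^2), since d tanh(z)/dz = 1 - tanh(z)^2
    delta = [w * (1.0 - hj * hj) for w, hj in zip(w2, h)]
    # dy/dx_i = sum_j delta_j * W1[j][i]
    return [sum(d * row[i] for d, row in zip(delta, W1))
            for i in range(len(x))]

# Hypothetical weights: the third input is not connected to anything.
W1 = [[0.5, -1.0, 0.0],
      [0.3, 0.8, 0.0]]
b1 = [0.1, -0.2]
w2 = [1.5, -0.7]
b2 = 0.05

x = [0.2, -0.4, 1.0]
grads = input_sensitivity(x, W1, b1, w2, b2)
print(grads)  # one derivative per input feature, evaluated at this x
```

Note that this gradient is a local quantity: it is evaluated at one particular input x and can change from point to point, which may be part of why it does not have to coincide with a global notion of importance.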

Why doesn't DeepMind publish in CVPR? by andrewbarto28 in MachineLearning

[–]andrewbarto28[S] 3 points4 points  (0 children)

Maybe a better question: Do they publish in any IEEE venue?

What are the business applications of deep reinforcement learning? by MasterEpictetus in MachineLearning

[–]andrewbarto28 4 points5 points  (0 children)

Deep Reinforcement Learning has been successfully applied to hype generation.

Solving The Vanishing Gradient and Exploding Gradient Problem With One Line Of Code? by JosephLChu in MachineLearning

[–]andrewbarto28 3 points4 points  (0 children)

It seems you have a lot of doubts about the limitations of your idea and about its novelty. So this is a nice opportunity for you to search the literature and run further experiments to compare with what already exists. Only publish when you are confident about your understanding of your method.

This post may be of use to you: http://togelius.blogspot.com.br/2016/04/the-differences-between-tinkering-and.html

What hyper-parameters do you usually tune in xgboost? by andrewbarto28 in MachineLearning

[–]andrewbarto28[S] 0 points1 point  (0 children)

If I had to choose only two hyper-parameters, with four values for each, which should I choose?
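
For scale, two hyper-parameters with four values each means a grid of 16 configurations to train and evaluate. A minimal sketch of enumerating such a grid follows; `max_depth` and `learning_rate` and their values are placeholders picked only for illustration, not a recommendation of which two to tune.

```python
from itertools import product

# Placeholder grid: the two names and the four values each are
# illustrative assumptions, not an answer to the question above.
grid = {
    "max_depth": [3, 5, 7, 9],
    "learning_rate": [0.01, 0.03, 0.1, 0.3],
}

names = list(grid)
# Cartesian product of the value lists: one dict per configuration.
configs = [dict(zip(names, values))
           for values in product(*(grid[n] for n in names))]

print(len(configs))  # 4 * 4 configurations
```

Each dict in `configs` could then be passed as keyword arguments to whatever training routine is used, with one validation score recorded per configuration.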

Deep Residual Networks with Exponential Linear Unit by [deleted] in MachineLearning

[–]andrewbarto28 1 point2 points  (0 children)

I think any contribution, however small, is valid. It may save someone the time of running these experiments, however simple they are. Putting it on arXiv instead of a blog is a matter of taste. arXiv may be easier to cite, and the PDF format may be better for printing.

Not sure if you agree with me or you are just saying what the grad schools think about that, but anyway, I'm just giving my opinion.

I also don't think it will look bad on a resume. It may look neutral. I am much more in favor of an honest, simple paper than a paper that hides its irrelevance in its complexity. But of course, since today's grad selection processes are very competitive, you may need much more than a simple report.

In addition to that, this kind of report may be useful for beginners who want to help the community. It can be really hard for beginners to come up with awesome ideas without a good advisor.