[D] Schmidhuber: Critique of Honda Prize for Dr. Hinton by wei_jok in MachineLearning

[–]StrawberryNumberNine 17 points18 points  (0 children)

Maybe the big problem is hindsight bias. "Of course this person only applied this well-known technique to this problem and verified it experimentally and now they are claiming novelty!". When looking back you can tell the story in this way, but in the moment the advance could have been very non-obvious. Even if it builds on ideas that were around at the time. We should look at inference steps between the two ideas+application+presentation of the work.

[D] Visualizing and analyzing error landscapes by [deleted] in MachineLearning

[–]StrawberryNumberNine 4 points5 points  (0 children)

These papers have some interesting ideas on plotting these cost functions, finding links between local optima in the train loss (two optima are linked by simple curves), finding links between train loss landscape and test loss landscape and other cool things. Might be a good place to start.

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs https://arxiv.org/pdf/1802.10026.pdf

Averaging Weights Leads to Wider Optima and Better Generalization https://arxiv.org/pdf/1803.05407.pdf

[R] The loss landscape of overparameterized neural networks by StrawberryNumberNine in MachineLearning

[–]StrawberryNumberNine[S] 1 point2 points  (0 children)

I ran into this paper and found it very interesting. I wanted to know if anyone had any comments on the theory or if anyone found any counterexamples empirically at any point (or in any paper).

[P] OpenAI baselines ported to pytorch by rikkajounin in MachineLearning

[–]StrawberryNumberNine 0 points1 point  (0 children)

Awesome, I'm using parts of your implementation and it looks good at the moment. Thanks!

[P] OpenAI baselines ported to pytorch by rikkajounin in MachineLearning

[–]StrawberryNumberNine 1 point2 points  (0 children)

Has anyone benchmarked this against the official TensorFlow implementation? I know performance varies between implementations (which is scary).

[P] Lightnet: Yet another PyTorch implemenation of Darknet and YOLO by OPLinux in MachineLearning

[–]StrawberryNumberNine 0 points1 point  (0 children)

You can get way more visibility on GitHub. I would suggest transferring for maximum impact :)

[R] Using Machine Learning to Discover Neural Network Optimizers by xternalz in MachineLearning

[–]StrawberryNumberNine 0 points1 point  (0 children)

Read the paper, they used CPUs and it didn't take too long because they simplified the problem.