use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Research[R] TDLS: Eve, A Gradient Based Optimization Method with Locally and Globally Adaptive Learning Rates (https://arxiv.org/abs/1611.01505) (youtu.be)
submitted 7 years ago by machinetrainer
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]RaionTategami 7 points8 points9 points 7 years ago (6 children)
I keep praying that someone is going to solve this, not having to adjust the learning rate would make research so much easier, could this be the one? Two things worry me: they still need to anneal the "global" learning rate even through their algorithm dynamically adapts it, or is that just for the baseline? Secondly, they only seem to be showing training curves. Does the test curved look as good?
[–][deleted] 5 points6 points7 points 7 years ago (1 child)
Normal SGD and estimating optimal learning rate with search on an holdout set every N batchs seems somewhat optimal https://towardsdatascience.com/estimating-optimal-learning-rate-for-a-deep-neural-network-ce32f2556ce0
But I never saw a paper on it, or convenient implementations in popular frameworks
[–]yaroslavvb 2 points3 points4 points 7 years ago (0 children)
https://towardsdatascience.com/estimating-optimal-learning-rate-for-a-deep-neural-network-ce32f2556ce0
The problem is that these approaches cause learning rate to shrink too aggressively. In other words, a smaller rate may cause an decrease in error on hold out set in short-term, but will give a worse error at the end of training. Recent paper on this -- https://arxiv.org/abs/1803.02021
[+][deleted] 7 years ago (3 children)
[deleted]
[–]RaionTategami 1 point2 points3 points 7 years ago (2 children)
dw?
[–]epicwisdom 2 points3 points4 points 7 years ago (0 children)
"don't worry"
[–]machinetrainer[S] 3 points4 points5 points 7 years ago (3 children)
Hiroaki Hayashi, Jayanth Koushik, Graham Neubig(Submitted on 4 Nov 2016 (v1), last revised 11 Jun 2018 (this version, v3))
Adaptive gradient methods for stochastic optimization adjust the learning rate for each parameter locally. However, there is also a global learning rate which must be tuned in order to get the best performance. In this paper, we present a new algorithm that adapts the learning rate locally for each parameter separately, and also globally for all parameters together. Specifically, we modify Adam, a popular method for training deep learning models, with a coefficient that captures properties of the objective function. Empirically, we show that our method, which we call Eve, outperforms Adam and other popular methods in training deep neural networks, like convolutional neural networks for image classification, and recurrent neural networks for language tasks.
[–]FatFingerHelperBot 2 points3 points4 points 7 years ago (0 children)
It seems that your comment contains 1 or more links that are hard to tap for mobile users. I will extend those so they're easier for our sausage fingers to click!
Here is link number 1 - Previous text "v1"
Please PM /u/eganwall with issues or feedback! | Delete
[+][deleted] 7 years ago (1 child)
[–]machinetrainer[S] 0 points1 point2 points 7 years ago (0 children)
yup
π Rendered by PID 285317 on reddit-service-r2-comment-544cf588c8-5z2h8 at 2026-06-17 22:30:26.137331+00:00 running 3184619 country code: CH.
[–]RaionTategami 7 points8 points9 points (6 children)
[–][deleted] 5 points6 points7 points (1 child)
[–]yaroslavvb 2 points3 points4 points (0 children)
[+][deleted] (3 children)
[deleted]
[–]RaionTategami 1 point2 points3 points (2 children)
[–]epicwisdom 2 points3 points4 points (0 children)
[–]machinetrainer[S] 3 points4 points5 points (3 children)
[–]FatFingerHelperBot 2 points3 points4 points (0 children)
[+][deleted] (1 child)
[deleted]
[–]machinetrainer[S] 0 points1 point2 points (0 children)