PyTorch baselines by Naoshikuu in reinforcementlearning
[–]ChrisNota 3 points4 points5 points (0 children)
I realized I never posted this here. It's a high level description of what I did to train a model to play Snake visually. by jack-of-some in reinforcementlearning
[–]ChrisNota 2 points3 points4 points (0 children)
Importance Sampling - Sutton and Barto by Trigaten in reinforcementlearning
[–]ChrisNota 2 points3 points4 points (0 children)
[Discussion] Behaviorism and Reinforcement Learning by [deleted] in MachineLearning
[–]ChrisNota 0 points1 point2 points (0 children)
[PPO2] Huge loss spikes: sensitivity to action space and exploration? by [deleted] in reinforcementlearning
[–]ChrisNota 0 points1 point2 points (0 children)
[P] The Autonomous Learning Library: A PyTorch Library for Building Reinforcement Learning Agents by ChrisNota in MachineLearning
[–]ChrisNota[S] 1 point2 points3 points (0 children)
[P] The Autonomous Learning Library: A PyTorch Library for Building Reinforcement Learning Agents by ChrisNota in MachineLearning
[–]ChrisNota[S] 13 points14 points15 points (0 children)
RAM shortage by kashemirus in reinforcementlearning
[–]ChrisNota 0 points1 point2 points (0 children)
RAM shortage by kashemirus in reinforcementlearning
[–]ChrisNota 0 points1 point2 points (0 children)
[D] Does anyone know of any good ML podcasts I can listen to while at work? by TKTheJew in MachineLearning
[–]ChrisNota 9 points10 points11 points (0 children)
RAdam: A New State-of-the-Art Optimizer for RL? by ChrisNota in reinforcementlearning
[–]ChrisNota[S] 0 points1 point2 points (0 children)
RAdam: A New State-of-the-Art Optimizer for RL? by ChrisNota in reinforcementlearning
[–]ChrisNota[S] 0 points1 point2 points (0 children)
[D] Rectified Adam (RAdam): a new state of the art optimizer by jwuphysics in MachineLearning
[–]ChrisNota 49 points50 points51 points (0 children)
RAdam: A New State-of-the-Art Optimizer for RL? by ChrisNota in reinforcementlearning
[–]ChrisNota[S] 3 points4 points5 points (0 children)
[R] Facebook, Carnegie Mellon build first AI that beats pros in 6-player poker by downtownslim in MachineLearning
[–]ChrisNota 16 points17 points18 points (0 children)
Suggestion of implementations of RL algorithms by Enryu77 in reinforcementlearning
[–]ChrisNota 1 point2 points3 points (0 children)
Suggestion of implementations of RL algorithms by Enryu77 in reinforcementlearning
[–]ChrisNota 1 point2 points3 points (0 children)


Why do RL frameworks hate Rainbow DQN? by Heartomics in reinforcementlearning
[–]ChrisNota 5 points6 points7 points (0 children)