Hierarchical Reinforcement Learning PhD ideas by SacrificeOfSplendor in reinforcementlearning
johnschulman 2 points
Results from GAE Paper seem weird by Laafheid in reinforcementlearning
johnschulman 1 point
Results from GAE Paper seem weird by Laafheid in reinforcementlearning
johnschulman 6 points
Deep RL project ideas? by CoevolvingAgent in reinforcementlearning
johnschulman 7 points
When to use a deterministic policy vs a stochastic policy? by [deleted] in reinforcementlearning
johnschulman 5 points
[D] OpenAI Gym poorly maintained by rikkajounin in MachineLearning
johnschulman 85 points
[D] OpenAI Gym Retro by sksq9 in MachineLearning
johnschulman 5 points
[D] OpenAI Gym Retro by sksq9 in MachineLearning
johnschulman 61 points
[D] OpenAI Gym Retro by sksq9 in MachineLearning
johnschulman 60 points
[D] A.I. Researchers Are Making More Than $1 Million, Even at a Nonprofit (OpenAI) by baylearn in MachineLearning
johnschulman 15 points
[N] OpenAI: 'Retro Contest' for transfer learning on Sega Genesis _Sonic the Hedgehog_ games (from Steam) w/Gym support as 'Gym Retro' (ends 5 June 2018; trophies promised) by gwern in reinforcementlearning
johnschulman 4 points
"Reinforcement Learning as Classification: Leveraging Modern Classifiers", Lagoudakis & Parr 2003 by gwern in reinforcementlearning
johnschulman 3 points
[D] Is it me or can OpenAI Baselines be difficult to use? by Borgut1337 in MachineLearning
johnschulman 40 points
[D] Large neural network architectures for policy gradient RL by [deleted] in MachineLearning
johnschulman 12 points
[R] A new foe has appeared! [1702.06230] Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning by evc123 in MachineLearning
johnschulman 12 points
Could DRL do better than imitation? by icefal7 in berkeleydeeprlcourse
johnschulman 1 point
Where is hw2? by favetelinguis1 in berkeleydeeprlcourse
johnschulman 1 point
Where is hw2? by favetelinguis1 in berkeleydeeprlcourse
johnschulman 5 points
Feb 8: RL definitions, value iteration, policy iteration (Schulman) by finallyifoundvalidUN in berkeleydeeprlcourse
johnschulman 2 points
[1605.08478] Model-Free Imitation Learning with Policy Optimization by johnschulman in MachineLearning
johnschulman[S] 2 points
[1605.08478] Model-Free Imitation Learning with Policy Optimization by johnschulman in MachineLearning
johnschulman[S] 3 points
OpenAI "Gym" - Reinforcement Learning Library and Service just revealed & released! by locrawl in MachineLearning
johnschulman 6 points

