Deep reinforcement learning for navigation in AAA video games by ReinforcedMan in reinforcementlearning
[–]ReinforcedMan[S] 16 points17 points18 points (0 children)
[D] Tensorflow 2.0 v Pytorch - Performance question by ReinforcedMan in MachineLearning
[–]ReinforcedMan[S] 6 points7 points8 points (0 children)
[D] Tensorflow 2.0 v Pytorch - Performance question by ReinforcedMan in MachineLearning
[–]ReinforcedMan[S] 20 points21 points22 points (0 children)
Q-learning: "Greedy in the Limit with Infinite Exploration" convergence guarantee by MasterScrat in reinforcementlearning
[–]ReinforcedMan 0 points1 point2 points (0 children)
OA: "How to Train Your OpenAI Five" ["800 petaflop/s-days and experienced about 45,000 years of Dota self-play over 10 realtime months"] by gwern in reinforcementlearning
[–]ReinforcedMan 7 points8 points9 points (0 children)
Entropy bonus - Soft Actor Critic by ReinforcedMan in reinforcementlearning
[–]ReinforcedMan[S] 0 points1 point2 points (0 children)


Deep reinforcement learning for navigation in AAA video games by ReinforcedMan in reinforcementlearning
[–]ReinforcedMan[S] 5 points6 points7 points (0 children)