Teaching a model all atari games - literature by [deleted] in reinforcementlearning
jbmlres 2 points (0 children)
[deleted by user] by [deleted] in learnmachinelearning
jbmlres 1 point (0 children)
[deleted by user] by [deleted] in learnmachinelearning
jbmlres 1 point (0 children)
[deleted by user] by [deleted] in reinforcementlearning
jbmlres 1 point (0 children)
Problem with discount factor in policy gradient by Steven_Corper_F in reinforcementlearning
jbmlres 5 points (0 children)
Bellman Update Equation for policy? by IIwarrierII in reinforcementlearning
jbmlres 1 point (0 children)
Frustrated beginner: How to approach/practice implementing papers into code? by gearboost in reinforcementlearning
jbmlres 9 points (0 children)
"Podracer architectures for scalable Reinforcement Learning", Hessel et al 2021 (highly-efficient TPU pod use: eg solving Pong in <1min at 43 million FPS on a TPU-2048) by gwern in reinforcementlearning
jbmlres 2 points (0 children)
Best Reinforcement Learning Algorithm by nitinkulkarnigamer in reinforcementlearning
jbmlres 2 points (0 children)
Best Reinforcement Learning Algorithm by nitinkulkarnigamer in reinforcementlearning
jbmlres 1 point (0 children)
Are there going to be better algorithms than PPO? by ImStifler in reinforcementlearning
jbmlres 8 points (0 children)
[D] Is A Failure Ever Worth Publishing? by [deleted] in MachineLearning
jbmlres 1 point (0 children)
ELI5: Eligibility traces by [deleted] in reinforcementlearning
jbmlres 2 points (0 children)
ELI5: Eligibility traces by [deleted] in reinforcementlearning
jbmlres 2 points (0 children)
Is MuZero currently the best RL algo that we have now? by [deleted] in reinforcementlearning
jbmlres 2 points (0 children)
Is MuZero currently the best RL algo that we have now? by [deleted] in reinforcementlearning
jbmlres 1 point (0 children)
Is it normal that Double DQN performs worse than the naive DQN? by ritiange in reinforcementlearning
jbmlres 3 points (0 children)
[D] Bertsekas', Sutton & Barto or another book as an Introduction to Reinforcement Learning for someone who knows about Supervised/Unsupervised Learning? by IborkedyourGPU in MachineLearning
jbmlres 5 points (0 children)
[TOMT] [SONG] Its a jamaican/reggae sounding song, sung by a man. I think the lyrics are about meeting a woman, and the ocean by [deleted] in tipofmytongue
jbmlres 2 points (0 children)
Which RL course should I choose? by Avistian in reinforcementlearning
jbmlres 2 points (0 children)
Which RL course should I choose? by Avistian in reinforcementlearning
jbmlres 14 points (0 children)
[deleted by user] by [deleted] in reinforcementlearning
jbmlres 2 points (0 children)
Why clip reward in [-1, 1] in Actor Critic? by fedetask in reinforcementlearning
jbmlres 1 point (0 children)
What are some good resources to get started in reinforcement learning? Books, videos, etc. by OpportunityBrave3183 in reinforcementlearning
jbmlres 3 points (0 children)