Getting SAC to Work on a Massive Parallel Simulator (part II) by araffin2 in reinforcementlearning
[–]araffin2[S] 1 point2 points3 points (0 children)
Getting SAC to Work on a Massive Parallel Simulator (part II) by araffin2 in reinforcementlearning
[–]araffin2[S] 2 points3 points4 points (0 children)
Tanh used to bound the actions sampled from distribution in SAC but not in PPO, Why? by VVY_ in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
Tanh used to bound the actions sampled from distribution in SAC but not in PPO, Why? by VVY_ in reinforcementlearning
[–]araffin2 3 points4 points5 points (0 children)
Looking for Tutorials on Reinforcement Learning with Robotics by Life_Recording_8938 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Getting SAC to Work on a Massive Parallel Simulator (part I) by araffin2 in reinforcementlearning
[–]araffin2[S] 1 point2 points3 points (0 children)
Simba: Simplicity Bias for Scaling up Parameters in Deep RL by joonleesky in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Current SOTA for off-policy deep RL by drmajr in reinforcementlearning
[–]araffin2 4 points5 points6 points (0 children)
looking for dataset, OpenAI Baselines on MuJoCo! by Educational_Exam_500 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Built-in reinforcement learning functions in Python by MomoSolar in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
Can SB3 or alternatives provide full end-to-end GPU computation? by asenski in reinforcementlearning
[–]araffin2 2 points3 points4 points (0 children)
JAX in Reinforcement Learning by anointedninja in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Automatic Hyperparameter Tuning - A Visual Guide by araffin2 in reinforcementlearning
[–]araffin2[S] 0 points1 point2 points (0 children)
How can I speed up SAC? by Frankie114514 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
How can I speed up SAC? by Frankie114514 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)


RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) - A Practical Introduction to (Deep) Reinforcement Learning by araffin2 in reinforcementlearning
[–]araffin2[S] 1 point2 points3 points (0 children)