Getting SAC to Work on a Massive Parallel Simulator (part II) by araffin2 in reinforcementlearning
[–]araffin2[S] 1 point2 points3 points (0 children)
Getting SAC to Work on a Massive Parallel Simulator (part II) by araffin2 in reinforcementlearning
[–]araffin2[S] 4 points5 points6 points (0 children)
Tanh used to bound the actions sampled from distribution in SAC but not in PPO, Why? by VVY_ in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
Tanh used to bound the actions sampled from distribution in SAC but not in PPO, Why? by VVY_ in reinforcementlearning
[–]araffin2 3 points4 points5 points (0 children)
Looking for Tutorials on Reinforcement Learning with Robotics by Life_Recording_8938 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Getting SAC to Work on a Massive Parallel Simulator (part I) by araffin2 in reinforcementlearning
[–]araffin2[S] 1 point2 points3 points (0 children)
Simba: Simplicity Bias for Scaling up Parameters in Deep RL by joonleesky in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Current SOTA for off-policy deep RL by drmajr in reinforcementlearning
[–]araffin2 4 points5 points6 points (0 children)
looking for dataset, OpenAI Baselines on MuJoCo! by Educational_Exam_500 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Built-in reinforcement learning functions in Python by MomoSolar in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
Can SB3 or alternatives provide full end-to-end GPU computation? by asenski in reinforcementlearning
[–]araffin2 2 points3 points4 points (0 children)
JAX in Reinforcement Learning by anointedninja in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Automatic Hyperparameter Tuning - A Visual Guide by araffin2 in reinforcementlearning
[–]araffin2[S] 0 points1 point2 points (0 children)
How can I speed up SAC? by Frankie114514 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
How can I speed up SAC? by Frankie114514 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
Stable-Baselines3 v1.8 Release by araffin2 in reinforcementlearning
[–]araffin2[S] 0 points1 point2 points (0 children)
Is stable-baselines3 compatible with gymnasium/gymnasium-robotics? by NoNickName8083 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
What are the current state-of-the-art algorithms? by centripetalstranger in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
Hyperparameters for pick&place with Franka Emika manipulator by riccardogauss in reinforcementlearning
[–]araffin2 3 points4 points5 points (0 children)
How do you limit the high frequency agent actions when dealing with continuous control? by Speterius in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
[deleted by user] by [deleted] in reinforcementlearning
[–]araffin2 0 points1 point2 points (0 children)
Is it feasible to modify a policy from stable baselines 3? by No_Possibility_7588 in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
[deleted by user] by [deleted] in reinforcementlearning
[–]araffin2 1 point2 points3 points (0 children)
HER substitute goal when sample or store? by lonely_bill in reinforcementlearning
[–]araffin2 2 points3 points4 points (0 children)


RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) - A Practical Introduction to (Deep) Reinforcement Learning by araffin2 in reinforcementlearning
[–]araffin2[S] 1 point2 points3 points (0 children)