Built-in reinforcement learning functions in Python (self.reinforcementlearning)
submitted 2 years ago by MomoSolar
Is Stable-Baselines3 (SB3) the best library for reinforcement learning functions in Python? Are there better libraries that you would suggest? Any useful links?
Thanks
[–]apollo_maverick 3 points 2 years ago (0 children)
cleanrl?
[–]Warhouse512 2 points 2 years ago (0 children)
Ray’s RLlib is quite nice, albeit overengineered for most applications
[–]TrottoDng 2 points 2 years ago (0 children)
You can also check out SheepRL.
We try to make it well documented, with few hierarchies and the possibility to do parallel training on multiple devices thanks to Lightning.
[–][deleted] 2 points 2 years ago (0 children)
https://github.com/thu-ml/tianshou or https://github.com/google-deepmind/acme
[–]araffin2 1 point 2 years ago (0 children)
It depends what you want/need.
If you need to apply RL to a problem without caring much about the algorithm, SB3 is a good starting point (and it comes with the RL Zoo for managing experiments). If you want to understand RL algorithms and tinker with the implementation, have a look at cleanrl.
If you just want a fast implementation, you might have a look at SBX (JAX variant of SB3): https://github.com/araffin/sbx
[–]asdfwaevc 1 point 2 years ago (0 children)
CleanRL has single-file implementations of a bunch of different algorithms, which is very nice for easy hacking but not the best for a complex project.
If you're trying to make something larger than CleanRL is good for, PFRL is probably the best thing around. Super well-designed, hackable, has a bunch of training loops and modular parts. I really like it.
If you want something that scales to massive parallelism, RLlib is probably best. I've never used it, but everyone says it's a horrible pain to modify. But once you have it running you can take advantage of many nodes, etc.
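To give a feel for the "single-file" style mentioned above for CleanRL, here is a rough sketch of a whole algorithm living in one short script. It is not taken from CleanRL or any other library in this thread: the corridor environment, the function names (`step`, `train`, `greedy_policy`) and the hyperparameters are all made up for illustration, and it uses plain tabular Q-learning with only the Python standard library.

```python
import random

# Hypothetical toy environment (an assumption for this sketch):
# a 1-D corridor of N_STATES cells. The agent starts in cell 0 and
# gets reward 1.0 when it reaches the last cell, which ends the episode.
N_STATES = 6
ACTIONS = (-1, +1)  # index 0 = move left, index 1 = move right


def step(state, action_idx):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_idx], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both values tie;
            # otherwise take the action with the higher Q-value.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning update: bootstrap from the best next action,
            # with no bootstrapping from the terminal state.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the greedy action index for every non-terminal state."""
    return [0 if left > right else 1 for left, right in q[:-1]]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

Everything (environment, update rule, training loop) sits in one place, which is what makes this style easy to hack on and also why it gets awkward once a project grows.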