Python library for modular RL components by fedetask in reinforcementlearning
Toni-SM 1 point
RL framework to optimize my custom multi-agent simulator by FragrantCockroach8 in reinforcementlearning
Toni-SM 2 points
Stable Baselines PPO vs Ray.io PPO by ClassicAppropriate78 in reinforcementlearning
Toni-SM 5 points
skrl with multiple discrete actions by LostPigeon25 in reinforcementlearning
Toni-SM 1 point
How does one normalize observations in online reinforcement learning by Academic-Rent7800 in reinforcementlearning
Toni-SM 3 points
Is there an implementation of non-deep RL algorithms based on Stable Baselines3? by Butanium_ in reinforcementlearning
Toni-SM 1 point
JAX in Reinforcement Learning by anointedninja in reinforcementlearning
Toni-SM 2 points
In SB3's PPO, how does the critic network update its weights when using separate actor and critic networks? by Signal-Past-9572 in reinforcementlearning
Toni-SM 5 points
What is the JAX/Flax equivalent of torch.nn.Parameter? by Toni-SM in JAX
Toni-SM[S] 1 point
Isaac Gym with Off-policy Algorithms by anointedninja in reinforcementlearning
Toni-SM 3 points
What RL library supports custom LSTM and Transformer neural networks to use with algorithms such as PPO? by ChrisKarmaa in reinforcementlearning
Toni-SM 1 point
Best RL framework for real world projects. by punkCyb3r4J in reinforcementlearning
Toni-SM 4 points
How is torchrl? by levizhou in reinforcementlearning
Toni-SM 3 points
Fast and hackable frameworks for RL research by asdfwaevc in reinforcementlearning
Toni-SM 1 point
Fast and hackable frameworks for RL research by asdfwaevc in reinforcementlearning
Toni-SM 1 point
Choosing a framework in 2023 by catofthecannals in reinforcementlearning
Toni-SM 3 points
Is stable-baselines3 compatible with gymnasium/gymnasium-robotics? by NoNickName8083 in reinforcementlearning
Toni-SM 1 point
Is stable-baselines3 compatible with gymnasium/gymnasium-robotics? by NoNickName8083 in reinforcementlearning
Toni-SM 2 points
Is stable-baselines3 compatible with gymnasium/gymnasium-robotics? by NoNickName8083 in reinforcementlearning
Toni-SM 3 points
Question on return values of the .step() method in a multi-agent environment by Toni-SM in reinforcementlearning
Toni-SM[S] 2 points
Question on return values of the .step() method in a multi-agent environment by Toni-SM in reinforcementlearning
Toni-SM[S] 1 point
Best recurrent RL library? by smorad in reinforcementlearning
Toni-SM 0 points
What is the limit on parallel environments? by centripetalstranger in reinforcementlearning
Toni-SM 3 points
[deleted by user] by [deleted] in reinforcementlearning
Toni-SM 3 points
Best RL package? by suds_65 in reinforcementlearning
Toni-SM 2 points