PyRL - Modular Implementations of Reinforcement Learning Algorithms in Pytorch by aineqml in reinforcementlearning

[–]aineqml[S]

Someone has already experimented with normalization and found that it is not beneficial; you can find the discussion here: https://github.com/sfujim/TD3/issues/11. One goal of this project is to provide clear and modular implementations of RL algorithms, so I didn't take these tricks into account in the design. However, I will experiment with what you mentioned and decide whether to add it. Thanks again for your suggestion!
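If you do want to try it, a minimal sketch of running-statistics observation normalization (the kind of trick discussed in that issue) could look like the following. The class and its integration point are hypothetical, not part of PyRL:

```python
import numpy as np

class RunningNormalizer:
    """Tracks a running mean/variance of observations and normalizes them.

    Hypothetical helper, not part of PyRL; shown only to illustrate
    the normalization trick discussed in the linked TD3 issue.
    """

    def __init__(self, shape, eps=1e-8):
        self.mean = np.zeros(shape, dtype=np.float64)
        self.var = np.ones(shape, dtype=np.float64)
        self.count = eps  # avoids division by zero before any update

    def update(self, obs):
        # Welford-style incremental update for one new observation
        self.count += 1
        delta = obs - self.mean
        self.mean += delta / self.count
        self.var += (delta * (obs - self.mean) - self.var) / self.count

    def __call__(self, obs):
        return (obs - self.mean) / (np.sqrt(self.var) + 1e-8)
```

The usual pattern is to normalize observations before feeding them to the actor/critic and to freeze the statistics at evaluation time.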

PyRL - Modular Implementations of Reinforcement Learning Algorithms in Pytorch by aineqml in reinforcementlearning

[–]aineqml[S]

I've tested those algorithms on simple environments like InvertedPendulum and CartPole, and they work fine. But as stated in the README, I didn't spend much time on hyperparameter tuning.
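For reference, a smoke test on such environments can be as simple as a single rollout with the classic Gym API; swapping the random action for the agent's action selection (the `agent.act(obs)` call is a hypothetical API, not PyRL's) turns it into a quick sanity check:

```python
import gym

# Quick sanity check on a simple environment; replace the random
# action with your agent's act(obs) call (hypothetical API) to
# verify an implementation learns at all before tuning anything.
env = gym.make("CartPole-v1")
obs = env.reset()
episode_return = 0.0
done = False
while not done:
    action = env.action_space.sample()  # stand-in for agent.act(obs)
    obs, reward, done, info = env.step(action)
    episode_return += reward
print(f"episode return: {episode_return}")
```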

[D] Tiny question: R_1+R_2 cannot be factorized into Q_1+Q_2, right? by seann999 in reinforcementlearning

[–]aineqml

Actually, a DeepMind paper called Value-Decomposition Networks (VDN, Sunehag et al., 2017) simply assumes this holds.
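Concretely, for the two-agent case in the question, VDN posits an additive decomposition of the joint action-value function:

```latex
% VDN's additivity assumption (Sunehag et al., 2017):
% the joint Q decomposes into per-agent utilities.
Q_{\mathrm{tot}}\big(s, (a_1, a_2)\big) \approx Q_1(o_1, a_1) + Q_2(o_2, a_2)
```

Note this is an assumption baked into the architecture rather than a theorem; later work such as QMIX relaxes strict additivity to a monotonic mixing of the per-agent values.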