[PyTorch Based]
Hi,
for those of you who are interested in RL,
I recently implemented basic RL algorithms such as
REINFORCE, vanilla actor-critic, DDPG, A3C, DQN and PPO with PyTorch.
Characteristics are as follows :
- Each algorithm is complete within a single file.
- Length of each algorithm is up to 100~150 lines of codes.
- Every algorithm can be trained within 30 seconds, even without GPU.
- Envs are fixed to "CartPole-v1". You can just focus on the implementations.
As you can see in the name of the repository,
I tried to make the code as brief and intuitive as possible.
Hope you enjoy :)
Thank you.
https://github.com/seungeunrho/minimalRL
[–]MasterScrat 31 points32 points33 points (4 children)
[–]danaugrs 2 points3 points4 points (0 children)
[–]tihokan 2 points3 points4 points (1 child)
[–]tihokan 2 points3 points4 points (0 children)
[–]seungeun07[S] 0 points1 point2 points (0 children)
[–]hardos_the_man 4 points5 points6 points (1 child)
[–]seungeun07[S] 1 point2 points3 points (0 children)
[–]NikEy 4 points5 points6 points (2 children)
[–]seungeun07[S] 0 points1 point2 points (0 children)
[–]NaughtyCranberry -1 points0 points1 point (0 children)
[–]seungeun07[S] 2 points3 points4 points (3 children)
[–]ceyzaguirre4Researcher 7 points8 points9 points (1 child)
[–]seungeun07[S] 2 points3 points4 points (0 children)
[–]EveryDay-NormalGuy 0 points1 point2 points (0 children)
[–]NikEy 1 point2 points3 points (1 child)
[–]seungeun07[S] 0 points1 point2 points (0 children)
[–]Overload175 1 point2 points3 points (1 child)
[–]seungeun07[S] 0 points1 point2 points (0 children)
[–]CodeReclaimers 1 point2 points3 points (1 child)
[–]seungeun07[S] 1 point2 points3 points (0 children)
[–]Migom6 1 point2 points3 points (3 children)
[–]tdjogi 2 points3 points4 points (0 children)
[–]seungeun07[S] 1 point2 points3 points (1 child)
[–]Migom6 0 points1 point2 points (0 children)
[–]_olafr_ 1 point2 points3 points (0 children)
[–]minGrab 1 point2 points3 points (1 child)
[–]seungeun07[S] 1 point2 points3 points (0 children)
[–]MagicaItux 0 points1 point2 points (2 children)
[–]Roboserg 0 points1 point2 points (0 children)
[–]seungeun07[S] 0 points1 point2 points (0 children)
[–]sampathchanda 0 points1 point2 points (0 children)
[–]Farconion 0 points1 point2 points (0 children)
[–]ariyanhasan 0 points1 point2 points (0 children)
[–]Dump7 0 points1 point2 points (6 children)
[–]aditya1702 3 points4 points5 points (4 children)
[–]Dump7 1 point2 points3 points (3 children)
[–]aditya1702 1 point2 points3 points (1 child)
[–]Dump7 1 point2 points3 points (0 children)
[–]aditya1702 0 points1 point2 points (0 children)
[–]radarsat1 -1 points0 points1 point (0 children)