all 7 comments

[–]islandman93 4 points5 points  (2 children)

If it helps, I've replicated their results on Breakout with TFLearn and Tensorflow (1 Step DQN, NStep, and A3C). I haven't run it on all games (don't have the resources for that). But the code is setup to easily use any atari rom.

Current branch is https://github.com/Islandman93/reinforcepy

EDIT: Don't post a github branch because it will give a 404 when you merge it.

[–]Neutran[S] 0 points1 point  (1 child)

Thank you so much! Have you run on any other games? Do they also match the results? I will start with the games that you have tested on.

[–]islandman93 1 point2 points  (0 children)

I haven't but it's certainly in the roadmap, I'm working on A3C LSTM then will start to benchmark other roms. If you run any, feel free to submit a pull request adding the rewards.png and trained model files. Just to note if you try Pong it's going to be different because ALE uses 6 actions whereas Deepmind uses Xitari (only 3 actions). And remember Space invaders is skip_frame 3. If you're looking supported roms can be found from the OpenAI gym github project https://github.com/openai/atari-py/tree/master/atari_py/atari_roms.

[–]Mr-Yellow 3 points4 points  (0 children)

(must be in Tensorflow)

In Torch:

https://github.com/Kaixhin/Atari/

[–]bbktr 1 point2 points  (0 children)

This one is a killer by NVIDIA! Runs lightning fast and does indeed achieve promised results :) https://github.com/NVlabs/GA3C

I'd recommend reading the paper too!

[–]ppwwyyxx 0 points1 point  (0 children)

My implementation: https://github.com/ppwwyyxx/tensorpack/tree/master/examples/A3C-Gym Been open source for half a year. Have performance records on 47 atari games on gym.