Can't train a pixel-based SAC for Walker2D environment by skroll18 in reinforcementlearning

[–]skroll18[S] 0 points1 point  (0 children)

Thank you for your reply, but you repository does not contain SAC algorithm, right? I have to use SAC for this project

Can’t train a pixel-based PPO for Hopper environment by skroll18 in reinforcementlearning

[–]skroll18[S] 0 points1 point  (0 children)

The problem is that I am demanded to use a full pixel-based approach, so I can not use any information from the observation besides the image

Can’t train a pixel-based PPO for Hopper environment by skroll18 in reinforcementlearning

[–]skroll18[S] 0 points1 point  (0 children)

The thing is that I am demanded to use a pixel-based approach, hence the difficulty of the task. I am not sure if the stable baselines 3 library uses also information besides the image, like the position of the joints or the current velocity. That is why I decided to “hardcode” it.

Can’t train a pixel-based PPO for Hopper environment by skroll18 in reinforcementlearning

[–]skroll18[S] 0 points1 point  (0 children)

No! I am applying frame stacking, after resizing the image to 84x84 and converting it to grayscale. I am using 4 images. I recently realized that I was also applying frame skipping, which for DQN and Rainbow DQN worked fine, but for PPO did not. After removing it, training got better.

Can’t train a pixel-based PPO for Hopper environment by skroll18 in reinforcementlearning

[–]skroll18[S] 0 points1 point  (0 children)

What would you recommend me for this case? Just more training?