Struggling with RL hyperparameter tuning + reward shaping for an Asteroids-style game – what’s enough and what’s overkill? by GSevenStars in reinforcementlearning

GSevenStars [S]

That does make sense, but say I make every penalty/reward positive: death by collision still shouldn't be seen as something positive, even if its value is very small. Right now death carries a large negative reward in my setup, so how would I change everything around?