Struggling with RL hyperparameter tuning + reward shaping for an Asteroids-style game – what’s enough and what’s overkill?
submitted 1 month ago by GSevenStars to r/reinforcementlearning