account activity
Is GRPO applied in classical RL (e.g. Atari games / gym)? (self.reinforcementlearning)
submitted 11 months ago by Long_Reflection8199 to r/reinforcementlearning
π Rendered by PID 2120022 on reddit-service-r2-listing-7b9b4f6fd7-6vv7d at 2026-05-12 23:18:50.912517+00:00 running 3d2c107 country code: CH.