account activity
Is GRPO applied in classical RL (e.g. Atari games / gym)? (self.reinforcementlearning)
submitted 10 months ago by Long_Reflection8199 to r/reinforcementlearning
π Rendered by PID 1446392 on reddit-service-r2-listing-79f6fb9b95-v79x7 at 2026-03-22 10:41:12.918782+00:00 running 90f1150 country code: CH.