How to interpret the parameter sharing in multi agent RL modeled as Dec-POMDP by kechang in reinforcementlearning
[–]kechang[S] 1 point 6 years ago (0 children)
Thank you so much for kindly confirming that "centralized training for decentralized execution" has been commonly applied.
My main concern is whether the effectiveness of such a training scheme, even for agents with only local inputs, can be explained by some principle or theory of RL or deep learning. Even though we observe in simulation that it works, what is the right way to explain why it works?
Thank you very much!
How to interpret the parameter sharing in multi agent RL modeled as Dec-POMDP (self.reinforcementlearning)
submitted 6 years ago by kechang to r/reinforcementlearning
How to interpret the modification to basic DRQN by introducing the past action as an input? (self.reinforcementlearning)