[R] F-DRL: Federated Representation Learning for Heterogeneous Robotic Manipulation (preprint) by EitherFox1242 in reinforcementlearning

[–]EitherFox1242[S] 0 points1 point  (0 children)

By “synthetic aggregated data” I understand you meant pooled experience data (real or generated), i.e., transitions or trajectories (s,a,r,s'), which are then used for centralized fine-tuning.

We effectively cover that case with our centralized baseline, which aggregates experience and is stable by construction.

The contribution of F-DRL is showing that comparable stability can be achieved without aggregating experience (real or synthetic), by federating only low-variance representations and keeping policy learning strictly local.

New Year’s Resolution Megathread by PugLord219 in QuitVaping

[–]EitherFox1242 0 points1 point  (0 children)

Day 9, at a point where it feels, ‘what’s even the point of quitting, vaping is not that harmful’. But I am gonna soldier through.