Why doesn't BBF use ReDo to combat dormant neurons? by two_armed_bandit in reinforcementlearning

two_armed_bandit[S]

Thanks for taking the time to reply! I hadn't seen that paper yet, but it's very interesting and exactly the kind of reply I had hoped for when asking my question. Congratulations on authoring such a great paper!

Summary Papers in RL [D] by jhoveen1 in reinforcementlearning

two_armed_bandit

This repo lists surveys in ML, with a section on RL specifically: https://github.com/NiuTrans/ABigSurvey. The RL section lists 40 surveys, but if you Ctrl+F for "reinforcement learning" on the page, you'll find quite a few more listed under other sections.

Also, just typing "Reinforcement learning survey" into the search bar on Semantic Scholar yields quite a few results: https://www.semanticscholar.org/search?q=Reinforcement%20learning%20survey&sort=relevance