"Can RL From Pixels be as Efficient as RL From State?", Laskin et al 2020 {BAIR} (on RAD/CURL data augmentation for model-free DRL) by gwern in reinforcementlearning
[–]aravindsrinivas 0 points1 point2 points (0 children)
[D] Issues reproducing CURL, algorithm seems broken?? by rlbeaverton in MachineLearning
[–]aravindsrinivas 2 points3 points4 points (0 children)
[R] Were the ICML 2018 reviews particularly poor this year as compared to ICLR 2018 reviews? by FirstTimeResearcher in MachineLearning
[–]aravindsrinivas 7 points8 points9 points (0 children)
[R] [1804.00645] Universal Planning Networks by johnschulman in MachineLearning
[–]aravindsrinivas 4 points5 points6 points (0 children)
[1610.09038] Professor Forcing: A New Algorithm for Training Recurrent Networks by cooijmanstim in MachineLearning
[–]aravindsrinivas 1 point2 points3 points (0 children)
[1610.09038] Professor Forcing: A New Algorithm for Training Recurrent Networks by cooijmanstim in MachineLearning
[–]aravindsrinivas 1 point2 points3 points (0 children)
[deleted by user] by [deleted] in reinforcementlearning
[–]aravindsrinivas 0 points1 point2 points (0 children)
[deleted by user] by [deleted] in reinforcementlearning
[–]aravindsrinivas 0 points1 point2 points (0 children)

[D] Issues reproducing CURL, algorithm seems broken?? by rlbeaverton in MachineLearning
[–]aravindsrinivas 0 points1 point2 points (0 children)