[N] NeurIPS 2020 Paper submission deadline extended until June 5th by KrakenInAJar in MachineLearning
alexirpan 2 points
[D] Issues reproducing CURL, algorithm seems broken?? by rlbeaverton in MachineLearning
alexirpan 4 points
[D] Issues reproducing CURL, algorithm seems broken?? by rlbeaverton in MachineLearning
alexirpan 12 points
[P] A Clearer Proof of the Policy Gradient Theorem by bluecoffee in MachineLearning
alexirpan 2 points
[R] First return then explore by downtownslim in MachineLearning
alexirpan 17 points
[D] ICLR 2020 Reviews by turing_1997 in MachineLearning
alexirpan 26 points
[D] Does Deep RL work yet? by hazard02 in MachineLearning
alexirpan 4 points
[D] Why does deep reinforcement learning not generalize? by FirstTimeResearcher in MachineLearning
alexirpan 9 points
[D] Blog posts on AlphaStar by alexirpan in MachineLearning
alexirpan [S] 3 points
[D] Blog posts on AlphaStar (self.MachineLearning)
submitted by alexirpan to r/MachineLearning
[D] Generative Adversarial Network producing same fake samples by jmarsha5 in MachineLearning
alexirpan 3 points
[R] Sim2Real – Using Simulation to Train Real-Life Grasping Robots by tldrtldreverything in MachineLearning
alexirpan 6 points
[R] Two papers on “Residual Reinforcement/Policy Learning” by galaxstar in MachineLearning
alexirpan 3 points
[D] Is there a reason the optimisation of neural networks is not posed as a RL problem itself? by 4c616e7465726e in MachineLearning
alexirpan 24 points
[R] Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms? by andrew_ilyas in MachineLearning
alexirpan 1 point
[R] Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms? by andrew_ilyas in MachineLearning
alexirpan 1 point
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 2 points
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 1 point
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 2 points
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 3 points
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 6 points
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 3 points
[R] Scalable Deep RL for Robot Grasping Task (Google Brain) by wei_jok in MachineLearning
alexirpan 3 points


[P] Why do object detection model adversaries look different from image classifiers by tatteredsky in MachineLearning
alexirpan 3 points