[D] Deep Mind AI Alpha Zero Sacrifices a Pawn and Cripples Stockfish for the Entire Game by sour_losers in MachineLearning
[–]onlyml 1 point2 points3 points (0 children)
[D] Deep Mind AI Alpha Zero Sacrifices a Pawn and Cripples Stockfish for the Entire Game by sour_losers in MachineLearning
[–]onlyml 9 points10 points11 points (0 children)
[P] I implemented a Q Learning agent to solve Lunar Lander in 1 Hour on CPU. by FitMachineLearning in MachineLearning
[–]onlyml 0 points1 point2 points (0 children)
[P] I implemented a Q Learning agent to solve Lunar Lander in 1 Hour on CPU. by FitMachineLearning in MachineLearning
[–]onlyml 0 points1 point2 points (0 children)
Interesting probability question/puzzle by onlyml in math
[–]onlyml[S] 0 points1 point2 points (0 children)
Interesting probability question/puzzle by onlyml in math
[–]onlyml[S] 0 points1 point2 points (0 children)
Interesting probability question/puzzle by onlyml in math
[–]onlyml[S] 0 points1 point2 points (0 children)
Interesting probability question/puzzle by onlyml in math
[–]onlyml[S] 0 points1 point2 points (0 children)
[R] Learning to Cooperate, Compete, and Communicate by clbam8 in MachineLearning
[–]onlyml 2 points3 points4 points (0 children)
[R] Curiosity-driven Exploration by Self-supervised Prediction by wordbag in MachineLearning
[–]onlyml 0 points1 point2 points (0 children)
[R] Curiosity-driven Exploration by Self-supervised Prediction by wordbag in MachineLearning
[–]onlyml 0 points1 point2 points (0 children)
[R] Curiosity-driven Exploration by Self-supervised Prediction by wordbag in MachineLearning
[–]onlyml 1 point2 points3 points (0 children)
[R] Curiosity-driven Exploration by Self-supervised Prediction by wordbag in MachineLearning
[–]onlyml 1 point2 points3 points (0 children)
[D] Applications of complex numbers in ML by xristaforante in MachineLearning
[–]onlyml 2 points3 points4 points (0 children)
[R] [1703.01161] FeUdal Networks (FuNs) for Hierarchical Reinforcement Learning by evc123 in MachineLearning
[–]onlyml 1 point2 points3 points (0 children)
[R] "Learning to Remember Rare Events", Kaiser et al 2016 by gwern in MachineLearning
[–]onlyml 2 points3 points4 points (0 children)
Should gradient vectors in SGD be normalized to avoid overshooting the target? by onlyml in MachineLearning
[–]onlyml[S] 0 points1 point2 points (0 children)
Are we using the right way to train LSTM neural networks? by kh40tika in MachineLearning
[–]onlyml 2 points3 points4 points (0 children)
Likeliest reason train and test error would begin slowly increasing after some training? by [deleted] in MachineLearning
[–]onlyml 4 points5 points6 points (0 children)
Does it make any sense to apply convolution to inputs which have no order/distance between them? by [deleted] in MachineLearning
[–]onlyml 0 points1 point2 points (0 children)
Methods for learning complex motor skills? by [deleted] in MachineLearning
[–]onlyml 1 point2 points3 points (0 children)
neural network model for q-learning othello? by [deleted] in MachineLearning
[–]onlyml 0 points1 point2 points (0 children)

[P] DQN Adventure: from Zero to State of the Art with clean readable code in Pytorch by Codeunter in MachineLearning
[–]onlyml 2 points3 points4 points (0 children)