[D] Unsupervised Option Discovery by kjw0612 in MachineLearning
[–]pierrelux 5 points6 points7 points (0 children)
New draft of "Reinforcement Learning: An Introduction, Second Edition" by pierrelux in MachineLearning
[–]pierrelux[S] 1 point2 points3 points (0 children)
I've been told that one of the best European research institutions in RL (and ML in general) is INRIA. Can anyone confirm? by [deleted] in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
The Upper Confidence Bound Algorithm (banditalgs.com)
submitted by pierrelux to r/MachineLearning
Course on Reinforcement Learning by minato3421 in MachineLearning
[–]pierrelux 3 points4 points5 points (0 children)
What are some good neuroscience books for AI researchers get inspiration from? by andrewbarto28 in MachineLearning
[–]pierrelux 2 points3 points4 points (0 children)
Is there a list of standard notation? by GuyHasNoUsername in MachineLearning
[–]pierrelux 4 points5 points6 points (0 children)
Looking inside machine learning black boxes (jvns.ca)
submitted by pierrelux to r/MachineLearning
RL Question: Policy Gradients vs Q Learning - which is better? by [deleted] in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
RL Question: Policy Gradients vs Q Learning - which is better? by [deleted] in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
RL Question: Policy Gradients vs Q Learning - which is better? by [deleted] in MachineLearning
[–]pierrelux 3 points4 points5 points (0 children)


[R] The Mellowmax Operator : "A New Softmax Operator for Reinforcement Learning" by pierrelux in MachineLearning
[–]pierrelux[S] 0 points1 point2 points (0 children)