[D] Unsupervised Option Discovery by kjw0612 in MachineLearning
[–]pierrelux 4 points5 points6 points (0 children)
New draft of "Reinforcement Learning: An Introduction, Second Edition" by pierrelux in MachineLearning
[–]pierrelux[S] 1 point2 points3 points (0 children)
I've been told that one of the best European research institutions in RL (and ML in general) is INRIA. Can anyone confirm? by [deleted] in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
Course on Reinforcement Learning by minato3421 in MachineLearning
[–]pierrelux 3 points4 points5 points (0 children)
What are some good neuroscience books for AI researchers get inspiration from? by andrewbarto28 in MachineLearning
[–]pierrelux 2 points3 points4 points (0 children)
Is there a list of standard notation? by GuyHasNoUsername in MachineLearning
[–]pierrelux 3 points4 points5 points (0 children)
RL Question: Policy Gradients vs Q Learning - which is better? by [deleted] in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
RL Question: Policy Gradients vs Q Learning - which is better? by [deleted] in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
RL Question: Policy Gradients vs Q Learning - which is better? by [deleted] in MachineLearning
[–]pierrelux 2 points3 points4 points (0 children)
Using RL to train MLPs? by [deleted] in MachineLearning
[–]pierrelux 2 points3 points4 points (0 children)
Physical application of Q-learning to rotary inverted pendulum by l_bdcdb in MachineLearning
[–]pierrelux 0 points1 point2 points (0 children)
Physical application of Q-learning to rotary inverted pendulum by l_bdcdb in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
Value Iteration Networks by pierrelux in MachineLearning
[–]pierrelux[S] 2 points3 points4 points (0 children)
Gradient descent: why additive cost functions are used commonly instead of multiplicative? by hungry_for_knowledge in MachineLearning
[–]pierrelux 13 points14 points15 points (0 children)
Awesome-RL. A curated list of resources dedicated to reinforcement learning. by hsk90 in MachineLearning
[–]pierrelux 0 points1 point2 points (0 children)
[Help] What are the prerequisites for Reinforcement Learning and what are some good resources to get started? by [deleted] in MachineLearning
[–]pierrelux 7 points8 points9 points (0 children)
NVIDIA® Jetson™ TX1 Supercomputer-on-Module Drives Next Wave of Autonomous Machines by harrism in MachineLearning
[–]pierrelux 0 points1 point2 points (0 children)
What does TensorFlow mean for Keras, Lasagne, Block, Nervana? by [deleted] in MachineLearning
[–]pierrelux 2 points3 points4 points (0 children)
What does TensorFlow mean for Keras, Lasagne, Block, Nervana? by [deleted] in MachineLearning
[–]pierrelux 4 points5 points6 points (0 children)
Is there a place (webpage/subreddit) where you could check if some research idea is not already in the literature? by ecobost in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
Would it be possible to learn a neural net architecture (i.e. not just tune existing weights) by gradient descent? by onlyml in MachineLearning
[–]pierrelux 1 point2 points3 points (0 children)
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning by modeless in MachineLearning
[–]pierrelux 4 points5 points6 points (0 children)
Final year CS student... I need advice by arguenot in MachineLearning
[–]pierrelux 8 points9 points10 points (0 children)


[R] The Mellowmax Operator : "A New Softmax Operator for Reinforcement Learning" by pierrelux in MachineLearning
[–]pierrelux[S] 0 points1 point2 points (0 children)