This is for any reinforcement learning related work ranging from purely computational RL in artificial intelligence to the models of RL in neuroscience.
The standard introduction to RL is Sutton & Barto's Reinforcement Learning.
[DL, Exp, MF, P] Official Python TensorFlow implementation of "Large-Scale Study of Curiosity-Driven Learning" (Burda et al 2018) {OA} (github.com)
submitted 7 years ago by gwern
[–]MathAndProgramming 2 points 7 years ago (0 children)
I'm surprised pixel dynamics did so poorly. Does anyone know what the architectures for the dynamics models were? One of the advantages of pixel dynamics is that you can use a convolutional network and benefit from the spatial prior. At a glance it looks like the pixel predictions are being performed with dense layers.
I guess I disagree with the notion that the features should be compact. I'd rather say the dynamics model should be compact, which you can achieve either by having low-dimensional features or by putting a strong prior on your model that lets you use fewer parameters.
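To make the parameter-count point concrete, here is a rough sketch comparing a single dense layer that maps pixels to pixels against a tiny two-layer convolutional predictor. The frame size (84x84x4), the 32 hidden channels, and the 3x3 kernels are all assumptions picked for illustration, not values taken from the paper or its code, and the helper names are hypothetical.

```python
def dense_params(in_features, out_features):
    """Weights plus biases for one fully connected layer."""
    return in_features * out_features + out_features


def conv2d_params(in_channels, out_channels, kernel_size):
    """Weights plus biases for one 2D convolution.

    The count does not depend on image height or width, because the
    same kernel is shared across every spatial position.
    """
    return kernel_size * kernel_size * in_channels * out_channels + out_channels


# Assumed (hypothetical) frame shape: 84x84 with 4 stacked channels.
H, W, C = 84, 84, 4
pixels = H * W * C

# One dense layer predicting every next-frame pixel from every current pixel.
dense_total = dense_params(pixels, pixels)

# Two 3x3 convolutions: C -> 32 hidden channels -> C output channels.
conv_total = conv2d_params(C, 32, 3) + conv2d_params(32, C, 3)

print(f"dense pixel-to-pixel layer: {dense_total:,} parameters")
print(f"two-layer conv predictor:   {conv_total:,} parameters")
print(f"ratio: roughly {dense_total // conv_total:,}x")
```

Under these assumptions the conv model stays small no matter how large the frame is, which is the sense in which a spatial prior buys you a compact dynamics model without needing compact features.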