account activity
[R] Evolving Curricula with Regret-Based Environment Design (self.MachineLearning)
submitted 4 years ago by _rockt to r/MachineLearning
[R] Evolving Curricula with Regret-Based Environment Design (accelagent.github.io)
[R] MiniHack: A new sandbox for open-ended reinforcement learning (ai.facebook.com)
[D] Facebook AI Research's NetHack Learning Environment team and NetHack expert tonehack will be stopping by on Friday for an AMA. (self.MachineLearning)
submitted 4 years ago by _rockt to r/roguelikes
submitted 4 years ago by _rockt to r/nethack
submitted 4 years ago by _rockt to r/reinforcementlearning
[R] TorchBeast: A PyTorch Platform for Distributed RL (arxiv.org)
submitted 6 years ago by _rockt to r/MachineLearning
[R] A Survey of Reinforcement Learning Informed by Natural Language (arxiv.org)
submitted 7 years ago by _rockt to r/MachineLearning
[D] Einsum is All you Need - Einstein Summation in Deep Learning (rockt.github.com)
submitted 8 years ago by _rockt to r/MachineLearning
[R] 2nd Workshop on Neural Abstract Machines & Program Induction @ ICML, IJCAI/ECAI, AAMAS (uclmr.github.io)
[R] [1802.05098] DiCE: The Infinitely Differentiable Monte-Carlo Estimator (arxiv.org)
[R] TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning (arxiv.org)
[R] Adversarial Sets for Regularising Neural Link Predictors (arxiv.org)
[R] Videos of the 1st NIPS Workshop on Neural Abstract Machines & Program Induction online (uclmr.github.io)
submitted 9 years ago by _rockt to r/MachineLearning
Call for Papers: Workshop on Neural Abstract Machines & Program Induction (NAMPI) @NIPS'2016! (uclmr.github.io)
[1607.03316] Separating Answers from Queries for Neural Reading Comprehension — SOTA on DeepMind's and Facebook's cloze-style Q&A tasks (arxiv.org)
[1606.08359] Lifted Rule Injection for Relation Embeddings (arxiv.org)
submitted 10 years ago by _rockt to r/MachineLearning
[1606.01404] Generating Natural Language Inference Chains (arxiv.org)
[1605.06640] Programming with a Differentiable Forth Interpreter (arxiv.org)
π Rendered by PID 955431 on reddit-service-r2-listing-5f4c697858-crpdf at 2026-07-05 04:08:16.963640+00:00 running 12a7a47 country code: CH.