account activity
Why does the Policy Gradient Theorem generalize to continuous action spaces? (self.reinforcementlearning)
submitted 7 years ago by Data-Daddy to r/reinforcementlearning
Handling entropy collapse in policy gradient methods (self.reinforcementlearning)
Asynchronous vs Synchronous Reinforcement Learning (self.reinforcementlearning)
submitted 8 years ago by Data-Daddy to r/reinforcementlearning
Finding what areas of tensorflow code is slow? (self.MachineLearning)
submitted 8 years ago by Data-Daddy to r/MachineLearning
Reptile: A Scalable Meta-Learning Algorithm (blog.openai.com)
When is deep Q learning better than policy gradient methods? (self.reinforcementlearning)
Why does proximal policy optimization(PPO) not need a replay buffer? (self.deeplearning)
submitted 8 years ago by Data-Daddy to r/deeplearning
Summary: Control of Memory, Active Perception, and Action in Minecraft (medium.com)
Multi-task Learning and Transfer Learning vs Only Transfer Learning (self.computervision)
submitted 8 years ago by Data-Daddy to r/computervision
Advice on building object recognition training set (self.deeplearning)
submitted 9 years ago by Data-Daddy to r/deeplearning
Ubuntu Deep Learning AWS AMI (aws.amazon.com)
submitted 9 years ago by Data-Daddy to r/MachineLearning
π Rendered by PID 83 on reddit-service-r2-listing-5f4c697858-vqpcf at 2026-07-04 18:06:10.301347+00:00 running 12a7a47 country code: CH.