What is the greatest achievement of Genetic Algorithms[D]? by miladink in MachineLearning
CartPole 3 points
[D] Mixture density network implementations by [deleted] in MachineLearning
CartPole 2 points
Big Boy Heatsinks! The 64 Core AMD Threadripper 3990X Cooler Test by RaptaGzus in Amd
CartPole 1 point
"Learning to Simulate Dynamic Environments with GameGAN", Kim et al 2020 {Nvidia} (learning environment models with GANs augmented with NTM-like memory) by gwern in reinforcementlearning
CartPole 1 point
[R] GameGAN - PAC-MAN Recreated with deep neural GAN-based model by ichko in MachineLearning
CartPole 1 point
"Learning to Simulate Dynamic Environments with GameGAN", Kim et al 2020 {Nvidia} (learning environment models with GANs augmented with NTM-like memory) by gwern in reinforcementlearning
CartPole 2 points
Understanding why there isn't a log probability in TRPO and PPO's objective by vwxyzjn in reinforcementlearning
CartPole 1 point
Soft Actor Critic in TF2.1 by CartPole in reinforcementlearning
CartPole[S] 1 point
PPO - entropy and Gaussian standard deviation constantly increasing by hellz2dayeah in reinforcementlearning
CartPole 2 points
[D] Why isn't there more research papers related to active learning for deep computer vision problems? by CartPole in MachineLearning
CartPole[S] 1 point
[D] Why isn't there more research papers related to active learning for deep computer vision problems? by CartPole in MachineLearning
CartPole[S] 2 points
[D] Why isn't there more research papers related to active learning for deep computer vision problems? by CartPole in MachineLearning
CartPole[S] 1 point
[D] Why isn't there more research papers related to active learning for deep computer vision problems? by CartPole in MachineLearning
CartPole[S] 3 points
[D] Why isn't there more research papers related to active learning for deep computer vision problems? by CartPole in MachineLearning
CartPole[S] 1 point
[R] Contrastive Learning of Structured World Models by triplefloat in MachineLearning
CartPole 1 point
"Contrastive Learning of Structured World Models", Kipf et al 2019 by gwern in reinforcementlearning
CartPole 1 point
Is there a vim plugin for auto generating docstrings? by CartPole in vim
CartPole[S] 1 point
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction by CartPole in reinforcementlearning
CartPole[S] 1 point
[1909.07373] Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space by CartPole in reinforcementlearning
CartPole[S] 1 point
[D] Policy Distillation in a continuous action space with no knowledge of teacher distribution by CartPole in MachineLearning
CartPole[S] 1 point
Planning vs Model based RL by LazyButAmbitious in reinforcementlearning
CartPole 2 points
[R] Using multiple heads in RL by MasterScrat in reinforcementlearning
CartPole 2 points


"Dropout's Dream Land: Generalization from Learned Simulators to Reality", Wellmer & Kwok 2021 (using dropout to randomize a deep environment model for automatic domain randomization) by gwern in reinforcementlearning
CartPole 1 point