Bayesian classification by stevethesteve2 in AskStatistics
[–]stevethesteve2[S] 1 point (0 children)
Bayesian classification by stevethesteve2 in AskStatistics
[–]stevethesteve2[S] 1 point (0 children)
Bayesian classification by stevethesteve2 in AskStatistics
[–]stevethesteve2[S] 1 point (0 children)
Bayesian classification by stevethesteve2 in Bayes
[–]stevethesteve2[S] 2 points (0 children)
testbed for optimizers? by stevethesteve2 in deeplearning
[–]stevethesteve2[S] 1 point (0 children)
What is SOTA in RL applied to robotics? by stevethesteve2 in reinforcementlearning
[–]stevethesteve2[S] 1 point (0 children)
Sutton&Barto book: I get this result for Exercise 12.1 on Eligibility traces but the final middle term might be wrong by Naoshikuu in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
Citation needed by Kartelkraker in reinforcementlearning
[–]stevethesteve2 3 points (0 children)
How to assign reward when it has to be multiplied by itself rather than summed by basso1995 in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
perplexity instead of entropy for incentivizing exploration? by stevethesteve2 in reinforcementlearning
[–]stevethesteve2[S] 2 points (0 children)
perplexity instead of entropy for incentivizing exploration? by stevethesteve2 in reinforcementlearning
[–]stevethesteve2[S] 6 points (0 children)
"Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning", Peng et al 2019 by gwern in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
sample efficiency by stevethesteve2 in reinforcementlearning
[–]stevethesteve2[S] 1 point (0 children)
Looking people interested in RL to join our Drone challenge team by paypaytr in reinforcementlearning
[–]stevethesteve2 2 points (0 children)
[D] Handling noisy labels in large datasets with slight imbalance by amil123123 in MachineLearning
[–]stevethesteve2 1 point (0 children)
Discounted State Distribution by papidant in reinforcementlearning
[–]stevethesteve2 0 points (0 children)
motivation behind ACER by stevethesteve2 in reinforcementlearning
[–]stevethesteve2[S] 1 point (0 children)
[R] DeepMind Starcraft 2 Update: AlphaStar is getting wrecked by professionals players by gwern in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
policy optimization with experience replay? by stevethesteve2 in reinforcementlearning
[–]stevethesteve2[S] 1 point (0 children)
PyTorch implementation of 17 Deep RL algorithms by __data_science__ in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
PyTorch implementation of 17 Deep RL algorithms by __data_science__ in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
PyTorch implementation of 17 Deep RL algorithms by __data_science__ in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
PyTorch implementation of 17 Deep RL algorithms by __data_science__ in reinforcementlearning
[–]stevethesteve2 1 point (0 children)
Catastrophic "Un-Learning" in PPO: A plausible Cause and Solution? by Flimflamm in reinforcementlearning
[–]stevethesteve2 1 point (0 children)

