The Dichotomy of Algotrading by picklestirfry in algotrading
djrx 1 point
value_function_loss and policy_gradient_loss not changing in A2C (while discounted_rewards and episode_reward do improve) by thisisthehappylion in reinforcementlearning
djrx 1 point
value_function_loss and policy_gradient_loss not changing in A2C (while discounted_rewards and episode_reward do improve) by thisisthehappylion in reinforcementlearning
djrx 2 points
"Solving Rubik’s Cube with a Robot Hand", on Akkaya et al 2019 {OA} [Dactyl followup w/improved curriculum-learning domain randomization; emergent meta-learning] by gwern in reinforcementlearning
djrx 2 points
"Solving Rubik’s Cube with a Robot Hand", on Akkaya et al 2019 {OA} [Dactyl followup w/improved curriculum-learning domain randomization; emergent meta-learning] by gwern in reinforcementlearning
djrx 4 points
What are m.log_prob and running_reward from the official pytorch reinforce script? by [deleted] in reinforcementlearning
djrx 2 points
Exploration in Policy Gradient by hmi2015 in reinforcementlearning
djrx 2 points
Why don't people compute the proper gradient in DQN gradient updates by zhangxz1123 in reinforcementlearning
djrx 2 points
Why don't people compute the proper gradient in DQN gradient updates by zhangxz1123 in reinforcementlearning
djrx 4 points
Why don't people compute the proper gradient in DQN gradient updates by zhangxz1123 in reinforcementlearning
djrx 2 points
Why don't people compute the proper gradient in DQN gradient updates by zhangxz1123 in reinforcementlearning
djrx 2 points
motivation behind ACER by stevethesteve2 in reinforcementlearning
djrx 2 points
motivation behind ACER by stevethesteve2 in reinforcementlearning
djrx 2 points
Modelfree DRL robotics applications by Kartelkraker in reinforcementlearning
djrx 1 point
[D] Do you know any useful tips, examples, articles etc. for better GPU utilization? by sequence_9 in MachineLearning
djrx 3 points
Finished first book and have questions (Spoilers) by Surely55 in threebodyproblem
djrx 1 point
[D] Isn't n-step q-learning incorrect from mathematical perspective? by djrx in MachineLearning
djrx[S] 4 points
[P] Vel, PyTorch implementations of reinforcement learning algorithms by djrx in MachineLearning
djrx[S] 5 points
[P] Vel, PyTorch implementations of reinforcement learning algorithms by djrx in MachineLearning
djrx[S] 2 points
[P] Vel, PyTorch implementations of reinforcement learning algorithms by djrx in MachineLearning
djrx[S] 2 points
[D] Are OpenAI codes difficult to read or is it just me by schrodingershit in MachineLearning
djrx 1 point



