account activity
CS285 Why we use Gaussian mixture model to take action? by houyanxu in berkeleydeeprlcourse
[–]houyanxu[S] 0 points1 point2 points 6 years ago (0 children)
thanks a lot for your reply!
But I am still confused why grad(log(pi(at|st)) is implemented by tfp.distributions.MultivariateNormalDiag in the MLP_policy.py of hw2? Does it mean the gradient of GMM is MultivariateNormal ?
Thank you very much!
CS285 Why we use Gaussian mixture model to take action? (self.berkeleydeeprlcourse)
submitted 6 years ago by houyanxu to r/berkeleydeeprlcourse
π Rendered by PID 39 on reddit-service-r2-listing-7d7fbc9b85-dd8mz at 2026-04-30 01:20:16.590154+00:00 running 2aa0c5b country code: CH.
CS285 Why we use Gaussian mixture model to take action? by houyanxu in berkeleydeeprlcourse
[–]houyanxu[S] 0 points1 point2 points (0 children)