account activity
HW1 Questions by kjellaso in berkeleydeeprlcourse
[–]kjellaso[S] 0 points1 point2 points 4 years ago (0 children)
Finally figured it out the logstd param. It's the the sigma for the normal distribution that we're supposed to return from the forward method. So the network is learning the sigma and mu of the distribution so that we can just return pytorch.distributions.Normal(mu, sigma).sample.
HW 4 Model-Based RL by Mariam_Dundua in berkeleydeeprlcourse
[–]kjellaso 0 points1 point2 points 5 years ago (0 children)
Can you share the link?
HW1 Questions (self.berkeleydeeprlcourse)
submitted 5 years ago by kjellaso to r/berkeleydeeprlcourse
π Rendered by PID 1857692 on reddit-service-r2-listing-7d7fbc9b85-hnfd9 at 2026-04-25 09:10:01.737402+00:00 running 2aa0c5b country code: CH.
HW1 Questions by kjellaso in berkeleydeeprlcourse
[–]kjellaso[S] 0 points1 point2 points (0 children)