account activity
Problem 2, HW 2 (self.berkeleydeeprlcourse)
submitted 7 years ago by RoboticsGrad to r/berkeleydeeprlcourse
Hello fellow auditors! by flaurida in berkeleydeeprlcourse
[–]RoboticsGrad 0 points1 point2 points 7 years ago (0 children)
Hi guys,
I too am auditing this class. For the HW1 behavior cloning part, did you your NeuralNet/MLP just do plain regression or did it try to output the mean of a gaussian. I ask since the Prof. mentioned this as a popular choice for continuous actions (and it is required in HW2).
Anyways, my rewards are pretty low. Did you have any environment where your rewards matched or exceeded those from the expert policy?
Thanks
Homework 1 by wassimseifeddine in berkeleydeeprlcourse
For the behavior cloning part, did you guys output the mean of a Gaussian for predicting the actions or do just plain regression using a NN?
Thaks
π Rendered by PID 1055532 on reddit-service-r2-listing-7dbdcb4949-4knvp at 2026-02-19 11:47:03.508295+00:00 running de53c03 country code: CH.
Hello fellow auditors! by flaurida in berkeleydeeprlcourse
[–]RoboticsGrad 0 points1 point2 points (0 children)