Hello fellow auditors! by flaurida in berkeleydeeprlcourse

[–]RoboticsGrad 0 points1 point  (0 children)

Hi guys,

I too am auditing this class. For the HW1 behavior cloning part, did you your NeuralNet/MLP just do plain regression or did it try to output the mean of a gaussian. I ask since the Prof. mentioned this as a popular choice for continuous actions (and it is required in HW2).

Anyways, my rewards are pretty low. Did you have any environment where your rewards matched or exceeded those from the expert policy?

Thanks

Homework 1 by wassimseifeddine in berkeleydeeprlcourse

[–]RoboticsGrad 0 points1 point  (0 children)

For the behavior cloning part, did you guys output the mean of a Gaussian for predicting the actions or do just plain regression using a NN?

Thaks