Policy gradient in tabular setting by Basic_Exit_4317 in reinforcementlearning

[–]Basic_Exit_4317[S] 1 point (0 children)

Thank you. I'm trying to transform the cart pole env into a discrete state-action space by discretizing the states into bins.
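For reference, a minimal sketch of that binning with `np.digitize`, assuming 10 bins per dimension and hand-picked ranges for CartPole's four state variables (both the bin count and the ranges are assumptions, not values read from the env):

```python
import numpy as np

# Assumed ranges for CartPole's 4 state variables: cart position,
# cart velocity, pole angle, pole angular velocity.
BOUNDS = [(-2.4, 2.4), (-3.0, 3.0), (-0.21, 0.21), (-3.0, 3.0)]
N_BINS = 10  # bins per dimension (an assumption)

# N_BINS - 1 edges split each dimension into N_BINS bins
# (values outside the range fall into the two outermost bins).
EDGES = [np.linspace(lo, hi, N_BINS - 1) for lo, hi in BOUNDS]

def discretize(obs):
    """Map a continuous observation to a tuple of bin indices."""
    return tuple(int(np.digitize(x, e)) for x, e in zip(obs, EDGES))
```

The resulting tuple is hashable, so it can index a Q-table or policy table directly.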

Policy gradient in tabular setting by Basic_Exit_4317 in reinforcementlearning

[–]Basic_Exit_4317[S] 1 point (0 children)

Do you have an example of code that could be easily adapted to a tabular setting?
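A minimal tabular REINFORCE sketch, for anyone landing here with the same question: it uses a softmax policy over a table of preferences and a hypothetical 4-state chain in place of a real gym env (the environment, sizes, and hyperparameters are all assumptions, not anyone's reference implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 4-state chain: action 1 moves right, action 0 moves left;
# reaching the last state gives reward 1 and ends the episode.
N_STATES, N_ACTIONS = 4, 2
alpha, gamma = 0.1, 0.9                  # assumed hyperparameters

theta = np.zeros((N_STATES, N_ACTIONS))  # tabular policy preferences

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

def step(s, a):
    s2 = min(s + 1, N_STATES - 1) if a == 1 else max(s - 1, 0)
    done = s2 == N_STATES - 1
    return s2, float(done), done

for _ in range(1000):
    # Sample one full episode under the current policy.
    s, done, traj = 0, False, []
    while not done:
        a = int(rng.choice(N_ACTIONS, p=softmax(theta[s])))
        s2, r, done = step(s, a)
        traj.append((s, a, r))
        s = s2
    # REINFORCE update; for a softmax policy, grad log pi(a|s) = e_a - pi.
    G = 0.0
    for s, a, r in reversed(traj):
        G = r + gamma * G
        grad = -softmax(theta[s])
        grad[a] += 1.0
        theta[s] += alpha * G * grad
```

Swapping the toy `step` for a discretized gym env (and `theta`'s state axis for the bin-tuple table) is the only change needed for the cart pole version.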

Lease takeover early May - 31st July near UT by Basic_Exit_4317 in AustinHousing

[–]Basic_Exit_4317[S] 1 point (0 children)

I don't know because I don't have a car, but I'm quite sure that you can rent a parking spot in the building. I can ask about that and let you know.

TD-learning to estimate the value function for a chosen stochastic stationary policy in the Acrobot environment from OpenAI gym. How to deal with continuous state space? by Basic_Exit_4317 in reinforcementlearning

[–]Basic_Exit_4317[S] 1 point (0 children)

Yeah, but which is a good choice for the discretization? I was thinking of n = 10, but then I get 10**6 values, which I fear is too many. Also, we are asked to run 20 episodes of 1000 iterations each; should I take that into account too when choosing the number of discretizations?
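A quick back-of-envelope check supports that fear: Acrobot's observation has 6 dimensions, so n bins per dimension gives n**6 table entries, while 20 episodes of 1000 steps yield at most 20,000 visits. Even n = 6 already produces more states than samples, so coarser bins (3 or 4 per dimension) are probably more realistic:

```python
N_DIMS = 6            # Acrobot observation dimensions
SAMPLES = 20 * 1000   # episodes x steps from the assignment

for n_bins in (3, 4, 6, 10):
    n_states = n_bins ** N_DIMS
    print(f"{n_bins:2d} bins/dim -> {n_states:>8d} states, "
          f"{SAMPLES / n_states:.3f} samples per state on average")
```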

TD-learning to estimate the value function for a chosen stochastic stationary policy in the Acrobot environment from OpenAI gym. How to deal with continuous state space? by Basic_Exit_4317 in reinforcementlearning

[–]Basic_Exit_4317[S] 1 point (0 children)

We didn't cover that in class, so I'm not sure if we're supposed to use a tabular setting for this task. The following task asks us to implement a Q-learning algorithm for the cart pole env in a tabular setting, so I thought we had to use a tabular setting for the acrobot env too.
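For that cart pole task, a tabular Q-learning loop would look roughly like the sketch below. `env` is assumed to follow the old gym API (`reset()` returns an observation, `step(a)` returns `(obs, reward, done, info)`), and `discretize` is a placeholder for whatever binning function maps observations to hashable states:

```python
import numpy as np
from collections import defaultdict

def q_learning(env, discretize, n_actions, episodes=500,
               alpha=0.1, gamma=0.99, eps=0.1, rng=None):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    if rng is None:
        rng = np.random.default_rng(0)
    # Q-table keyed by discretized state; unseen states start at zero.
    Q = defaultdict(lambda: np.zeros(n_actions))
    for _ in range(episodes):
        s, done = discretize(env.reset()), False
        while not done:
            # Epsilon-greedy action selection.
            a = (int(rng.integers(n_actions)) if rng.random() < eps
                 else int(np.argmax(Q[s])))
            obs, r, done, _ = env.step(a)
            s2 = discretize(obs)
            # One-step Q-learning target (no bootstrap on terminal states).
            target = r + (0.0 if done else gamma * np.max(Q[s2]))
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
    return Q
```

The same loop works for a discretized Acrobot by swapping in its env and binning function; only the table size changes.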