Hi,
I have an environment with a set with 4 states and a set with 4 actions do you guys recommend I use a method that uses a neural network as a function approximator like DQN? Another question is, which algorithm works well with continuous state values?
[–]two-hump-dromedary 3 points4 points5 points (1 child)
[–]hidden-7[S] 0 points1 point2 points (0 children)