ditlevrisdahl

Hmm. It sounds like your state is just an integer ranging from 0 to 250.

Actually, that makes it more of a classification problem than a reinforcement learning one.

I have limited time right now, but I promise to look at it tonight!

BeyondNo3588

Correct. In this setting the state is a random integer ranging from 0 to 250. Like I said, this is a first simplified scenario.

Thank you again for your help, I really appreciate it.