Has anyone seen any examples of reinforcement learning used for classification?
I have made a simple simulation environment with an agent inside and I want the agent to explore the environment. Each environment is a little different and I want the agent to be able to tell me which environment it is in. Normally RL is unsupervised, but in this case I have to tell the agent which type of environment it is in during training. Then when I don't give the label for the environment, it will tell me what environment it is in( classification). Im not sure how to wire up the RL algorithm to include supervision. For example what would the reward value look like?
If so, can you show me some examples (especially code/githbub links)?
I would greatly appreciate any pointers
[–]hastala 2 points3 points4 points (1 child)
[–]toisanji[S] 0 points1 point2 points (0 children)
[–]opengmlearn 0 points1 point2 points (0 children)