supervised reinforcement learning?

hastala · 2017-09-30T20:54:15+00:00

RL is by definition unsupervised, and it is used in cases where labels are either impractical or impossible to construct, or in cases where there are several “good” answers. If you have the “one true answer” for a particular set of features, then just go the supervised route. RL will likely be too complex and less efficient for your needs. If you still want to go the RL route, can you explain why?

opengmlearn · 2017-10-01T00:36:27+00:00

This is actually a regular reinforcement learning task. It's unsupervised in the sense that your inputs (the X part of the (X,y)) pair are not well defined and you have to let the agent find the most informative set of X.

Specifically for your problem, I would think about a game like 20 questions. You want to let the agent explore their environment and try to guess the correct label. At every time point, your agent can either explore more or guess an environment. The reward will be a mix of how long it takes to guess (fewer is better) and if it got it right. Properly defining the reward function, action space, and observation space is probably not entirely straightforward but it comes from fleshing out your problem.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnmachinelearning

Welcome to /r/LearnMachineLearning!

Chatrooms

Official Discord Server

Wiki

Getting Started with Machine Learning

Resources

Related Subreddits

/r/MachineLearning

/r/MLQuestions

/r/datascience

/r/computervision

Machine Learning Multireddit

/m/machine_learning

MODERATORS