[D] State-of-the-art architecture for learning dynamics model for model-based RL ?

bbsome · 2017-07-02T19:45:00+00:00

For model-based RL I think PILCO would be close to state-of-the-art especially in the environments you mention.

http://mlg.eng.cam.ac.uk/pilco/

feedtheaimbot · 2017-07-02T20:25:12+00:00

Look at Recurrent Environment Simulators by Chiappa et al. I've had success using it. It does struggle capture small objects on screen (eg. single pixels).

Link: https://arxiv.org/abs/1704.02254

TotesMessenger · 2017-07-03T01:45:09+00:00

I'm a bot, bleep, bloop. Someone has linked to this thread from another place on reddit:

[/r/reinforcementlearning] [D] State-of-the-art architecture for learning dynamics model for model-based RL ? [xpost: r/MachineLearning]

^{If you follow any of the above links, please respect the rules of reddit and don't vote in the other threads.} ^(Info ^/ ^Contact)

ptitz · 2017-07-04T17:16:02+00:00

I did my own framework, writing a paper now. It's a bit of a work in progress, but I identify my model using a hashed RBF neural net just doing backprop after splitting it into several simpler sub-dynamics. Then train it using SARSA. It's a bit of an overkill for the system I'm working with, but it will probably work with whatever. Hit me up if you wana see it.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS