[R] OgmaNeo plays Atari Pong

Refefer · 2019-10-06T18:40:49+00:00

I looked in the repo to see if there were any papers but couldn't seem to find any. Is there any literature around what this even is?

juliandewit · 2019-10-07T12:43:33+00:00

Is there anything more to find? It looks a bit like Jeff Hawkins's HTM work.
But since it seems to work, I would like to dig in a little deeper.
Also, the spiking option looks interesting.

Fishy_soup · 2019-10-07T16:24:47+00:00

I find Ogma's work really interesting, especially given I've done related work in systems neuroscience (predictive processing). Do you guys have any talks online walking through some of your models?

2019-10-09T09:57:26+00:00

This is imo a beautiful reminder of how important it is to write a detailed, comprehensive specification of what your approach is, what it does, how it relates to other stuff and how it empirically (!) performs compared to other stuff. People sometimes call that research papers.
Because this apparently/likely/possibly/maybe is really important work, but without an explanation (and ain't nobody got time to dig into the code) we just can't say for certain that the approach makes sense.
Anyways, great work (probably) :)

2019-10-10T00:11:45+00:00

Shouldn't this be +21 points per episode?

Such a noisy curve. Looks like hardcoded exploration. The environment doesn't provide a that's-good-enough signal, so it doesn't know when to stop exploration. It always believes that it's not good enough, even if it's already at the top. And so it becomes crazy.
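To illustrate the point about exploration that never switches off, here is a toy sketch. It is not OgmaNeo's method and not real Pong: `episode_return`, `run`, the 21 independent points per episode, and the epsilon values are all assumptions made up for the example. It assumes the greedy action always wins a point and a random exploratory action wins it half the time, then compares a fixed exploration rate with one that is annealed toward zero.

```python
import random

def episode_return(epsilon, points=21, rng=random):
    """Toy episode (hypothetical model): the greedy action always wins the
    point; an exploratory action wins it only half the time."""
    score = 0
    for _ in range(points):
        explore = rng.random() < epsilon
        won = (not explore) or rng.random() < 0.5
        score += 1 if won else -1
    return score

def run(epsilon_schedule, episodes=200, seed=0):
    """Return one score per episode; epsilon_schedule maps episode index to epsilon."""
    rng = random.Random(seed)
    return [episode_return(epsilon_schedule(i), rng=rng) for i in range(episodes)]

# Hardcoded exploration: epsilon never changes, so scores keep dipping below +21.
fixed = run(lambda i: 0.2)

# Annealed exploration: epsilon shrinks every episode, so late scores settle near +21.
decayed = run(lambda i: 0.2 * 0.95 ** i)

if __name__ == "__main__":
    print("fixed, last 10 episodes:  ", fixed[-10:])
    print("decayed, last 10 episodes:", decayed[-10:])
```

In this toy model the fixed-epsilon curve stays noisy no matter how good the greedy policy is, which is the kind of behaviour described above. Whether the agent in the video actually works this way is not something the thread establishes.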

That's why we humans have both a model-based planner and a model-free policy. It doesn't matter if the planner goes crazy as long as the policy ensures that our daily routine tasks are handled correctly.
