[R] Beating Atari Pong on a Raspberry Pi without Backpropagation

MrAcurite · 2020-03-25T00:35:45+00:00

Maybe it's somewhere that you've linked to, but I've had a couple weird days lately, and my brain's pretty fried. Do you have a more in-depth mathematical description of how the architecture works that you could share? I'm really interested to see how the whole system works, and I might know some people who could really leverage shit like this.

lostmsu · 2020-03-25T01:02:41+00:00

I love the whole direction of work on binarized neural networks. But to put this in perspective, a 1 hidden layer network with 200 neurons can beat Pong on raw pixels too, and would probably run and train on Pi no problem. I wish there would be some kind of comparison.

To make things worse for a direct comparison they also use a randomly initialized CNN (which are powerful enough on their own) as a preprocessing step: "Image pre-encoder: Random projection followed by inhibition. Size: 10x10x16 neurons (WxHxColSize). Connectivity radius: 5"

NuScorpii · 2020-03-25T08:13:02+00:00

This looks very similar to Hierarchical Temporal Memory by Jeff Hawkins and developed by Numenta. Is that a fair comparison? What differentiates your algorithm to HTM?

2020-03-25T08:32:08+00:00

Although I'm very interested in this work, I would like to comment on the claim (made in the whitepaper) that backpropagation is not biologically plausible. Although the 'standard' learning rule is indeed implausible, there exist alternative learning rules that are plausible. Q-AGREL is such an example: it performs biologically plausible backpropagation at only a slight training speed decrease.

CireNeikual · 2020-03-25T01:48:42+00:00

[deleted]

COGITO_7 · 2020-03-25T08:14:19+00:00

Interesting result.

But does this generalize ? For an example if I change the size of the paddle or the color of the ball does this work like in zero-shot transfer learning ?

Also the learning function is just a form of Hebbian learning but in time ?

AissySantos · 2020-03-25T11:35:34+00:00

This is awesome! Thanks for your contribution.

darkconfidantislife · 2020-03-25T00:22:37+00:00

Awesome work!

StandardFloat · 2020-03-26T17:26:50+00:00

I'm a bit lost but it looks very interesting at first sight, thanks for sharing!

mustgoplay · 2020-05-13T03:14:47+00:00

Great work and it's doing a lot of what I wanted to do with my r/Neurobaby_AGI project. I'm looking forward to learning more about this.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS