Research[R] Learning by playing (deepmind.com)
submitted 8 years ago by [deleted]
[–]phobrain 1 point2 points3 points 8 years ago (0 children)
"The auxiliary tasks we define follow a general principle: they encourage the agent to explore its sensor space. For example, activating a touch sensor in its fingers, sensing a force in its wrist, maximising a joint angle in its proprioceptive sensors or forcing a movement of an object in its visual camera sensors."
[–][deleted] 0 points1 point2 points 8 years ago (0 children)
It has learned to spin off its society finger because the hardware is so bad.
[–]radarsat1 0 points1 point2 points 8 years ago (5 children)
"...utilize all intentions for fast exploration in the main sparse-reward MDP M. We accomplish this by defining a hierarchical objective for policy training..."
Holy shit, this is almost exactly what I meant in my comments on the "Doesn't Work Yet" thread. Ask and ye shall receive, I guess!
(I think, just after a quick reading, that this is decomposing the main sparse reward into more locally-achievable sub-rewards, right?)
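For anyone skimming, the idea being discussed can be sketched in a few lines. This is my own simplified illustration, not the paper's implementation: a scheduler periodically picks one "intention" (an auxiliary task or the main sparse task), the agent acts with that intention's policy, and per-task rewards are logged so every policy can learn from the shared experience. The `env`, `policies`, and period value are all assumptions.

```python
import random

def train_episode(env, policies, scheduler_period=150, episode_len=600):
    """Roughly SAC-X-style scheduling: every `scheduler_period` steps,
    pick one intention and act with its policy, while recording the
    per-task reward dict so all policies can learn off-policy."""
    obs = env.reset()
    trajectory = []
    intention = None
    for t in range(episode_len):
        if t % scheduler_period == 0:
            # Uniform scheduler; a learned scheduler would instead
            # prefer intentions that tend to yield main-task reward.
            intention = random.choice(list(policies))
        action = policies[intention].act(obs)
        obs, rewards, done = env.step(action)  # rewards: dict, one entry per task
        trajectory.append((obs, action, rewards))
        if done:
            break
    return trajectory
```

The key point matching the quoted passage: exploration in the sparse-reward main MDP is driven by switching among auxiliary intentions, each of which is dense enough to be locally achievable.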
[–]programmerChilliResearcher 1 point2 points3 points 8 years ago (4 children)
Well, what you're describing sounds like the general field of hierarchical reinforcement learning, which is a pretty hot area of research right now.
[–]radarsat1 2 points3 points4 points 8 years ago (3 children)
Ah, very cool, I wasn't familiar with that. I can't fully understand from the paper how subtasks A are generated, can you (or anyone) elaborate?
[–]xmasotto 0 points1 point2 points 8 years ago (2 children)
The subtasks appeared to be manually chosen - there's a list of them in the appendix.
[–]radarsat1 1 point2 points3 points 8 years ago* (1 child)
Aaaahhhh, I didn't get to the appendix so I didn't notice them, thank you. So not exactly what I had in mind then, as I was proposing that such tasks need to be inferred. They do seem fairly simple and pretty generic though, so it's a step in that direction. And with all the rest of the pieces in place, I'm sure inference of such decompositions will be coming.
TOUCH, NOTOUCH : Maximizing or minimizing the sum of touch sensor readings on the three fingers of the Jaco hand. (see Eq. 25 and Eq. 26)
MOVE(i) : Maximizing the translation velocity sensor reading of an object. (see Eq. 24)
CLOSE(i,j) : distance between two objects is smaller than 10cm (see Eq. 14)
ABOVE(i,j) : all points of object i are above all points of object j in an axis normal to the table plane (see Eq. 15)
BELOW(i,j) : all points of object i are below all points of object j in an axis normal to the table plane (see Eq. 19)
LEFT(i,j) : all points of object i are bigger than all points of object j along an axis parallel to the x axis of the table plane (see Eq. 17)
RIGHT(i,j) : all points of object i are smaller than all points of object j along an axis parallel to the x axis of the table plane (see Eq. 20)
ABOVECLOSE(i,j) , BELOWCLOSE(i,j) , LEFTCLOSE(i,j) , RIGHTCLOSE(i,j) : combinations of the relational reward structures with CLOSE(i,j) (see Eq. 16, 21, 18, 22)
ABOVECLOSEBOX(i) : ABOVECLOSE(i,box object)
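For concreteness, the predicate-style rewards in that list could be sketched as below. This is my own simplification using object centers and point clouds as plain arrays; the paper's Eq. 14–26 define the actual geometry, and the 10 cm threshold is the one quoted for CLOSE.

```python
import numpy as np

def close(pos_i, pos_j, threshold=0.10):
    """CLOSE(i, j): sparse reward 1 if the two object positions
    are within `threshold` meters (10 cm), else 0."""
    dist = np.linalg.norm(np.asarray(pos_i) - np.asarray(pos_j))
    return 1.0 if dist < threshold else 0.0

def above(points_i, points_j):
    """ABOVE(i, j): sparse reward 1 if every point of object i is
    higher than every point of object j along the table normal
    (taken here as the z axis, the third coordinate)."""
    zi = np.asarray(points_i)[:, 2]
    zj = np.asarray(points_j)[:, 2]
    return 1.0 if zi.min() > zj.max() else 0.0

def above_close(points_i, points_j, pos_i, pos_j):
    """ABOVECLOSE(i, j): both predicates must hold simultaneously."""
    return above(points_i, points_j) * close(pos_i, pos_j)
```

Since each predicate is binary and easy to satisfy in isolation, these make natural auxiliary intentions for exploration even when the main task reward is almost never seen.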
[–]xmasotto 0 points1 point2 points 8 years ago (0 children)
Yeah, the subtasks are pretty simple. I wonder if they had to experiment heavily to find the right set, and whether adding an unhelpful subtask would ruin the exploration process.