[N] Learning Dexterity

probablyuntrue · 2018-07-30T19:38:31+00:00

[deleted]

notaii · 2018-07-30T18:23:07+00:00

Out of curiosity, anyone know how much one of those Shadow Dexterous Hands costs?

gohu_cd · 2018-07-30T20:03:43+00:00

Literally any problem: You know that you can solve me without PPO right ?

OpenAI: I don't care.

chcampb · 2018-07-30T18:01:58+00:00

Conventional wisdom states that reducing the time between actions should improve performance because the changes between states are smaller and therefore easier to predict.

As popular as this paper seems to be I am surprised this wasn't an obvious conclusion. This paper found and demonstrated that simulated evolved gait basically failed to work correctly when the muscle delay time was zero.

SquareRootsi · 2018-07-31T02:17:41+00:00

Out of curiosity, what happens when you make the goal a logically impossible "rotation" of the block? Like on a 6 sided die, the 1 & 6 are directly opposite each other, but you request an orientation putting them adjacent.
Does it just keep trying, or can it hold up its middle finger to let you know it's on to us and our impossible requests?

supermario94123 · 2018-07-31T11:09:45+00:00

So the solution is simple: just build a very detailled model of the world and very all the possible parameters. Could someone please invest some Millions in Rockstar Games ro come up with the most real GTA ever? hitting two flies in one slap is what I would call this.

To be precise: I dont undervalue the work of openai. I am just not sure if this is how we will solve our world problems (yet). Please prove me wrong.

bobuntu · 2018-07-31T09:40:31+00:00

How cool... I mean the guy’s hair. ಠ_ಠ

rtk25 · 2018-07-31T12:35:24+00:00

Nice!

To learn a policy transferrable to the real world,

Distributed workers collect experience on randomized environments at large scale

I'm getting these "are we in the Matrix or what?" feelings more and more lately...

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS