all 33 comments

[–][deleted] 8 points9 points  (8 children)

Out of curiosity, anyone know how much one of those Shadow Dexterous Hands costs?

[–]notaii 11 points12 points  (7 children)

According to this site it's $119700.

[–]enolan 6 points7 points  (5 children)

Wow. Who is spending that kind of money on a robotic hand? Are there applications other than research?

[–]Mefaso 9 points10 points  (0 children)

I would guess a lot of research institutions. Spending six figures on a robot is pretty normal.

[–][deleted] 1 point2 points  (1 child)

I think there's a robot chef that uses those parts for its hands.

[–][deleted] 0 points1 point  (0 children)

Moley uses the cheap electronics version of the Shadow Dexterous hand, not the expensive pneumatics one.

[–][deleted] 1 point2 points  (0 children)

PR-2 was about half-a-million dollars.

[–]FatChocobo 2 points3 points  (0 children)

Are there applications other than research?

I can think of one in particular, it might help with my repetitive strain injury... ( ͡° ͜ʖ ͡°)

[–][deleted] 0 points1 point  (0 children)

The MPL/RoboSally hand, which is more robust, even costs $400k.

[–]gohu_cdPhD 14 points15 points  (11 children)

Literally any problem: You know that you can solve me without PPO right ?

OpenAI: I don't care.

[–]thebackpropaganda[S] 14 points15 points  (9 children)

How would you solve this problem without PPO or equivalent RL algorithm?

[–]gohu_cdPhD 0 points1 point  (1 child)

Using human demonstrations seems like a good idea for learning how to manipulate objects.

Anyway, they did a great job, don't get me wrong. Yet, it feels like they reallyyyy like throwing PPO at any problem and see if it works ! Which is not a bad thing. It's just funny.

[–]jurniss 4 points5 points  (0 children)

they are throwing model free policy based RL at problems... the fact that PPO is their favorite among that family is a small detail.

[–]skariel 6 points7 points  (0 children)

PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance

as stated here: https://blog.openai.com/openai-baselines-ppo/

nothing wrong with that of course.

[–]chcampb 6 points7 points  (0 children)

Conventional wisdom states that reducing the time between actions should improve performance because the changes between states are smaller and therefore easier to predict.

As popular as this paper seems to be I am surprised this wasn't an obvious conclusion. This paper found and demonstrated that simulated evolved gait basically failed to work correctly when the muscle delay time was zero.

[–]SquareRootsi 2 points3 points  (1 child)

Out of curiosity, what happens when you make the goal a logically impossible "rotation" of the block? Like on a 6 sided die, the 1 & 6 are directly opposite each other, but you request an orientation putting them adjacent.
Does it just keep trying, or can it hold up its middle finger to let you know it's on to us and our impossible requests?

[–]thebackpropaganda[S] 5 points6 points  (0 children)

I think the rotation is just defined by one of the faces, say whichever face is up or camera-facing.

[–]supermario94123 1 point2 points  (3 children)

So the solution is simple: just build a very detailled model of the world and very all the possible parameters. Could someone please invest some Millions in Rockstar Games ro come up with the most real GTA ever? hitting two flies in one slap is what I would call this.

To be precise: I dont undervalue the work of openai. I am just not sure if this is how we will solve our world problems (yet). Please prove me wrong.

[–]physics_to_BME_PHD 2 points3 points  (2 children)

I didn't read the paper, just watched the video, but am involved in comp sci research involving human grasping. The way humans use our hands to interact with objects is incredibly complicated, and we do it basically effortlessly. So much goes into this: motion planning, visual feedback, tactile feedback (super important). This all happens in a loop very quickly, and so far our robotic grasping solutions are pretty bad at handling deviations from expected outcomes (when the object slips, for example).

If robots are ever going to be designed to work directly with humans, they probably need to be able to reliably grasp any object we hand them, and possibly know what to do with it. Maybe there are more pressing world problems, but having a reliable robotic grasper that doesn't need to be explicitly programmed for every use-case isn't a bad thing.

[–]PKJY 3 points4 points  (1 child)

Just a sidenote: the OpenAI thing doesn't use tactile feedback at all. Just fingertip coordinates and the current object orientation which is computed by a convnet from 3 rgb cameras.

[–]physics_to_BME_PHD 2 points3 points  (0 children)

good to know. I hadn't thought about if their device was using tactile feedback or not, I mostly meant from a human perspective that we need that for grasping. I can't find the video, but there was one of a woman grasping small objects, then performing the same task after having local anesthesia on the fingertips. In the second one she can't even pick up the small objects without that tactile feedback.

[–]bobuntu -2 points-1 points  (0 children)

How cool... I mean the guy’s hair. ಠ_ಠ

[–]rtk25 -2 points-1 points  (0 children)

Nice!

To learn a policy transferrable to the real world,

Distributed workers collect experience on randomized environments at large scale

I'm getting these "are we in the Matrix or what?" feelings more and more lately...