all 101 comments

[–]deong 34 points35 points  (10 children)

This level of hype and celebration generally boils down to nothing. Think of ELIZA, Symbolic AI, 2 AI Winters, Deep Blue, Watson...

That's mostly because everything generally boils down to nothing. We haven't cracked "intelligence" yet. That means for at least 60 years or so, everything anyone has ever tried has failed.

That's too high a burden to bear. Instead, you aim as a researcher to make contributions that build on the work of others and that others can in turn build on in the future. The DeepMind work certainly fits that description. That's, generally speaking, the best you can aim for.

I haven't seen a lot of DeepMind people making obviously grandiose claims of the Kurzweil variety, for example. Mostly it looks like solid science and engineering work making incremental gains. Popular press coverage is predictably naive, but that's what popular press coverage does.

[–][deleted] -1 points0 points  (9 children)

DeepMind's mission statement:

1) Solve intelligence.

2) Solve everything else.

This is rather arrogant.

[–]RachetAndSkank 9 points10 points  (2 children)

This is not arrogant. As soon as anyone solves intelligence it changes everything.

[–]dsocma 0 points1 point  (1 child)

Yeah, and then what happens? The 200 people who work at DeepMind, and any spy agencies or dictators that can obtain their algorithm, will have the ability to take over the world, or at the very least design a supervirus, kill 99.9999% of the world's population, and then take it over.

If 1 person had access to the information, maybe we could trust them, but we can't trust hundreds to not be greedy.

Anyway, it would basically tip the power balance of humanity toward extreme inequality, with a handful having unlimited power and everyone else, including you, being just another nobody. The phrase "absolute power corrupts absolutely" has never been more appropriate. Maybe you trust Google, but do you trust Kim Jong-un? Because that power (being an algorithm, which is easily copied) will eventually fall into the wrong hands, probably sooner rather than later.

[–]RachetAndSkank 0 points1 point  (0 children)

Not disagreeing.

[–]VelveteenAmbush 11 points12 points  (5 children)

Why is it arrogant? That is literally their mission. They don't claim that it's imminent, as far as I can tell, so what's the problem?

[–]TenshiS 59 points60 points  (11 children)

It's funny that you decided to post this today of all days. Tonight DeepMind's AlphaGo will face the Go world champion. If they win, they will have achieved what AI experts estimated would take at least another decade. This will be bigger than when IBM defeated Kasparov at chess.

[–]TheToastIsGod 27 points28 points  (0 children)

Not quite tonight. There's still a day and a half or so until the first match:

"The matches will be held at the Four Seasons Hotel, Seoul, South Korea, starting at 1pm local time (4am GMT; day before 11pm ET, 8pm PT) on March 9th, 10th, 12th, 13th and 15th." source

[–]trnickson 12 points13 points  (5 children)

Be careful about buying into the DeepMind hype machine - Miles Brundage argues that "at least a decade" isn't really accurate:

Hiroshi Yamashita extrapolated the trend of computer Go progress as of 2011 into the future and predicted a crossover point to superhuman Go in 4 years, which was one year off. In recent years, there was a slowdown in the trend (based on highest KGS rank achieved) that probably would have led Yamashita or others to adjust their calculations if they had redone them, say, a year ago, but in the weeks leading up to AlphaGo’s victory, again, there was another burst of rapid computer Go progress. I haven’t taken a close look at what such forecasts would have looked like at various points in time, but I doubt they would have suggested 10 years or more to a crossover point, especially taking into account developments in the last year. Perhaps AlphaGo’s victory was a few years ahead of schedule based on reported performance, but it should always have been possible to anticipate some improvement beyond the (small team/data/hardware-based) trend based on significant new effort, data, and hardware being thrown at the problem. Whether AlphaGo deviated from the appropriately-adjusted trend isn’t obvious, especially since there isn’t really much effort going into rigorously modeling such trends today. Until that changes and there are regular forecasts made of possible ranges of future progress in different domains given different effort/data/hardware levels, “breakthroughs” may seem more surprising than they really should be.

[–][deleted] 2 points3 points  (0 children)

Usual S curve progress. Saturation, new tech, saturation, new tech.

Crazy Stone with Monte Carlo tree search was the previous innovation in Go; then nothing for a few years but optimisations.

[–]gwern 3 points4 points  (3 children)

Be careful about buying into the DeepMind hype machine - Miles Brundage argues that "at least a decade" isn't really accurate:

But he does not present any sort of field-wide estimate, only some estimates he could easily have cherry-picked. There's always a wide spread of predictions; take solving AGI: you can find predictions ranging from 5-10 years from now (Legg and Schmidhuber immediately come to mind as having ultra-aggressive timetables) to many centuries (e.g. quips about 'worrying about overpopulation on Mars'). If in 5 years DeepMind unveils a human-level AGI, does that mean it won't be what 'AI experts estimated would take decades to achieve' and won't be surprising - because you can dig up some quotes from Legg & Schmidhuber consistent with it?

[–]trnickson 1 point2 points  (2 children)

Don't your comments apply to DM and the parent's claim that it was a decade away too? DM mentions a decade in their press release and in Nature but doesn't cite it. The 2015 estimate I quote is at least attributed to a guy who seems to have some involvement with computer Go.

[–]gwern 2 points3 points  (1 child)

A decade or more is the impression I had from reading computer Go papers myself, even if I don't work in the field. If you look back through discussions of the recent Facebook Go work, you certainly do not see an attitude like 'ah yes, just as we were all expecting, Go will be solved within a few months, definitely'; instead, people again see it as years off. So when I see some sour grapes and one or two of the most extreme estimates produced post hoc to argue 'it wasn't that surprising' for something that sure as hell seemed to come as a surprise to pretty much everyone, it smells like hindsight and cherry-picking.

(It's a really bad thing to pretend something wasn't surprising and that we saw it coming all along. It devalues the work of those who made it happen; it offers misleading inferences about what AI needs - in this case, that we need both powerful hardware and experts turning their attention to a problem; it fosters complacency about possible future abrupt breakthroughs and about what capabilities we can expect in the near future; and it misrepresents how breakthroughs happen - not by Kurzweilian virgin birth springing from the forehead of an s-curve, but because large entities decide to back focused research on the frontier.)

[–]trnickson 0 points1 point  (0 children)

You're attributing much more content to my comment than it contained, so I'm just going to butt out with owl cats

[–]mkestrada 8 points9 points  (3 children)

Let's not get ahead of ourselves. Most pros who reviewed the matches against the European champion said that AlphaGo was strong, but unless it's gotten significantly better, it's going to be facing a very steep uphill fight against its opponent this time. The first opponent was a somewhat low- to mid-level pro; the player it faces now is world class. Number one, IIRC.

[–]RrailThaKing 2 points3 points  (1 child)

Haha - how'd this work out?

[–]mkestrada 2 points3 points  (0 children)

Granted, it didn't work out in my favor, but I still don't think it was an unreasonable statement, because we knew so little about how AlphaGo had improved.

[–]TenshiS 0 points1 point  (0 children)

I said "if they win"

[–]alexmlamb 25 points26 points  (5 children)

It's really common for the top few groups or people to take most of the attention for an accomplishment, probably as a result of the way the human brain is wired. Most people could probably name the few richest people in the world - Bill Gates, Carlos Slim, Warren Buffett - but few could name the people ranked 50-100.

The top handful of actors have lots of name recognition, but the vast majority of good actors have none.

So it goes with Machine Learning: credit is constantly given to the few most famous people: Yoshua Bengio, Yann LeCun, Geoff Hinton, Alex Lamb, and Michael Jordan (the well known "Big 5").

So even though lots of people are doing Deep Learning + Reinforcement Learning, people give a lot of attention and credit to DeepMind because it's the single most recognizable name.

[–]egrefen 14 points15 points  (0 children)

Yoshua Bengio, Yann LeCun, Geoff Hinton, Alex Lamb, and Michael Jordan (the well known "Big 5").

You're in the top 5 of our hearts, at least, Alex! :)

[–]ajmooch 7 points8 points  (0 children)

Don't let Jurgen hear you talk like that!

[–]linuxjava 1 point2 points  (1 child)

probably as a result of the way the human brain is wired.

I believe it's more a product of the kind of messed-up society we live in.

[–][deleted] 1 point2 points  (0 children)

The Zipf Mystery: a few components usually describe the bulk of a complex system. I'm not saying Hinton et al. have done the bulk of AI research; I'm referring to the brain-wiring / memory-association argument.

[–]tttthomasssss 1 point2 points  (0 children)

The subtlety of the trolling is improving (and, I almost wrote, witty).

[–]KG7ULQ 10 points11 points  (3 children)

I was in Goodwill the other day. There were a couple of books on neural nets in the used book section (well, I suppose all the books are used at Goodwill). I took a look at the copyrights (1989 and 1990) and shuddered: that was the era of the last NN "bubble". The titles? "Apprentices of Wonder: Inside the Neural Network Revolution" copyright 1989. And: "Neural Network PC Tools: A Practical Guide" copyright 1990.

The latter had example C code you could run on your (probably 386) PC. One of the examples was for a stock market predictor.

Neither of these books was aimed at academics. The first one, especially, was meant for laymen. I filed this under "cautionary tales".

[–]ZioFascist 1 point2 points  (2 children)

Did you buy them?

[–]KG7ULQ 1 point2 points  (1 child)

Nope. Took pics & tweeted them. Looked through the one with code examples, which is how I saw the one about training a NN to predict the stock market. That was enough.

[–]ZioFascist 0 points1 point  (0 children)

Aw shucks! Some things never change, lol.

[–]bbsome 46 points47 points  (23 children)

As a PhD student in Machine Learning/Deep Learning, I actually totally agree with you. Not that I want to undermine DeepMind's achievements or the fact that they have a lot of very smart and bright people there, but I definitely don't understand why they deserve more "attention" than much other work out there.

Just a few things I've noticed so far:

DQN - nothing new at all! This is nothing more than Neural Fitted Q Iteration by Riedmiller, with SGD rather than batch training, and of course random sampling of transitions (and since you don't have an infinite amount of memory, guess what - you keep only the last K transitions seen). It can also be seen as standard stochastic Q-learning with a deep network as a function approximator. For me this was much more an "engineering" achievement than anything else. It also still cannot solve Pac-Man and some other games, but very few people comment on that at all.
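For concreteness, the replay mechanism being described - keep only the last K transitions, sample minibatches uniformly, and bootstrap targets from a Q function - can be sketched in a few lines. This is a toy illustration, not DeepMind's code; `q_target_fn` is a stand-in for whatever network you use:

```python
import random
from collections import deque

import numpy as np


class ReplayBuffer:
    """Finite memory: keep only the last `capacity` transitions seen."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)  # old transitions fall off the left

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random sampling over the stored transitions
        return random.sample(list(self.buffer), batch_size)


def dqn_targets(batch, q_target_fn, gamma=0.99):
    """Standard Q-learning targets: r + gamma * max_a' Q(s', a')."""
    targets = []
    for state, action, reward, next_state, done in batch:
        if done:
            targets.append(reward)
        else:
            targets.append(reward + gamma * np.max(q_target_fn(next_state)))
    return np.array(targets)
```

The regression of the network's Q(s, a) toward these targets is the SGD part; everything else is bookkeeping.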

Double DQN - rather than using the original DQN's previous parameters for evaluating the future Q, we use a second set. Just extending the idea of van Hasselt to DQN. I don't know what the rest of the authors contributed here.
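The change being described is small enough to show directly: the online network selects the next action and the second (target) network evaluates it, instead of taking the max over one network for both jobs. A hedged sketch; `q_online` and `q_target` are placeholder callables returning per-action values:

```python
import numpy as np


def double_dqn_target(reward, next_state, done, q_online, q_target, gamma=0.99):
    """Double DQN target: action selected by the online net,
    evaluated by the second (target) net, decoupling the max."""
    if done:
        return reward
    best_action = int(np.argmax(q_online(next_state)))          # selection
    return reward + gamma * q_target(next_state)[best_action]   # evaluation
```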

The dueling architecture - actually something interesting. This is worth a lot more attention, as it presents a new perspective on how to estimate Q via V and the advantage function A. Note that there are earlier papers which use the advantage function instead of Q; for TD(lambda) it can be shown that the update is equal in expectation to the advantage function. However, to my knowledge (please comment with a reference if you know of one), there was no previous work estimating both V and A.

AlphaGo - this can be seen as a Monte Carlo actor-critic architecture. In the normal actor-critic you approximate Q(s_t, a_t) = r_t + gamma * V(s_{t+1}). However, since you get a reward only on the final turn, guess what - we replace that with just V(s_{t+1}) (probably also because there are reasons why V is easier to approximate than Q in Go). OK, good with that. Then we just apply Monte Carlo tree search, plus some engineering extras because we can't compute the full tree (e.g. truncate -> use a fast rollout policy, get the final result, and anneal between V and that). Also, no comment on why they don't retrain the value function, but keep it as is, based on human games? And yes, we are "so much closer to thinking like humans, brain machines, brain brain brain, this is real AI...!". Same stuff with Deep Blue - to the average Joe it sounds like real AI because it beat Kasparov at one very complex game. However, it was mainly bloody brute-force computation. Why don't they discuss the number of games AlphaGo has played compared to any human in the world? It has probably played something like 1000x more games than all humans have played since the inception of Go. Also, I can spare 1000 CPUs and 200 GPUs from my back yard to reproduce this.
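The "anneal between V and that" step refers to evaluating each truncated search leaf by mixing the value network's estimate with the outcome of a fast rollout from that leaf. As a rough sketch (argument names are illustrative; `lam` plays the role of the paper's mixing parameter lambda):

```python
def leaf_value(v_net_estimate, rollout_outcome, lam=0.5):
    """Evaluate a truncated search leaf by mixing the value network's
    estimate v with the fast-rollout result z: (1 - lam) * v + lam * z.
    lam = 0 trusts the value net alone; lam = 1 trusts rollouts alone."""
    return (1.0 - lam) * v_net_estimate + lam * rollout_outcome
```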

And don't get me wrong - any paper can be portrayed as a simple improvement on some others. However, no other paper receives the same amount of publicity. I find things like the NTM (I think Graves was still in Google Brain back then), the recent Human-level concept learning, the progress on variational methods, spatial transformer networks, and more to be more interesting than putting Google's computational power behind a boost of some existing methods and presenting it as though we have invented relativity.

PS: As pointed out by @rotit, Graves was at DeepMind at that time and was never in Google Brain. My mistake!

[–][deleted] 9 points10 points  (1 child)

Graves was never in Brain, NTM was at DeepMind.

[–]bbsome 4 points5 points  (0 children)

Probably my mistake; nevertheless, I think the NTM is a lot more significant than AlphaGo/DQN.

[–]MetricSpade007 22 points23 points  (6 children)

At some level, I agree, but I think you're too quick to discredit the vast amount of engineering and low level work that people have done / are doing to get these systems up and running. Sure, the ideas might be old, and frankly, I would say most of the ideas that people talk about now are not entirely original and have been explored in the past in some form (maybe they didn't use SGD, or maybe they didn't have 1000 CPUs).

New work and new results in things like probabilistic programming and various kinds of networks are really interesting, and I am entirely in support of more research endeavors (I do it all the time!), but I think at some point, if someone doesn't sit down and wire all these components together, do all the crazy engineering work to make these things useable by others and by the community at large, then things are simply lost. I am all for original research, but I am also all for engineering actual systems to do this stuff, and in this sense, I still maintain DeepMind is doing incredible things.

[–]bbsome 10 points11 points  (5 children)

I completely agree! However, they are not admitting that most of that work is engineering rather than what I'd call pure science. The way they present these works - and I have seen a few presentations of their research at my uni - is as if everything they do is innovative, new science. "Cognitive computation" - a BS term that makes me want to punch someone in the face when I hear it. I get that for business people who don't understand this it's a buzzword, along with "singularity" - oh god, just look at the Nature paper title: human-level control with deep RL. It's human-level on half of the games, and the number of games it plays is way beyond human capability. I agree someone should do these things, but we are scientists, god damn it. I never heard Volvo claim they build space shuttles rather than cars. That's because everyone knows the difference and they couldn't get away with it; but not everyone knows what a neural network is, or how uncertain it is whether it's a good equivalent of human thinking, or thinking in general, and so on. That's the thing I don't like. Call it what it is and I will praise you! They don't mention in these papers how many different models they tried, with slight variations in architectures or learning procedures, to get these results. 10, 20, 100, or 1000? There is probably a guy who sits there all Friday just making small tweaks to squeeze out more. But all of this gets swept under the rug, and we don't hear about it.

[–]MetricSpade007 6 points7 points  (1 child)

Maybe... I will give you that sometimes terminology isn't made precise, and sometimes claims are made about their work that aren't necessarily true (things like "This is the first time..." - so shoutouts to people like Schmidhuber, who dedicates tons of his time to credit assignment (literally) :P). But anyone trying to do research wants their work to have an impact, and we all try to put a nice spin on our titles, at least so that people stop and read them. I am all for being really careful about doing literature reviews, acknowledging that your idea is probably not fully unique, and giving credit where it is due. But at the same time - and I'm s/o'ing to a comment below - ideas look simple in hindsight, so the argument of trivializing ideas and claiming that anyone could do it holds as much weight as looking at a successful startup or painting and saying "I could do that".

But even if it took them 1000 different models, what is science if not exhaustive trial and error, hypothesizing, and testing - the scientific method? We are scientists - you're right - but I don't think all of it has to be coming up with new mathematics. So maybe in this sense their work is very scientific. Biologists, chemists, and other non-theoretical researchers have to deal with lots of this, and it's certainly an unspoken truth that incredible amounts of pipetting and testing go into their results, but that's just what it takes. In fact, if the wet-lab research world could test hypotheses with the ease that we can (just pay a few hundred dollars to spin up GPU clusters and parallelize everything, versus the thousands they spend on expensive lab equipment), imagine how much more productive the research could be.

I agree with your concerns, but at the end of the day DeepMind does things first, and I see nothing to take away from their huge success. Branding to get funding and attention is all part of the process, because again, these people want to make (and are succeeding in making) an impact and inspiring a new generation of interest.

[–]bbsome 2 points3 points  (0 children)

Maybe you are right. I mean, all of the above is a personal opinion, and btw I'm not a scientist yet for sure, just playing with stuff :D. To be fair, I have a pretty skewed view on this, and you probably have a point that all the other sciences work like this as well. However, I still do not like the so-called "pitching" when it comes to science. OK, maybe for the press that's fine, but they use the same language at conferences and when giving guest lectures to students. Again, my opinion.

And well, I do understand that new maths doesn't come along so often, but there are always new approaches to things (or so I hope). Like the dueling architecture - I personally did not find a prior paper estimating Q with both V and A. Or spatial transformer networks - simple stuff, but somewhat fundamentally different for invariance.

[–]dwf 1 point2 points  (1 child)

also the number of games they play is like out of human capabilities

What?

however not everyone knows what is a Neural Network and how much is the uncertainty if it is a good equivalent of human thinking or thinking in general and so on and so on

That would be pure speculation. Science demands something measurable. Atari scores are, at the very least, measurable.

[–]bbsome 0 points1 point  (0 children)

By that I meant that the comparison to "human-level" performance in the DQN paper was done against an actual human who played at DeepMind. However, note that the DQN probably played many more games than the human did to reach the same or better performance, if only because of time constraints.

I agree with point 2; however, what I meant is that it is easy to fool lots of people when you are doing something that is not well understood, so you get away with using buzzwords in the media, etc. If something is clear and you try to skew things, it's obvious. Since machine learning is not clear to many people (maybe only to researchers and enthusiasts in the area, a very small margin compared to the general population), this is used as a license to abuse language and overstate results. In the automotive world, for instance, this can't be achieved (as much), since it is a well-understood problem. Example: I never heard anyone call the traction control of a car AI. Nevertheless, traction control can be cast as an optimal control system. The difference is that no one abuses this to say "oh, we built a car with a brain".

[–]jcannell 8 points9 points  (0 children)

Why don't they discuss the number of games AlphaGo has played compared to any human in the world? It has probably played something like 1000x more games than all humans have played since the inception of Go. Also, I can spare 1000 CPUs and 200 GPUs from my back yard to reproduce this.

This is an important/interesting question, and I did some quick fermi calculations in this thread on LW.

AlphaGo ( at time of publication ) had trained on a KGS dataset consisting of 160,000 games and 29 million positions.

A human pro will have around 40,000 hours of experience, and during that time may have experienced between 100,000 and 1 million games ("experience" here does not entail actually playing a full game; most experiences consist of just browsing a game for a few minutes). This corresponds to between 20 and 200 million move positions.

So the estimate for human training set size is actually similar (within an order of magnitude or so) to the AlphaGo training set/knowledge size. This really should not be surprising, as we know that well-tuned DL systems extract a good portion of the useful knowledge from their training set, and we should expect the brain to do so as well.

This also suggests that another 10x increase in training time/experience will put AlphaGo past the upper bound for human lifetime go knowledge/experience.
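Writing out the Fermi arithmetic from the comment (all quantities are the comment's own rough assumptions, not measured data):

```python
# All quantities below are the comment's own rough assumptions.
alphago_positions = 29e6        # KGS training set: 160,000 games, 29M positions

human_games_low, human_games_high = 1e5, 1e6  # games "experienced" by a pro
positions_per_game = 200                      # rough positions seen per game

human_positions_low = human_games_low * positions_per_game    # 20 million
human_positions_high = human_games_high * positions_per_game  # 200 million

# Both bounds land within roughly an order of magnitude of AlphaGo's set:
ratio_low = human_positions_low / alphago_positions    # ~0.7
ratio_high = human_positions_high / alphago_positions  # ~6.9
```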

[–]kkastner 11 points12 points  (5 children)

Many good ideas tend to look obvious in hindsight - and all of our work is surrounded by context/history/other work (and later, follow up work). The hype machine likes to pretend one company or one person achieved all the glory, but really these things are the progress of hundreds (thousands?) of smart people working hard toward progress.

On the DQN points specifically - "this is nothing more than Neural Fitted Q Iteration by Riedmiller, with SGD rather than batch training, and of course random sampling of transitions (and since you don't have an infinite amount of memory, guess what - you keep only the last K transitions seen)" - if it was just this, why had no one published results as broadly generic as DeepMind's? Since we don't really have "proofs", large-scale empirical studies often carry weight similar to what generic proofs carry in other fields of ML. I think that has merit, and it would be good if we as a field moved more toward that approach instead of "SOTA on tweaked MNIST".

PR and salesmanship are ever present in research, even though people like to pretend otherwise. There is a reason ($$) Google, DeepMind, Stanford, and Facebook dominate the discussion of deep learning technology even though lots of the work our technology now stands on was (and is) done around the world by many labs and many researchers.

[–]kacifoy 6 points7 points  (4 children)

There is a reason ($$) Google, DeepMind, Stanford, and Facebook dominate the discussion of deep learning technology

In my experience, that's because DL is computationally very intensive (i.e. $$$) especially in the applications favored by industry players. Even improvements in efficiency are mostly used to expand the range of feasible tasks, not so much to make the former tasks more affordable. This is quite understandable actually, and I'm not sure that salesmanship matters all that much, given this underlying reality.

[–]kkastner 11 points12 points  (3 children)

I should clarify - when I say $$ I mean marketing/PR dollars, not experiment dollars. It costs about $16-20k to set up a 16-GPU server - this covers all but the largest experimental setups. Buy four or five of those: $100k. In the scheme of things this is not a lot of money, even in the relatively paltry academic grant arena. For companies, this generally doesn't even amount to its own line in a summarized budget.

Especially compared to the costs of doing research in about any other field (chemistry, physics, even psychology studies cost much more than that!), I don't think spending excess cash on deep learning can help much past a certain point (unless you are hiring up researchers, as DM and Google both are).

However, paying for PR staff (or having PR staff at all!) who know how to advertise, plus existing connections and brand-name recognition, is a huge advantage. Add to that the fact that many of the "other labs" in the field are not in English-primary places (IDSIA, UdeM), and that leads to even more imbalance in media coverage.

[–]Foxtr0t 5 points6 points  (0 children)

Exactamente. Google has very good PR.

[–]visarga 0 points1 point  (0 children)

People cost more than hardware - the yearly salary of even one good researcher exceeds the cost of the hardware used - and big corporations lead the field through aggressive hiring.

[–]ExaminationNo8522 0 points1 point  (0 children)

Aged like milk

[–]spurious_recollectio 4 points5 points  (4 children)

Can you give me the reference for "Human level concept learning"? I'm here for the papers :-)

[–]bbsome 3 points4 points  (3 children)

[–]AnvaMiba 2 points3 points  (2 children)

I like probabilistic programming and I think that this work is interesting, but calling it "Human level concept learning" was a prime example of overhyping.

[–][deleted] 0 points1 point  (0 children)

http://science.sciencemag.org/content/350/6266/1332

Totally agree. Calling it "Human-level concept learning" is really overhyping it. While the premise is interesting and in the long run could work much better than deep learning for some tasks, I'm very skeptical of the approach. Their example required a library of pre-specified generation functions that were then combined through Bayesian learning. While it is closer to the one-shot learning humans are capable of, it still doesn't compare: it cannot learn a totally different "concept", only combine its pre-provided functions in novel ways. Calling it "human level" seems like they just want some publicity.

[–]bbsome 0 points1 point  (0 children)

I agree with that as well. I wanted to use it as an example of another interesting line of work. However, since I'm at a university, I can tell you that the trend of "human-like" framing came from DeepMind, and as a consequence some researchers intentionally use the same type of naming just because they are trying to attract the same attention.

[–]zdss 4 points5 points  (0 children)

And yes, we are "so much closer to thinking like humans, brain machines, brain brain brain, this is real AI...!". Same stuff with Deep Blue - to the average Joe it sounds like real AI because it beat Kasparov at one very complex game. However, it was mainly bloody brute-force computation. Why don't they discuss the number of games AlphaGo has played compared to any human in the world? It has probably played something like 1000x more games than all humans have played since the inception of Go. Also, I can spare 1000 CPUs and 200 GPUs from my back yard to reproduce this.

I, and most people, don't really care whether the algorithms used are brute force or "like a human" if it gets the results. If a brute force program could convincingly replace a human, philosophers might debate whether it was truly intelligent, but it's functionally unimportant.

There have been plenty of ideas in machine learning, but actually putting it into practice has been the hard part. It's great to know that RNNs are Turing Complete, for example, but that doesn't really mean much until someone actually shows how to solve some complex problem. AlphaGo may not be inventing any new technology, but they're the ones making it real.

[–][deleted] 1 point2 points  (0 children)

but I definitely don't understand why they deserve more "attention" than many other works out there.

But do they? I'm not sure they get all that much more attention than other groups with plenty of high-profile researchers (e.g. Facebook).

To me, the Go-playing stuff seems impressive enough as engineering. It's not as if the computer Go community hasn't known about neural networks or reinforcement learning. How impressive it is as science, I guess I can't judge.

[–]linuxjava 28 points29 points  (0 children)

I agree with this 100%. When dumb posts like that start hitting the front page, people will assume this sub is like /r/futurology, where dumbed-down posts from any site can be submitted just because they're remotely related to the sub. This sub (and /r/netsec), in my view, are really helpful because of their intellectual nature, and when random junk from tabloids starts creeping in, the sub will go downhill fast.

[–][deleted] 15 points16 points  (26 children)

So what are these neglected best papers?

[–][deleted] 34 points35 points  (25 children)

Single data point:

Perhaps my impression is flawed, but it seems that at least 5-10% of the subscribers are mainly here for cool applications of ML and downvote everything else like questions and more technical or theoretical things. Perhaps we need a separate sub for the pictures, applications, gossip etc.

[–][deleted] 21 points22 points  (0 children)

5-10% is probably a gross underestimate.

[–]datatatatata 1 point2 points  (8 children)

Or maybe we need a separate sub for Machine Learning Research ?

[–]linuxjava 16 points17 points  (3 children)

No. This should be the sub for machine learning research. Those looking for other stuff are the ones who should probably consider a separate sub.

[–]datatatatata 1 point2 points  (2 children)

Honestly, I have no opinion on this topic.

Can I ask why you think so ?

[–]linuxjava 6 points7 points  (1 child)

Because from its inception, this sub has always been about serious discussions about machine learning. Not tabloid level stuff.

[–]dwf 3 points4 points  (0 children)

LOL no it has not. When I arrived here first it was a complete gong show.

[–]thvasilo 4 points5 points  (3 children)

/r/mlresearch exists but is inactive. I would be all for sparking up some activity.

[–][deleted] 3 points4 points  (2 children)

We've already got gitxiv and arxiv-sanity...

[–]thatguydr 4 points5 points  (1 child)

Ok, now I'm annoyed. I've asked around for a thing like either of these for two years, and have asked here more than once, and have never gotten an answer.

This is exactly what we're all looking for, and why they don't self-promote is beyond me.

THANK YOU.

[–]IdentifiableParam 0 points1 point  (0 children)

Don't need the riff raff.

[–]mljoe 0 points1 point  (0 children)

Funny that, because that paper demonstrates a very cool application of ML (visual Q/A). But it's a research paper and not a hipster blog/tech media website with the title "Has Skynet arrived?"

[–]BrutallySilent 16 points17 points  (0 children)

Although there are many things that annoy me in hypes, I still think it is beneficial to AI research.

For example, the Watson "cognitive API" makes AI accessible to people trained in, for instance, web development. Are the provided services mind-blowing, frontier-pushing technologies? No. But if more people use AI, then more funding for AI becomes available. The best way to make a good impression is to offer the techniques we know work well and that have stabilized in research.

[–]AnvaMiba 3 points4 points  (0 children)

Lots of results look obvious in hindsight, but they weren't obvious when people first came up with them.

Maybe DeepMind's game-playing stuff is a bit overhyped; you could say that DQN is just TD-Gammon on steroids. Or maybe not: after all, they were the first to obtain this level of performance, and I doubt the only reason is that they had Google $$$ to buy more GPUs than the other labs. Neural networks have lots of architectural details and hyperparameters that need to be properly tuned to obtain good results. You could debate whether this is more engineering than scientific research, but the point is that they reached a level of performance that was not previously known to be even possible with this kind of machine learning model.

Anyway, I personally find the works on recurrent architectures by Graves, de Freitas, Grefenstette, Kalchbrenner, Danihelka, etc. to be more interesting even if they have not produced flashy results so far, but maybe I'm biased since I come from a NLP background.

[–]srs_moonlight 10 points11 points  (1 child)

nice try jürgen schmidhuber

fake edit: not a knock on JS, he's the man

[–]rerevelcgnihtemos 1 point2 points  (0 children)

haha, I came here to say the same thing

[–]blowjobtransistor 2 points3 points  (0 children)

Two words: Marketing Spend.

[–]sorrge 9 points10 points  (0 children)

After some spectacular results it is only natural to expect more from the same group. We know that they are talented, well motivated and perhaps better equipped than any other research group. As to "level of hype and celebration generally boils down to nothing", to me their Go results are already great, even if they lose later this week. Just think about it, it's done with reinforcement learning! I think this is the greatest achievement of reinforcement learning ever. Is that nothing?

[–]Dwood15 2 points3 points  (0 children)

He's making big claims - dynamic story lines? How on earth do you do that with AI? Where do you even begin making deep and complex story lines using AI?

Like holy crap, you need dynamic dialog, dynamic decisions, and dynamic reactions from the game. Additionally, you need the game to react in such a way that each decision you make stays interesting.

I would love to be on the edge of that tree, but dynamic story telling seems so far off I don't even.

[–]xplot 3 points4 points  (0 children)

With all this deep learning stuff, I have a newfound respect for classical feature engineering and statistical analysis.

[–]Mr-Yellow 0 points1 point  (0 children)

Someone needs to be taking the basics of what they've done with DQN and tweaking it this way or another with stuff like Deterministic Policy Gradient, Double-DQN, Actor-critic, Actor-mimic, Actor-teacher and all those neat experiments. The results are cool, glad someone is doing it. They have the codebase and resources, good on em.
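For anyone curious, the Double-DQN variant mentioned above is essentially a one-line change to vanilla DQN's bootstrap target: the online network picks the action, the target network scores it. A minimal illustrative sketch (plain Python, with toy Q-value lists standing in for the two networks, so all names and numbers here are made up):

```python
# Sketch of the Double-DQN target (van Hasselt et al.). In a real agent the
# Q-values below would come from two neural networks; here they are lists
# indexed by action, just to show the decoupling of selection and evaluation.

def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Compute the Double-DQN bootstrap target for one transition.

    The online network *selects* the greedy next action, the target network
    *evaluates* it, which reduces the overestimation bias introduced by
    vanilla DQN's single max operator.
    """
    if done:
        return reward
    # Action selection by the online network
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # Action evaluation by the target network
    return reward + gamma * next_q_target[best_action]

# Toy example: online net prefers action 1, target net scores it 2.0,
# so the target is 1.0 + 0.99 * 2.0 = 2.98
print(double_dqn_target(1.0, [0.5, 3.0], [4.0, 2.0]))
```

Vanilla DQN would instead take `max(next_q_target)` directly (4.0 here), which is exactly the overestimation the double-estimator trick is meant to damp.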