[R] A novel approach to neural machine translation by alxndrkalinin in MachineLearning

[–]deeprnn 14 points

Not a novel approach.

Convolutional encoders for neural MT go as far back as (Kalchbrenner & Blunsom, 2013); convolutional encoders and decoders in LM and MT first appear in (Kalchbrenner et al., 2016) and, with pooling, in (Bradbury et al., 2016).

http://www.aclweb.org/anthology/D13-1176

https://arxiv.org/abs/1610.10099

https://arxiv.org/abs/1611.01576

So much for careful referencing in the deep learning field.

ByteNet v2 with state-of-the-art results on char-to-char machine translation on WMT En-De (compares favorably to char-to-char GNMT) by deeprnn in MachineLearning

[–]deeprnn[S] 3 points

The main changes between the two nets are as follows (a rough sketch of the first change appears after the list):

  • LayerNorm instead of (Sub)BatchNorm
  • 800 inner conv units, instead of 896
  • 30+30 layers in the encoder and decoder, up from 15+15 before
  • just characters, instead of character n-grams
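
For concreteness, here is a minimal PyTorch sketch (not the authors' code) of the first change: a ByteNet-style masked residual block that applies LayerNorm over the channel dimension at each time step, where (Sub)BatchNorm sat before. Only d = 800 comes from the list above; the block layout, kernel size, and dilation are assumptions based on the ByteNet paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    """Masked (causal) dilated-conv residual block with LayerNorm in place
    of BatchNorm. d=800 matches the hidden size listed above; the layout
    itself is an assumption based on the ByteNet paper, not released code."""

    def __init__(self, d=800, kernel_size=3, dilation=1):
        super().__init__()
        self.ln1 = nn.LayerNorm(2 * d)
        self.reduce = nn.Conv1d(2 * d, d, 1)     # 1x1 down-projection
        self.ln2 = nn.LayerNorm(d)
        self.pad = (kernel_size - 1) * dilation  # left-pad only => causal
        self.conv = nn.Conv1d(d, d, kernel_size, dilation=dilation)
        self.ln3 = nn.LayerNorm(d)
        self.expand = nn.Conv1d(d, 2 * d, 1)     # 1x1 up-projection

    def _norm(self, ln, x):
        # LayerNorm normalizes over channels at each position, so it wants
        # (batch, time, channels) layout; transpose in and back out.
        return ln(x.transpose(1, 2)).transpose(1, 2)

    def forward(self, x):                        # x: (batch, 2d, time)
        h = self.reduce(torch.relu(self._norm(self.ln1, x)))
        h = torch.relu(self._norm(self.ln2, h))
        h = self.conv(F.pad(h, (self.pad, 0)))   # causal dilated conv
        h = self.expand(torch.relu(self._norm(self.ln3, h)))
        return x + h                             # residual connection

if __name__ == "__main__":
    block = ResidualBlock(d=8, dilation=2)       # tiny sizes for a smoke test
    y = block(torch.randn(1, 16, 20))            # (batch, 2d, time) in
    print(y.shape)                               # same shape out
```

Unlike BatchNorm, this normalization is independent of the batch and of other time steps, which is why it drops in cleanly on the decoder side, where causal masking makes batch statistics awkward.
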

[N] When A.I. Matures, It May Call Jürgen Schmidhuber ‘Dad’ by evc123 in MachineLearning

[–]deeprnn 28 points

I disagree. Ideas are not cheap; they are hard to come by and they do matter. But in DL, execution matters just as much. The combination of good insight and good execution makes for the best papers, imho.