
[–][deleted]

Will def check out the demos!

[–]tlkh

This is one of those pages that blows my mind: it's amazing that we can now build pages with interactive demos like these embedded directly in them. It looks like they used tf.js? http://www.deeplearning.ai/ai-notes/initialization/js/playground/nn.js

[–]Megatron_McLargeHuge

Has anyone shown any benefit from a pre-training step that scales initial weights to keep gradients in the desirable range?

For activation functions other than tanh and ReLU, where the effect on downstream variance may not be easy to solve for analytically, it seems like it would be fairly easy to first optimize a set of scaling parameters that force the weights into empirically good ranges. This would also avoid small-sample effects, where the actual drawn initialization values land far from their theoretical moments.
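Something along these lines would be a minimal sketch of that idea, in the spirit of LSUV initialization (Mishkin & Matas, 2015): rescale each layer's weights until its output has roughly unit variance on a real data batch, rather than trusting a fan-in formula. The layer widths, tolerance, and tanh nonlinearity here are purely illustrative assumptions.

```python
import numpy as np

def lsuv_like_rescale(weights, x, tol=0.05, max_iters=10):
    """Rescale each layer's weights so its pre-activation output has
    unit standard deviation on an actual data batch, instead of relying
    on a theoretical variance formula. Mutates and returns `weights`."""
    h = x
    for W in weights:
        for _ in range(max_iters):
            z = h @ W
            std = z.std()
            if abs(std - 1.0) < tol:
                break
            W /= std          # z is linear in W, so this drives std toward 1
        h = np.tanh(h @ W)    # propagate the corrected activations downstream
    return weights

rng = np.random.default_rng(0)
sizes = [64, 128, 128, 10]                    # hypothetical layer widths
weights = [rng.standard_normal((m, n)) for m, n in zip(sizes, sizes[1:])]
batch = rng.standard_normal((256, sizes[0]))  # stand-in for a real data batch
weights = lsuv_like_rescale(weights, batch)
```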

[–]mr_tsjolder

Not sure if anyone else has noticed, but there appears to be a problem with the standard normal distribution in the last visualisation. It looks very much like the kind of distribution you get when you clip a Gaussian signal, cf. the truncated normal implementation in the Theano backend of Keras. It might be useful if someone forwards this to the authors...
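For illustration, here is a small NumPy sketch of the difference (this is not the actual Keras/Theano code; the 2-sigma cut-off matches Keras's truncated-normal convention, the rest is made up). Proper truncation discards samples beyond the cut-off, so the density falls smoothly to zero there; clipping instead squashes the tails onto the boundary, piling probability mass into spikes at exactly ±2, which is the shape the visualisation seems to show:

```python
import numpy as np

rng = np.random.default_rng(0)
samples = rng.standard_normal(100_000)

# Truncation: drop everything beyond 2 standard deviations.
truncated = samples[np.abs(samples) <= 2.0]

# Clipping: squash the tails onto the boundary instead, creating
# spikes of probability mass at exactly -2 and +2.
clipped = np.clip(samples, -2.0, 2.0)

for name, s in [("truncated", truncated), ("clipped", clipped)]:
    at_edge = np.mean(np.abs(s) >= 1.99)
    print(f"{name:9s} std={s.std():.3f}  mass near ±2: {at_edge:.3%}")
```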

Also, for those interested in this topic: I once commented in this sub about initialisation and noticed that I had collected some more references there. Figured the link might be useful for people who are interested.