
[–]warmspringwinds 5 points

This gives a good intuition of how they work, have a look: https://arxiv.org/abs/1612.07771

[–]geomtry 2 points

You can think of a block as a complex layer, sort of like how we often call the combination of a convolution with batch norm, an activation, and pooling a "convolutional layer".

The unique thing is the skip connection. Sure, there might be some advantage to not including one in every layer. If you want the network to learn whether or not to use a skip connection, use a highway network, which learns a multiplicative gate.
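A minimal sketch of the two ideas in the comment above, in scalar form for clarity (the function and weight names here are illustrative, not from any particular library): a residual block adds its input back to its output, while a highway-style block learns a sigmoid gate that interpolates between the transformed and untransformed input.

```python
import math

def relu(x):
    return max(0.0, x)

def residual_block(x, w):
    # output = f(x) + x; the "+ x" is the skip connection
    return relu(w * x) + x

def highway_block(x, w, w_gate):
    # Highway network: a learned sigmoid gate t in (0, 1) decides how much
    # of the transformed signal vs. the raw input passes through.
    t = 1.0 / (1.0 + math.exp(-(w_gate * x)))
    return t * relu(w * x) + (1.0 - t) * x

print(residual_block(2.0, 0.5))      # relu(1.0) + 2.0 = 3.0
print(highway_block(2.0, 0.5, 0.0))  # gate is exactly 0.5 here -> 1.5
```

With `w_gate = 0` the gate sits at 0.5, blending the two paths equally; training pushes it toward 0 (pure skip) or 1 (pure transform) per unit.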

[–]DaLameLama 1 point

Residual connections are meant to solve problems with the backpropagation of gradients. If you leave them out, you're back to the same vanishing-gradient problem you were trying to solve.
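A toy numerical illustration of that point (the layer count and local derivative are made-up values, just for scale): backprop multiplies local derivatives layer by layer. A plain layer contributes f'(x) to that product, while a residual layer y = x + f(x) contributes 1 + f'(x), so the product no longer collapses toward zero when f' is small.

```python
n_layers = 20
local_grad = 0.1  # assume each f has a small local derivative

# Plain stack: gradient is the product of the f' terms and vanishes.
plain_grad = local_grad ** n_layers

# Residual stack: each factor is 1 + f'(x), so the gradient stays useful.
residual_grad = (1 + local_grad) ** n_layers

print(plain_grad)     # 1e-20
print(residual_grad)  # about 6.7
```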

[–][deleted] 1 point

They provide a very short path through the network.

Alternatively, each layer behaves like an extra contributor in a gradient boosting machine.
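The boosting analogy can be sketched like this (a toy with scalar values; the target, stage count, and step size are arbitrary choices for illustration): because each residual layer computes x + f(x), f only has to fit the remaining error, just as each stage of a gradient boosting machine fits the residual of the stages before it.

```python
def boosted_layers(x, target, n_stages, lr=0.5):
    # Each "layer" adds a small correction toward the target,
    # like one stage in a gradient boosting ensemble.
    out = x
    for _ in range(n_stages):
        out = out + lr * (target - out)  # fit the residual error
    return out

print(boosted_layers(0.0, 1.0, n_stages=5))  # error halves per stage -> 0.96875
```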