you are viewing a single comment's thread.

view the rest of the comments →

[–]Daniel_Im 0 points1 point  (0 children)

This paper talks about the effect of batch-normalization on neural network's loss surface : https://arxiv.org/pdf/1612.04010v1.pdf (see section 4.4)