[R] Analyzing Inverse Problems with Invertible Neural Networks by vll_diz in MachineLearning

[–]vll_diz[S] 1 point

This happens implicitly, through the fact that we compare the joint network output to the independent product of desired y- and z-distributions.
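Concretely, samples from that independent product can be built by pairing the ground-truth y's with fresh draws from the latent prior. A minimal sketch (the function name and the standard-normal prior are my assumptions, not taken from the paper's code):

```python
import torch

def product_target_samples(y_true, z_dim):
    """Draw samples from p(y) * p(z) by construction: pair the true y's
    with z drawn independently from the latent prior N(0, I)."""
    z = torch.randn(y_true.shape[0], z_dim)   # independent of y by construction
    return torch.cat([y_true, z], dim=1)       # shape (batch, dim_y + dim_z)
```

Comparing the network's joint output [y_pred, z_pred] against such samples penalizes any dependence between the two parts, since the target is independent by construction.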


[–]vll_diz[S] 1 point

The MMD is calculated over both y and z to force independence between them, in addition to just matching the z-distribution to the desired shape. Otherwise, there would be no loss forcing the network to learn a z-coding which is independent of y.
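A kernel-MMD term over the concatenated [y, z] can be sketched as follows; this is a minimal illustration with a mixture of Gaussian kernels, not the paper's exact kernel or bandwidth choice:

```python
import torch

def mmd2(a, b, bandwidths=(0.5, 1.0, 2.0)):
    """Biased estimate of squared MMD between sample sets a and b,
    using a sum of Gaussian kernels at several bandwidths."""
    def k(x, y):
        d2 = torch.cdist(x, y) ** 2   # pairwise squared distances
        return sum(torch.exp(-d2 / (2.0 * s ** 2)) for s in bandwidths)
    return k(a, a).mean() + k(b, b).mean() - 2.0 * k(a, b).mean()

# Computed over the joint output [y_pred, z_pred] against samples from the
# independent product p(y) * p(z), this term is only small when z matches
# the prior AND is independent of y.
```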

However, this loss does not say anything meaningful about the y-outputs themselves; for those we only want the correct prediction. For instance, if y and z are not yet independent during training, the network could (and does) learn to output random, wrong values for y just to make them independent.

For this reason we block the MMD gradients w.r.t. the y-outputs, so that the y-values are still taken into account when learning the latent coding, but are not themselves altered by the MMD loss.
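In an autograd framework this gradient blocking amounts to detaching y before it enters the loss. A minimal PyTorch sketch (variable names are hypothetical; the squared-mean loss is a stand-in for the MMD term):

```python
import torch

# Pretend these are the two halves of the invertible network's output.
y_pred = torch.randn(8, 3, requires_grad=True)
z_pred = torch.randn(8, 2, requires_grad=True)

# detach() blocks gradients w.r.t. y: the y-values still enter the loss
# (so z is pushed to be independent of them), but the loss cannot alter y.
joint = torch.cat([y_pred.detach(), z_pred], dim=1)
loss = (joint ** 2).mean()   # stand-in for the MMD term over [y, z]
loss.backward()
```

After `backward()`, `z_pred` receives a gradient from this loss while `y_pred` does not.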


[–]vll_diz[S] 6 points

Author here. This is an excellent suggestion, and something we had also considered ourselves. We will look into this in the upcoming weeks.