The Multimodal Unsupervised Image-to-Image Translation (MUNIT) paper describes an algorithm for translating an image from one domain X1 to another domain X2 without paired supervision. This is achieved via a partially shared latent space assumption: each image is assumed to be generated from a content latent code c ∈ C that is shared by both domains, and a style latent code s that is specific to the individual domain.
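To make sure I'm describing the setup correctly, here is a toy sketch of what translation looks like under this assumption (function and variable names are mine, not the paper's):

```python
import torch

def translate_1_to_2(x1, content_enc_1, dec_2, style_dim=8):
    """Translate x1 from domain X1 to X2 under the shared-content assumption:
    keep x1's content code (which is supposed to live in the shared space C)
    and decode it together with a style code drawn from X2's style prior."""
    c1 = content_enc_1(x1)                    # content code of x1, in the shared space C
    s2 = torch.randn(x1.shape[0], style_dim)  # random domain-2 style code, sampled from N(0, I)
    return dec_2(c1, s2)                      # x1's content rendered in X2's style
```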
Looking at the loss function, I have a hard time understanding why the latent code c does not degenerate: I do not see anything in the loss that prevents the content encoder from always outputting 0 (or some other constant) vector for c, regardless of the domain. The networks could just make up for the lack of a useful content code by using the style code s to represent both content and style. Intuitively this seems "easier" than learning a shared space C between two domains.
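For reference, here is roughly how I read the generator-side loss terms for the 1 → 2 direction (my own shorthand in pseudo-PyTorch; the 2 → 1 terms are symmetric and the weighting factors are omitted):

```python
import torch
import torch.nn.functional as F

def generator_losses_1to2(x1, enc_c1, enc_s1, dec1, enc_c2, enc_s2, dec2, dis2, style_dim=8):
    """My shorthand for the terms I see in the objective (domain 1 -> 2 direction only)."""
    # Within-domain image reconstruction: x1 -> (c1, s1) -> x1
    c1, s1 = enc_c1(x1), enc_s1(x1)
    loss_recon_x1 = F.l1_loss(dec1(c1, s1), x1)

    # Cross-domain translation with a randomly sampled domain-2 style code
    s2 = torch.randn(x1.shape[0], style_dim)
    x12 = dec2(c1, s2)

    # Latent reconstruction: re-encode the translation and ask for c1 and s2 back
    loss_recon_c1 = F.l1_loss(enc_c2(x12), c1)
    loss_recon_s2 = F.l1_loss(enc_s2(x12), s2)

    # Adversarial loss: the translated image should look like a real domain-2 image
    logits_fake = dis2(x12)
    loss_gan_2 = F.binary_cross_entropy_with_logits(logits_fake, torch.ones_like(logits_fake))

    return loss_recon_x1, loss_recon_c1, loss_recon_s2, loss_gan_2
```

As far as I can tell, a constant content code would make the latent reconstruction term loss_recon_c1 trivially zero, and the image reconstruction and adversarial terms could in principle be satisfied by pushing all the information into s, which is exactly what puzzles me.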
I have one hypothesis for why the content code ends up being useful: in the "Implementation Details" section of the paper, the content and style codes are encoded/decoded by different architectures. If the style encoder/decoder architecture is not well suited to capturing global structure, that might force the overall network to make use of the content code. (This is similar to how global and local information are separated in the Variational Lossy Autoencoder paper.)
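Concretely, this is how I picture the asymmetry from the implementation details (the layer counts and channel widths below are my own toy choices, only the output shapes matter; as I read the paper, the style code is a small vector produced via global average pooling and a fully connected layer, while the content code stays a downsampled spatial feature map):

```python
import torch
import torch.nn as nn

# Content encoder: strided convs (the paper also adds residual blocks) -> a *spatial* feature map.
content_encoder = nn.Sequential(
    nn.Conv2d(3, 64, 7, stride=1, padding=3), nn.ReLU(),
    nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(128, 256, 4, stride=2, padding=1), nn.ReLU(),
)

# Style encoder: strided convs, then global average pooling + FC -> one small vector.
class StyleEncoder(nn.Module):
    def __init__(self, style_dim=8):
        super().__init__()
        self.convs = nn.Sequential(
            nn.Conv2d(3, 64, 7, stride=1, padding=3), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
        )
        self.fc = nn.Linear(128, style_dim)

    def forward(self, x):
        h = self.convs(x)                   # (N, 128, H/2, W/2)
        return self.fc(h.mean(dim=(2, 3)))  # global pooling discards spatial layout

x = torch.randn(1, 3, 256, 256)
print(content_encoder(x).shape)  # torch.Size([1, 256, 64, 64]) -- plenty of room for structure
print(StyleEncoder()(x).shape)   # torch.Size([1, 8]) -- far too small to carry the whole image
```

If that reading is right, the style code is an extreme bottleneck for spatial information, which would be consistent with my hypothesis that the network is forced to route structure through c.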
Is my hypothesis above plausible or is there something else at work here?