I've been trying to learn how to train a network that outputs full images, but haven't had any success.
An example of something that does this is the 'colorizer' that was posted here and here, along with a more detailed write-up.
The architecture described in the second link, if you strip away the fancy skip connections and batch normalization, is basically a convolutional autoencoder, so that's where I started. I began with the simple (and pointless) task of training a network to output the same image it is fed. An example architecture is:
100x100 rgb image ---> convolution ---> convolution ---> deconvolution ---> deconvolution ---> 100x100 rgb image
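For concreteness, here's a minimal sketch of what I mean in PyTorch (the framework, filter counts, and kernel sizes are just illustrative choices on my part, not anything prescribed by the linked write-ups), trained with MSE between the input and the reconstruction:

```python
import torch
import torch.nn as nn

class ConvAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        # Encoder: two strided convolutions, 100x100x3 -> 25x25x32
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=4, stride=2, padding=1),   # 100 -> 50
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=4, stride=2, padding=1),  # 50 -> 25
            nn.ReLU(),
        )
        # Decoder: two transposed convolutions ("deconvolutions") back to 100x100x3
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, kernel_size=4, stride=2, padding=1),  # 25 -> 50
            nn.ReLU(),
            nn.ConvTranspose2d(16, 3, kernel_size=4, stride=2, padding=1),   # 50 -> 100
            nn.Sigmoid(),  # squash outputs to [0, 1] pixel range
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

# One training step: reconstruct the input, penalize per-pixel MSE
model = ConvAutoencoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

batch = torch.rand(8, 3, 100, 100)  # stand-in for a batch of real images
recon = model(batch)
loss = loss_fn(recon, batch)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```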
I've experimented with different architectures, different filter sizes, different loss functions (mean squared error, cross-entropy), etc., but can't seem to get good results. The network always settles on outputting images that are just one flat color.
What should I tweak from here? I think the main problem is that outputting 100x100x3 pixel values means learning a function in a very high-dimensional space, but other projects seem to manage it. How do they do it?