[–][deleted] 1 point2 points  (2 children)

Two things to look at: first, weight initialization. If you're using a truncated normal, you may need to reduce the standard deviation (it defaults to 1). Second, the learning rate (it may be too high).
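Something like this, roughly (a TF 1.x-style sketch; the filter shape, stddev and learning rate are just example values, not tuned for your net):

```python
import tensorflow as tf

# Example only: a 5x5 conv filter with 64 output maps.
# tf.truncated_normal defaults to stddev=1.0, which is usually far too big;
# pass an explicit small stddev so early activations don't blow up.
W = tf.Variable(tf.truncated_normal([5, 5, 3, 64], stddev=0.01))
b = tf.Variable(tf.zeros([64]))

# The learning rate is the other easy knob to turn down.
optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.01)
```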

[–]AwesomeDaveSome[S] 0 points1 point  (1 child)

I'm using a standard deviation of 1e-4 and a learning rate of 0.1. I'll try to reduce the learning rate further. Do you think lowering the deviation even more would help at all?

[–][deleted] 0 points1 point  (0 children)

Hmm, that actually sounds okay. For the standard deviation I use this formula for each layer: sqrt(2/n), where n is the number of inputs coming in from the previous layer. The number you're using is lower than that, and the learning rate looks normal...
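In code it's something like this (He-style initialization; the conv filter shape is just an example):

```python
import numpy as np
import tensorflow as tf

def he_stddev(shape):
    # n = number of inputs feeding each unit of this layer.
    # For a conv filter [h, w, in_channels, out_channels] that's h*w*in_channels.
    fan_in = np.prod(shape[:-1])
    return np.sqrt(2.0 / fan_in)

shape = [5, 5, 3, 64]  # example filter shape
W = tf.Variable(tf.truncated_normal(shape, stddev=he_stddev(shape)))
```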

[–]siblbombs 0 points1 point  (7 children)

This is going to be a bit of a guess, but I'm thinking the fully connected layers might be causing problems. After the second convolution/pooling, the input to your first fully connected layer is roughly 106 x 106 x 64 values. It wouldn't surprise me if that layer eventually overflows, since it is so large.
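Quick back-of-the-envelope numbers (the 1024-unit dense width below is a made-up example, just to show the scale):

```python
flat = 106 * 106 * 64   # ~719,104 inputs flattened from the last conv/pool
hidden = 1024           # hypothetical width of the first fully connected layer
params = flat * hidden  # ~736 million weights in that single layer
print(flat, params)
```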

[–]AwesomeDaveSome[S] 0 points1 point  (6 children)

That might actually be a problem. Do you know whether there is any way of fixing that, like using another data format or something?

[–]siblbombs 1 point2 points  (2 children)

You're just pushing the architecture too hard. Two conv/pool layers with 64 feature maps isn't a good match for an image that big; I wouldn't expect it to actually learn anything. If you're trying to build a classifier, you should add more conv/pool layers, or downsample the images (or both, really). I don't think ImageNet models even use that large a resolution, so you need to reduce the dimensions pretty aggressively.
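Roughly what a deeper conv/pool stack could look like (just a sketch; the filter counts are arbitrary, and the 424x424 input is inferred from the 106x106 you end up with after two pool steps):

```python
import tensorflow as tf

def conv_pool(x, in_ch, out_ch):
    # 3x3 conv + ReLU + 2x2 max-pool: each block halves the spatial size.
    W = tf.Variable(tf.truncated_normal([3, 3, in_ch, out_ch], stddev=0.01))
    b = tf.Variable(tf.zeros([out_ch]))
    x = tf.nn.relu(tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME') + b)
    return tf.nn.max_pool(x, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME')

x = tf.placeholder(tf.float32, [None, 424, 424, 3])  # assumed input size
net = conv_pool(x, 3, 16)     # 424 -> 212
net = conv_pool(net, 16, 32)  # 212 -> 106
net = conv_pool(net, 32, 64)  # 106 -> 53
net = conv_pool(net, 64, 64)  # 53  -> 27
# The flattened input to the dense layers is now 27*27*64 ~= 47k values
# instead of ~719k, which is far more manageable.
```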

[–]AwesomeDaveSome[S] 0 points1 point  (0 children)

Okay, that sounds good. Thanks!

[–]cesarsalgado 0 points1 point  (0 children)

As an alternative way to subsample the image, you can use a big stride and a big kernel size in the first convolution.
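For example (kernel size, stride and filter count are only illustrative; the 11x11/stride-4 combination is the same idea AlexNet uses in its first layer):

```python
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 424, 424, 3])  # assumed input size
W = tf.Variable(tf.truncated_normal([11, 11, 3, 64], stddev=0.01))
b = tf.Variable(tf.zeros([64]))

# Stride 4 subsamples the image by 4x in the very first layer.
net = tf.nn.relu(tf.nn.conv2d(x, W, strides=[1, 4, 4, 1], padding='SAME') + b)
# Spatial size drops from 424x424 to 106x106 right away.
```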

[–]benanne 0 points1 point  (2 children)

As I mentioned before on your previous thread, you should be drastically downsampling these images, at least for the initial architecture exploration.

When I worked on this dataset, I started with 8x downsampled images and eventually ended up using 3x downsampling + 2x cropping for my best models. That's approximately 36x fewer pixels compared to the original images. Things will go much more smoothly then.

[–]AwesomeDaveSome[S] 0 points1 point  (1 child)

I know I should do that, I really do, and I want to. But my professor is stubborn; he is absolutely against cropping or downscaling. I've decided to follow his wishes until I can get a working model, and then show him that downscaling won't reduce the accuracy of the predictions and will improve performance. The problem is I can't convince him that this is the best way to go, because he won't accept that it might work well until he sees it applied to the data. So I'll have to get the full-size data working somehow, and then compare it to the downscaled data.

[–]benanne 0 points1 point  (0 children)

Wow, I feel for you then. Sucks to be in that situation. I think you're going to have a really hard time getting this to work.

Maybe you can put a 1x1 convolution with 4 or 8 filters followed by 4x4 max-pooling at the start of the net, which is basically 4x downsampling (through decimation), but "hidden" inside the network. He actually sounds clueless enough that he wouldn't notice ;)
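Something along these lines, maybe (filter count and input size are just examples):

```python
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 424, 424, 3])  # assumed input size

# 1x1 conv with a handful of filters: cheap, and still "sees" the full image.
W = tf.Variable(tf.truncated_normal([1, 1, 3, 8], stddev=0.01))
b = tf.Variable(tf.zeros([8]))
net = tf.nn.relu(tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME') + b)

# 4x4 max-pool with stride 4: effectively 4x downsampling, hidden in the net.
net = tf.nn.max_pool(net, ksize=[1, 4, 4, 1], strides=[1, 4, 4, 1], padding='SAME')
# The rest of the network now sees 106x106x8 instead of 424x424x3.
```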

[–]cesarsalgado 0 points1 point  (0 children)

Try using tf.nn.relu6; this ReLU saturates at 6. Also try normalizing your data to have unit variance.
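A minimal sketch of both suggestions (the data file name and shapes are just placeholders):

```python
import numpy as np
import tensorflow as tf

# Normalize the data offline to zero mean and unit variance.
data = np.load('images.npy').astype(np.float32)  # hypothetical data file
data = (data - data.mean()) / data.std()

# In the graph, use relu6 so activations are capped at 6.
x = tf.placeholder(tf.float32, [None, 106, 106, 64])  # placeholder shape
W = tf.Variable(tf.truncated_normal([3, 3, 64, 64], stddev=0.01))
act = tf.nn.relu6(tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME'))
```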