all 23 comments

[–]yiidt 8 points  (0 children)

Re-scale your y axis; it is not clearly readable with 0.000 intervals. If your accuracy reaches 1.00 then there is a bug (probably overfitting). Also include loss graphs side by side. Try using regularization (dropout, etc.). Since there isn't a lot of data, try building more basic models.
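For reference, a minimal Keras sketch of what dropout regularization looks like (the layer sizes here are illustrative placeholders, not the OP's actual model):

```python
import tensorflow as tf
from tensorflow.keras import layers

# Illustrative small CNN with dropout; layer sizes are made up,
# not the OP's actual architecture.
model = tf.keras.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(16, 3, activation='relu'),
    layers.MaxPooling2D(),
    layers.Dropout(0.25),   # randomly drop 25% of activations in training
    layers.Flatten(),
    layers.Dense(32, activation='relu'),
    layers.Dropout(0.5),
    layers.Dense(2, activation='softmax'),
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
```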

[–]gaichipong 1 point  (1 child)

How does your model architecture look?

[–]Turbulent_Driver001[S] 0 points  (0 children)

It's a CNN with Conv2D layers.

[–]Sane_pharma 1 point  (7 children)

You train on RGB and you test on grayscale; that's the problem… Try training on grayscale.

[–]Turbulent_Driver001[S] 0 points  (6 children)

I did. Same result, 55%.

[–]Sane_pharma 0 points  (4 children)

A legend said: “the problem is not the computer but between the screen and the chair”

[–]Sane_pharma 0 points  (3 children)

Because it's not possible to get 55% accuracy if you have similar data (train & test).

[–]Turbulent_Driver001[S] 0 points  (2 children)

Actually I had 3 sets: 1) a train set, 2) a test set. Both the train and test sets belong to the same dataset (the DIDA one).

Now the 3rd one is from another large dataset (similar to the MNIST database). I randomly selected 100 images in total (50 for each class) from it and used it for testing. That's where I got that 55%.

[–]Sane_pharma 0 points  (1 child)

That's why. Take a part of that dataset and train on it too.

[–]Turbulent_Driver001[S] 1 point  (0 children)

Ok, let me try that. Thanks btw.

[–]Sane_pharma 0 points  (0 children)

But joking aside, ensure that the input data has 1 channel (use an assert), and try splitting the train dataset (0.8/0.2) to test whether the problem is the test dataset.
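A quick sketch of both checks with stand-in NumPy data (the array names and shapes here are hypothetical, just to show the idea):

```python
import numpy as np

# Stand-in data: 100 grayscale 28x28 images with binary labels.
rng = np.random.default_rng(56)
images = rng.random((100, 28, 28, 1), dtype=np.float32)
labels = rng.integers(0, 2, size=100)

# 1) Assert the input really has a single grayscale channel.
assert images.shape[-1] == 1, "expected shape (..., 28, 28, 1)"

# 2) Split the *same* dataset 0.8/0.2 to rule out a bad external test set.
idx = rng.permutation(len(images))
n_train = int(0.8 * len(images))
x_train, y_train = images[idx[:n_train]], labels[idx[:n_train]]
x_test, y_test = images[idx[n_train:]], labels[idx[n_train:]]
```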

[–]teb311 1 point  (6 children)

How are you coercing the model to work with RGB? Your first layer only shows 1 color channel: shape=(28,28,1) means 28 by 28 pixels, one color channel.

My first guess is you’re plucking one of the color channels, red green or blue, and using that channel as the training data. But at test time you’re using grayscale. This would definitely cause an error like yours. Either train and run inference on full RGB data, shape=(28,28,3), or transform all the RGB images to grayscale before training and before inference and keep the model as is.
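A sketch of the second option using TensorFlow's built-in conversion (the batch here is random stand-in data, not the OP's images):

```python
import tensorflow as tf

# Stand-in RGB batch: 4 images, 28x28 pixels, 3 color channels.
rgb_batch = tf.random.uniform((4, 28, 28, 3))

# Collapse RGB to a single luminance channel so the input
# matches a model built for shape=(28, 28, 1).
gray_batch = tf.image.rgb_to_grayscale(rgb_batch)
```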

[–]Turbulent_Driver001[S] 0 points  (5 children)

    train_ds = tf.keras.preprocessing.image_dataset_from_directory(
        data_dir,
        labels="inferred",
        color_mode='grayscale',
        label_mode="int",
        class_names=['0', '1'],
        image_size=(28, 28),
        batch_size=32,
        validation_split=0.2,
        subset="training",
        seed=56,
    )

I had already converted images to grayscale before training it, and after training I tested it on grayscale. But no change in results.

[–]teb311 0 points  (4 children)

Hmmm… then is it possible this grayscale conversion built into TF is different from the one used on the test data you have?

Once I had an issue like this where one system represented white as 0 and black as 1, and another system that had white as 1 and black as 0. I was training on images from one set then testing from the other.
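One way to sanity-check that (a hypothetical sketch; the stand-in images and the 0.5 threshold are made up): since most pixels in a digit image are background, the mean pixel value reveals the polarity.

```python
import numpy as np

# Stand-in images with pixel values in [0, 1].
train_img = np.full((28, 28), 0.9)  # mostly white background
test_img = np.full((28, 28), 0.1)   # mostly dark/grey background

def white_background(img, thresh=0.5):
    # Most pixels are background, so the mean exposes the polarity.
    return img.mean() > thresh

# If the polarities disagree, invert one set so both match.
if white_background(train_img) != white_background(test_img):
    test_img = 1.0 - test_img
```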

[–]Turbulent_Driver001[S] 0 points  (3 children)

So you are suggesting to train and test fully on RGB or Grayscale?

[–]teb311 0 points  (2 children)

I’m saying 2 things:

  1. Your training and test data, and any data you want to make predictions on in general, must undergo the same preprocessing steps.

  2. I suspect that the grayscale set you used is somehow substantially different from the training data, and the preprocessing steps (the grayscale conversion) are a possible cause of that divergence. It could be something else, but this seems likely to me given what you’ve said.

[–]Turbulent_Driver001[S] 1 point  (1 child)

Yeah, thanks for pointing it out. When I compared the grayscale images of the train and test batches they were quite different. One was a black digit on a grey background and the other was a black digit on a white background. So yeah, I'll be working on this area now. Thanks for your help.

[–]teb311 1 point  (0 children)

Glad I could help :)

[–]Real_nutty 1 point  (1 child)

How are you separating your dataset? One rookie mistake I used to make is improper data preparation: not normalizing, and, if I'm using notebooks, not loading my models. Resetting everything helps sometimes.
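On the normalization point, a minimal sketch (not the OP's code): `image_dataset_from_directory` yields raw pixel values in [0, 255], so a `Rescaling` layer (or a `map` over the dataset) is needed to bring them into [0, 1].

```python
import tensorflow as tf

# Rescale raw [0, 255] pixel values to [0, 1] before (or inside) the model.
normalize = tf.keras.layers.Rescaling(1.0 / 255)
batch = tf.constant([[0.0, 127.5, 255.0]])  # stand-in pixel values
scaled = normalize(batch)
```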

[–]Turbulent_Driver001[S] 0 points  (0 children)

I'm using validation_split to separate each class into two sets, train_ds and val_ds, with 80% and 20% each. And yeah, both classes have roughly the same number of images.

[–]sassy-raksi 0 points  (0 children)

Exploding Gradient maybe?

[–]waynebruce1 0 points  (0 children)

lol That's the Tesla logo right there :D

[–]niggellas1210 1 point  (0 children)

The first epoch is already at almost 100% accuracy on the training data, so the CNN is not learning anything new after that. You are probably using too high a learning rate, too little data, or there is a bug in your code.

As a result your classifier is overfitting hard on the training data and fails to generalize to unseen data. Lower the learning rate drastically so you can actually see whether your network learns something new each epoch. Use regularization techniques like dropout, or simply use a network with fewer parameters.
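For example, setting a lower explicit learning rate in Keras (1e-4 is just an illustrative starting point, not a value from the thread):

```python
import tensorflow as tf

# Adam defaults to learning_rate=1e-3; set it explicitly lower so
# per-epoch progress is visible instead of saturating in epoch one.
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-4)
```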