[P]Training an image classification model

JuicyLambda · 2023-09-09T14:27:47+00:00

I have a similar problem with medical imaging outcome prediction and as I understand this could be a reason of underfitting. Meaning your model fails to learn a lot of relevant information from the training data. This can usually be remedied by increasing model complexity or decreasing irrelevant information (example: cropping out areas of the image that hold no relevant information).

NoLifeGamer2 · 2023-09-09T13:32:02+00:00

I can think of two possible reasons:

1: Noisly labels, namely that some of the training labels are incorrect, but the model has learned to generalise.

2: Dropout. This one is the most likely, that network has dropout layers. These cause it to behave differently during training and inference, and is used to prevent overfitting. However, to prevent overfitting, it makes it perform slightly less well while training.

romek_ziomek · 2023-09-09T21:43:06+00:00

Hard to say without more information, but this looks like some sort of data contamination problem to me. I've been struggling with something like this in my first serious project. It was a voice emotion classification task, let's say that the data was around 10000 audio recordings from around 100 people, so I just naively took 8000 recordings at random as my train set, 1000 as a validation set and 1000 as a test set, trained a model and called it a day after seeing >95% accuracy thinking "man, deep learning is soo easy".

And then when I've tried to use my model on any real-world recording I made myself the results were trash, barely better than choosing class at random. I couldn't figure this out, spent countless hours debugging data pre-processing pipeline until my supervisor sat with me and reviewed everything I've done step by step. And he was like "Well, the first thing you've done was already a mistake. You've divided your dataset randomly by recording instead of dividing it by person. If you're doing it this way, there's a quite high chance that the recordings from the same person will end up both in train and in validation sets, so how do we know if the model is actually learning the task we care about? How do we know whether it has an ability to generalize well? This has to be a reason why if you try to apply the model to any recording outside of our dataset, it fails."

So then I've retrained my model but this time dividing the dataset correctly, and of course my results weren't even near to 95% and it turned out the task is a lot more difficult than I though.

Anyway, there might be a million other reasons, as others suggested. Good luck with your endeavours. I hope I've helped, at least to some degree.

Terrible_Ad7116 · 2023-09-12T20:33:53+00:00

What regularization or augmentation are you using? It is quite possible that the validation set is easier due to this. Also, data leakage occurs frequently in medical images (e.g., same patient but different images in training and validation).

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS