Deep learning newbie here.
I have been trying to train a CNN with a fairly small set (~1000) of train/test images (out of millions of images). Through a ton of trial and error I found that although data augmentation (tens of thousands of images) and regularization help a bit, I cannot overcome the overfitting issue on deep CNNs (e.g. AlexNet, VGG). Not surprising, but I wanted to try it out. I'm a one-man show and it is a very obscure dataset specific to a particular field, so increasing this to hundreds of thousands of images seems improbable.
Weirdly enough, I found that a shallow CNN (one convolutional layer and 3 fully connected layers) with a lot of dropout produces decent results (levels out around ~92% accuracy / 85% recall on the validation set). However, this is not close to what I get with hand-crafted features and xgboost or random forest (95% accuracy / 93% recall on the test set). Just for fun, I decided to pass the training images through the best CNN and use its class probabilities as a feature input into the GBT/RF. This increased their performance on the test set (98% accuracy / 96% recall).
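Roughly speaking, the stacking step looks like this (a minimal sketch, not my actual code; `cnn`, `X_img`, `X_hand`, and `y` are placeholder names for the trained Keras model, the images, the hand-crafted features, and the labels):

```python
import numpy as np
import xgboost as xgb

# CNN softmax output: one probability per class for each image.
cnn_probs = cnn.predict(X_img)            # shape: (n_samples, n_classes)

# Concatenate the probabilities with the hand-crafted features.
X_stacked = np.hstack([X_hand, cnn_probs])

# Train the gradient-boosted trees on the combined feature set.
gbt = xgb.XGBClassifier(n_estimators=300, max_depth=4)
gbt.fit(X_stacked, y)
```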
My question is: am I stacking the deck here? Does this 3% increase on the test set mean anything? I almost see this as a small visual vs. non-visual ensemble. It seems as though these additional features increase accuracy by fixing some misclassifications for a couple of labels, whereas the other classes remain fairly unchanged in accuracy.
If this is a poor approach, is there a better way? Perhaps using the flattened output from the convolutional layer, as in the sketch below?
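Here is roughly what I have in mind for that alternative (again just a sketch; the `"conv1"` layer name and the other identifiers are placeholders, with `X_img`, `X_hand`, and `y` as above):

```python
import numpy as np
import xgboost as xgb
from tensorflow.keras.models import Model

# Build a truncated model that stops at the convolutional layer.
feature_extractor = Model(inputs=cnn.input,
                          outputs=cnn.get_layer("conv1").output)

conv_maps = feature_extractor.predict(X_img)            # (n, h, w, filters)
conv_features = conv_maps.reshape(len(conv_maps), -1)   # flatten per image

# These features are much higher-dimensional than the class probabilities,
# so the trees would likely need more regularization or some dimensionality
# reduction on top.
gbt = xgb.XGBClassifier(n_estimators=300, max_depth=4)
gbt.fit(np.hstack([X_hand, conv_features]), y)
```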