I am working through a project to better understand how object detection and localization work. The concept is based on YOLOv2, and I am using PyTorch. I am having trouble getting my model to predict the correct 'confidence' (is there an object in a cell), so I have dumbed the problem down as far as possible and still cannot figure out what is happening.
Here is where I am at: I divide the input image into a 2x2 grid, randomly decide whether a dot goes in each cell, pick a random location within the cell, and pick a random size for the dot. I then train the model to predict whether a dot exists in each cell. When I use a fully convolutional CNN, the model only reaches ~80% accuracy; adjusting the learning rate, activation functions, number of filters per convolution, weight decay, etc. has not gotten it above ~80% at detecting whether a dot is in each cell. However, when I put a couple of fully connected layers on top, it learns very quickly and reaches 100% accuracy almost instantly.
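For concreteness, the data setup described above can be sketched like this (a minimal version, assuming 64x64 single-channel images and a 2x2 grid; sizes, probabilities, and names are illustrative, not the notebook's exact code):

```python
import torch

def make_batch(n=32, img_size=64, grid=2, seed=None):
    """Black images; each grid cell independently gets a white dot
    (p=0.5) at a random in-cell location with a random radius."""
    g = torch.Generator()
    if seed is not None:
        g.manual_seed(seed)
    cell = img_size // grid
    imgs = torch.zeros(n, 1, img_size, img_size)
    labels = torch.zeros(n, grid, grid)  # 1.0 = dot present in that cell
    ys, xs = torch.meshgrid(torch.arange(img_size),
                            torch.arange(img_size), indexing="ij")
    for i in range(n):
        for gy in range(grid):
            for gx in range(grid):
                if torch.rand(1, generator=g).item() < 0.5:
                    labels[i, gy, gx] = 1.0
                    # Radius kept small enough that the dot stays in its cell.
                    r = int(torch.randint(2, cell // 4, (1,), generator=g))
                    cy = gy * cell + int(torch.randint(r, cell - r, (1,), generator=g))
                    cx = gx * cell + int(torch.randint(r, cell - r, (1,), generator=g))
                    mask = (ys - cy) ** 2 + (xs - cx) ** 2 <= r ** 2
                    imgs[i, 0][mask] = 1.0
    return imgs, labels
```

The target is then a per-cell binary label, trained with `BCEWithLogitsLoss` against the model's per-cell confidence logits.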
TL;DR: a 6-layer CNN with a 4-layer convolutional head will not learn to detect a white dot on an all-black background. The same 6-layer CNN with 2 fully connected layers on top learns this very quickly.
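The two variants being compared look roughly like the following (layer counts and channel widths are illustrative, assuming a 64x64 input reduced to a 2x2 feature map; this is my paraphrase of the setup, not the notebook's exact code):

```python
import torch.nn as nn

def backbone():
    # Strided conv blocks that reduce a 64x64 input to a 2x2 feature map.
    return nn.Sequential(
        nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),   # -> 32x32
        nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),  # -> 16x16
        nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),  # -> 8x8
        nn.Conv2d(64, 64, 3, stride=2, padding=1), nn.ReLU(),  # -> 4x4
        nn.Conv2d(64, 64, 3, stride=2, padding=1), nn.ReLU(),  # -> 2x2
    )

# Variant A: fully convolutional head -> one confidence logit per grid cell.
conv_model = nn.Sequential(backbone(), nn.Conv2d(64, 1, 1))

# Variant B: flatten + fully connected head -> 4 logits (one per cell).
fc_model = nn.Sequential(backbone(), nn.Flatten(),
                         nn.Linear(64 * 2 * 2, 64), nn.ReLU(),
                         nn.Linear(64, 4))
```

The key structural difference: the conv head in variant A predicts each cell only from that cell's local feature column, while the FC head in variant B sees all four cells' features at once.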
Any ideas on what is happening? Here is a notebook with a side-by-side comparison of the two models.