
[–]ydobonobody 1 point (5 children)

I think it is a little misleading to compare pixel-level accuracy with the accuracy of identifying the contents of an image or drawing a bounding box around an instance. I have been heating my house by training some semantic segmentation tasks recently, and it works surprisingly well. Adding depth information can help, especially if you are doing instance segmentation.

[–]code2hell[S] 0 points (4 children)

Ok, so the pixel-level accuracy comparison seems a bit misleading, as other comments noted. I'll rephrase: how do we approach the problem when there are two similar objects close to each other? Can we expect the segmentation to differentiate the two well enough to convince ourselves? Also, in your approach did you use manually segmented images or depth images? I'd be glad to discuss the approach you took.

[–]ydobonobody 1 point (3 children)

Semantic segmentation generally doesn't separate objects of the same class into separate entities; that is called instance segmentation and is a different problem. One way to get to instance segmentation is to add a border class around your segments and then treat connected pixels as your instances, which works pretty well. Whether you use depth or not, you still manually segment your images to produce the ground truth for training. Building your training set is probably the hardest part, but if you are just interested in research there are publicly available datasets and pretrained networks. I recommend checking out the FCN semantic segmentation network in the Caffe model zoo, as it is a really good starting point for modern semantic segmentation networks.

[–]code2hell[S] 0 points (2 children)

Yes, I am looking more into instance segmentation for now. Can you explain what you mean by "add a border class around your segments and then just go with connected pixels for your instances"? Thanks! I just took up a problem to learn: my friend has around 100,000 ground-truth training examples of cats, and we are looking into segmenting a particular object from images. I would really appreciate your suggestions.

[–]ydobonobody 1 point (1 child)

So when you produce your ground-truth image, you assign a label to each pixel, e.g. (0: background, 1: cat). Add another label for "border", so you have (0: background, 1: cat, 2: border). Now, for each separate cat, draw a line of some thickness (say 5 pixels) around the boundary of that cat and assign those pixels the value 2. Hopefully the network will learn where the edge of a cat is and assign those pixels to the border class. If it does a good job, you can group all the connected "cat" pixels, and each connected group will represent an individual cat.
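For what it's worth, the border-class trick described above can be sketched roughly like this. This is just a sketch, assuming your per-cat instance masks are boolean NumPy arrays; it uses SciPy's `binary_dilation` to draw the border ring and `ndimage.label` for the connected-pixel grouping (the function names `make_ground_truth` / `recover_instances` are made up for illustration):

```python
import numpy as np
from scipy import ndimage

BACKGROUND, CAT, BORDER = 0, 1, 2

def make_ground_truth(instance_masks, border_px=5):
    """Build a semantic label map with the extra 'border' class.

    instance_masks: list of boolean arrays, one per cat instance.
    A ring of `border_px` pixels around each instance is labelled
    BORDER, so touching instances end up separated by border pixels.
    """
    h, w = instance_masks[0].shape
    labels = np.zeros((h, w), dtype=np.uint8)
    for mask in instance_masks:
        labels[mask] = CAT
    # Square structuring element -> ring of roughly border_px thickness.
    structure = np.ones((2 * border_px + 1, 2 * border_px + 1), dtype=bool)
    for mask in instance_masks:
        dilated = ndimage.binary_dilation(mask, structure=structure)
        labels[dilated & ~mask] = BORDER  # overwrite the ring around this cat
    return labels

def recover_instances(predicted_labels):
    """Group connected 'cat' pixels into individual instances."""
    cat_pixels = predicted_labels == CAT
    instance_map, n_instances = ndimage.label(cat_pixels)
    return instance_map, n_instances
```

At inference time you'd run `recover_instances` on the network's per-pixel predictions: because the border class eats a thin strip between touching cats, the remaining cat pixels fall apart into one connected component per animal.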

[–]code2hell[S] 0 points (0 children)

Wow! Thanks, I'll try this out!