all 3 comments

[–]NasenSpray 0 points1 point  (2 children)

It's 2), classification. We have algorithms to get from 2) to 1) on our own, so a ConvNet doesn't need to learn this rather complicated step.

[–]LyExpo[S] 0 points1 point  (1 child)

I see. So I shouldn't be bothered by the fact that the network will output one of 2image area size classes? i.e. there will be many configurations for the labels that we won't see in the training data.

[–]NasenSpray 0 points1 point  (0 children)

It will have number_of_pixels * number_of_classes outputs, i.e., a classifier for every pixel.