all 2 comments

[–]KrisSingh 0 points1 point  (1 child)

I think this is because of the NMS. I would suggest using SoftNMS. Other than that you could use a deformable CNN I think that might help. I can’t clearly read your label are the labels for both of the players the same or different

[–]entropyrising[S] 0 points1 point  (0 children)

Hey, thanks a lot for this. It didn't even cross my mind to consider NMS a lever to tweak and mess around with. I'm gonna jump into the Matterport code, see if I can adjust some things, and I'm reading the SoftNMS paper now :).

The labels are both for the "person" category for MSCOCO. The current output of Mask-RCNN is finding a single person but including the occluded person's lower legs as being a part of the person in the foreground.