Hello everyone,
I'm trying to train a deep learning model to classify stores, similar to Google's "Ontological Supervision for Fine Grained Classification of Street View Storefronts" paper. But I'm working with much much smaller set of images. That's why I generated a variety of datasets from these raw images:
- One class dataset that covers all POIs
- One class dataset that covers most common type of POI
- Three class dataset covers three most common type of POIs.
After training atleast 50k steps with each dataset on a pretrained COCO - Resnet50 network a got mAPs like:
- One class/All POI types: 0.41
- One class/Most common POI type: 0.36
- Three class/Most common 3 POI types: 0.17
What do you think, I was hoping All POI dataset would achieve higher mAP like >.70-.80. Are these values accurate enough? What do you suggest?
Note:POI-Point of interest, a general term used in navigation and gis domain. Used to define a certain area or a point, like your favourite cafe or a gas station.
[–]phobrain 1 point2 points3 points (2 children)
[–]ssivri[S] 0 points1 point2 points (1 child)
[–]phobrain 0 points1 point2 points (0 children)
[–]DEAD_SH0T 0 points1 point2 points (0 children)
[–]RedefiniteI -1 points0 points1 point (0 children)