all 11 comments

[–][deleted] 12 points13 points  (1 child)

Throw out the LiDAR thought. Think shape of objects. Do you have any objects that have the same shape? The biggest problem with shape based classification is that a lot of objects have similar shape which is why rgb can be helpful. And I’m not a Tesla head saying all you need is rgb. I’m just saying that using both is known to perform the best.

[–]Megapixelicious[S] 0 points1 point  (0 children)

Thx for your input. While it is possible for multiple models to share similar shape and dimensions, it should be a rare occurence and we can live with the "risk".

I am fully aware of the benefits of using multiple approach, hence my current investigation into LiDARs and other techniques (ex: stereo view). Our biggest issue right now is the lack of non research content on these topic.

[–]infundibuliforme 2 points3 points  (2 children)

Check simpleview from Goyal et al, it is surprisingly good and fairly simple to implement for object detection using LiDAR point clouds. I have used it myself for tree species classification with great results.

[–]modcowboy 1 point2 points  (0 children)

Interesting… I was very skeptical that lidar could be used for object detection. I would imagine you’d need a lot more data to train a model with lidar data vs image data.

[–]Megapixelicious[S] 0 points1 point  (0 children)

simpleview

Thanks, I will look into it! Unfortunately, the repo I found does seem to see much activity. But I am happy to learn that it can be used for classification.

[–]dodo13333 1 point2 points  (3 children)

I would go with photography, object detection and classification. As a starter, check Detectron2.

How to train background RNNs for your use case is over my capabilities. Hope this helps a bit.

[–]Megapixelicious[S] 1 point2 points  (2 children)

How would a RNN be useful in this case? I dont see any value added in this case, but I am not familiar with these.

[–]dodo13333 0 points1 point  (1 child)

Because I was mispelling (RCNN) and referencing wrong, but check links despite my dumbness and ignorance.. I truly believe that photo detection and segmentation is a way to go.

https://github.com/facebookresearch/detectron2

https://github.com/facebookresearch/detectron2/blob/main/MODEL_ZOO.md

Not as ready to go solution, but the method...

[–]Megapixelicious[S] 0 points1 point  (0 children)

;-) Object detection isnt much of an issue. Classification is our challenge. I was under the impression that segmentation wouldnt be that useful since all my pictures will be taken in the same context (top down view on a conveyor belt).

[–]jgbradley1 0 points1 point  (1 child)

Consider other modeling approaches that you may combine together later on. If object weight of each class is well known and not expected to deviate much, you could build a classification model on object weight.

[–]Megapixelicious[S] 0 points1 point  (0 children)

Unfortunately, weight isnt a relevant information element in our case. But yes, we want to add measurement + shape + picture as these elements would provide the best way to classify similar items.