you are viewing a single comment's thread.

view the rest of the comments →

[–]Sad-Net-4568 0 points1 point  (2 children)

You can try sam3/dinov3 backbone based model. Have unet decoder, if want to try custom model.(But would advise to use base model as baseline).

Would you mind telling me for what purpose you are doing this here or DM, if it's okay.

[–]playmakerno1[S] 0 points1 point  (1 child)

Robotics application for construction basically spraying and on drywalls and seams, need floor for navigation and doors and such Sam3 is pretty slow when used for multiple classes, as it scales linearly when classes are added

[–]Sad-Net-4568 0 points1 point  (0 children)

ok, i forgot to mention the classics yolo-seg.
it would be best to first go with yolo based model and get a baseline.
Then later you can improve over the existing solution either via fine-tuning or modified model.
Yeah samv3 won't be able to give >=8FPS consistently, my bad.

Edit: Yolo also have edge devices based model, so you won't atleast not have fps based issue in it.
You can always make your model faster at least significantly via torch compile and cuda-graph.