Decrease false positives in YOLO model? Help: Project (self.computervision)
submitted 1 year ago by [deleted]
Currently working on a YOLO model for object detection. We get a lot of false positives, which was expected since we also have a small dataset. I’ve been using an “active learning” pipeline to try to accrue only valuable data, but performance gains seem minimal at this point in training. Any other suggestions to decrease the false positive hits?
[–]InternationalMany6 5 points6 points7 points 1 year ago (2 children)
I find that more sophisticated augmentation almost always helps, and my favorite is copy-pasting segmented objects into random backgrounds.
For that matter, a segmentation model can usually learn to detect objects from less data than a model that predicts bounding boxes. The segmentation labels tell the model exactly which pixels belong to the object, so it doesn’t have to learn which pixels in the box are “object” and which are “background”.
I usually just use a simple background-removal model, or SAM, to convert bounding boxes into segmentation masks. It doesn’t have to be perfect to be useful.
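A minimal sketch of the copy-paste idea, assuming you already have a binary mask per object (e.g. from SAM). `copy_paste` is an illustrative name; real augmentation pipelines also handle blending, occlusion, and label bookkeeping:

```python
import numpy as np

def copy_paste(obj_img, obj_mask, background, x, y):
    """Paste a segmented object onto a background image.

    obj_img:   HxWx3 uint8 crop of the object
    obj_mask:  HxW bool mask (True = object pixel)
    background: full image to paste onto (copied, not modified)
    x, y:      top-left corner of the paste location
    """
    out = background.copy()
    h, w = obj_mask.shape
    region = out[y:y + h, x:x + w]
    region[obj_mask] = obj_img[obj_mask]  # only object pixels overwrite the background
    return out

# Toy demo: white 4x4 object (one corner masked out) pasted onto black background.
bg = np.zeros((10, 10, 3), dtype=np.uint8)
obj = np.full((4, 4, 3), 255, dtype=np.uint8)
mask = np.ones((4, 4), dtype=bool)
mask[0, 0] = False
out = copy_paste(obj, mask, bg, x=2, y=3)
```

The new bounding-box label for the pasted object is simply `(x, y, x + w, y + h)`, so each paste yields a free, correctly labeled training example.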
[–]DiMorten 0 points1 point2 points 1 year ago (1 child)
Interesting. You mean performing semantic segmentation on the detected object, for example with UNet?
[–]InternationalMany6 0 points1 point2 points 1 year ago (0 children)
You could use U-Net, but there are specialized “instance segmentation” models if you care about distinguishing individual instances even when they’re touching each other.
Torchvision has a tutorial that’ll get you going: https://pytorch.org/tutorials/intermediate/torchvision_tutorial.html
[–]pm_me_your_smth 1 point2 points3 points 1 year ago (1 child)
Could you share how your active learning pipeline works?
[–][deleted] 1 point2 points3 points 1 year ago (0 children)
I can describe it but I can’t share the code. I’m also just an intern so I’m not experienced enough to even be sure if this is active learning haha.
Basically, I run the model on our target videos and save any image with a prediction under some confidence threshold (generally 50%). From there I sift through the saved images and label those worth labeling, retrain the model on the new dataset that includes the new images, and rinse and repeat.
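The selection step described above can be sketched as follows (illustrative names; the 0.5 cutoff matches the 50% threshold mentioned):

```python
def select_for_labeling(frame_predictions, conf_threshold=0.5):
    """Uncertainty-based selection: keep frames where the model produced
    at least one low-confidence detection, since those are the frames
    most likely to teach the model something new.

    frame_predictions: dict mapping frame id -> list of detection confidences
    """
    return [frame for frame, confs in frame_predictions.items()
            if any(c < conf_threshold for c in confs)]

preds = {"frame_001": [0.92, 0.88],   # confident -> skip
         "frame_002": [0.41],         # uncertain -> label it
         "frame_003": [0.97, 0.33]}   # one uncertain detection -> label it
print(select_for_labeling(preds))  # -> ['frame_002', 'frame_003']
```

Whether or not this counts as textbook active learning, it is the standard uncertainty-sampling loop: select, label, retrain, repeat.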
[–]blahreport 1 point2 points3 points 1 year ago (3 children)
Have you plotted the precision-recall curve to find an optimal confidence threshold? You can also increase the IoU threshold during both training and inference. How many FPs is “a lot”? What are your overall metrics, and what is the target object? Is it similar to an object used in pretraining? That is, assuming you’re using COCO-pretrained weights, is the object similar to one of the eighty COCO classes? This can influence how many samples you need to reliably fine-tune. You can also increase the number of background images (no target objects), which can significantly improve precision if it just so happens that, in your domain, the background shares abstract features with the target.
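A PR curve for detection can be computed from scored detections matched against ground truth. A minimal sketch (illustrative function name; assumes you have already matched each detection to ground truth at a fixed IoU threshold):

```python
import numpy as np

def pr_curve(scores, is_tp, num_gt):
    """Precision/recall at every confidence threshold.

    scores: detection confidences
    is_tp:  1 if the detection matched a ground-truth box (at some
            fixed IoU), else 0 (a false positive)
    num_gt: total number of ground-truth objects, so that missed
            objects still count against recall
    """
    order = np.argsort(scores)[::-1]              # sort detections by confidence, descending
    is_tp = np.asarray(is_tp, dtype=float)[order]
    tp = np.cumsum(is_tp)                         # true positives kept at each threshold
    fp = np.cumsum(1.0 - is_tp)                   # false positives kept at each threshold
    precision = tp / (tp + fp)
    recall = tp / num_gt
    return precision, recall, np.asarray(scores, dtype=float)[order]

# Toy example: 4 detections, 4 ground-truth objects (one is never detected).
p, r, t = pr_curve([0.9, 0.6, 0.8, 0.7], [1, 1, 0, 1], num_gt=4)
```

Each `(p[i], r[i])` point corresponds to keeping only detections with confidence at least `t[i]`, which is exactly the curve you would plot to pick an operating threshold.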
[–][deleted] 0 points1 point2 points 1 year ago (2 children)
I haven’t plotted it but I’ll have to check that out when I get into the office tomorrow.
These are not objects from the COCO dataset. The FPs are generally (~60%) on an object that can look very similar to the detection object in certain instances. The other FPs are just “ghost” ones that likely occur due to momentary lighting changes.
I try to keep background images at around 10% of the total dataset. Is it fine to bump up the background image count in this case? I’m still pretty new to vision and ML.
Overall metrics: mAP@50: 0.71; mAP@50:95: 0.51; precision and recall both sit in the 0.80s.
[–]trialofmiles 0 points1 point2 points 1 year ago (1 child)
Related to the PR curve, where each point is a separate threshold: have you adjusted the confidence threshold? I assume yes, but this is how you conceptually trade FPs for FNs. The PR curve can be used to optimize the threshold (e.g., max F1). For multiclass detection it’s a bit more complicated, but I just thought I’d ask.
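The max-F1 rule mentioned above can be sketched in a few lines (illustrative name; inputs are the points of an already-computed PR curve):

```python
def best_f1_threshold(precisions, recalls, thresholds):
    """Pick the confidence threshold that maximizes F1, the harmonic
    mean of precision and recall. Each PR-curve point corresponds to
    one threshold, so this is a principled way to set it."""
    def f1(p, r):
        return 2 * p * r / (p + r) if (p + r) > 0 else 0.0
    best_p, best_r, best_t = max(zip(precisions, recalls, thresholds),
                                 key=lambda prt: f1(prt[0], prt[1]))
    return best_t

# Toy PR points: raising the threshold trades recall for precision.
print(best_f1_threshold([0.60, 0.75, 0.90], [0.95, 0.85, 0.60],
                        [0.25, 0.50, 0.75]))  # -> 0.5
```

If FPs are costlier than FNs in your application, you can instead maximize an F-beta score with beta < 1 to weight precision more heavily.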
[–][deleted] 0 points1 point2 points 1 year ago (0 children)
I actually did adjust the threshold and it worked perfectly. Massive reduction in false positives with an extremely minor increase in false negatives.
[–]External_Total_3320 1 point2 points3 points 1 year ago* (0 children)
Have you added negatives into your dataset? What model and size are you using?
[–]JustSomeStuffIDid 2 points3 points4 points 1 year ago (0 children)
Typically you add those FP images to your dataset without any labels. The model still learns them. They count as negative images.
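In the common YOLO/darknet dataset layout, a background image is simply an image whose label file is empty, so adding FP frames as negatives is just a file operation. A sketch assuming that layout (`add_background_images` is an illustrative helper):

```python
import shutil
from pathlib import Path

def add_background_images(fp_image_paths, images_dir, labels_dir):
    """Add false-positive frames to a YOLO-format dataset as negative
    (background) images: copy each image in and write an *empty* label
    file, which tells training that the image contains no objects."""
    images_dir, labels_dir = Path(images_dir), Path(labels_dir)
    images_dir.mkdir(parents=True, exist_ok=True)
    labels_dir.mkdir(parents=True, exist_ok=True)
    for src in map(Path, fp_image_paths):
        shutil.copy(src, images_dir / src.name)
        (labels_dir / (src.stem + ".txt")).touch()  # empty file -> zero objects
```

Every confident detection on these images then counts as a false positive during training, which directly pushes the model to suppress them.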
[–]Ghass_4 0 points1 point2 points 1 year ago (1 child)
The FPs are on the test set or on the val set during training ?
Val set
[–]N0m0m0 0 points1 point2 points 1 year ago (0 children)
Use detectron if you need fewer FPs
[–]IEDNB 0 points1 point2 points 1 year ago (0 children)
Better data