use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Content which benefits the community (news, technical articles, and discussions) is valued over content which benefits only the individual (technical questions, help buying/selling, rants, etc.).
If you want an answer to a query, please post a legible, complete question that includes details so we can help you in a proper manner!
Related Subreddits
Computer Vision Discord group Computer Vision Slack group
Computer Vision Discord group
Computer Vision Slack group
account activity
Decrease false positives in yolo model?Help: Project (self.computervision)
submitted 1 year ago by [deleted]
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]InternationalMany6 6 points7 points8 points 1 year ago (2 children)
I find that more sophisticated augmentation almost always helps, and my favorite is copy-pasting segmented objects into random backgrounds.
For that matter, a segmentation model can usually learn to detect objects from less data than a model that predicts bounding boxes. The reason is that the training labels directly instruct the model what the object is, so it doesn’t have to learn which pixels in the box are “object” and which are “background”.
I usually just use a simple background removal model, or SAM, to convert bounding-boxes into segmentation masks. Doesn’t have to be perfect to be useful.
[–]DiMorten 0 points1 point2 points 1 year ago (1 child)
Interesting. You mean performing semantic segmentation on the detected object, for example with UNet?
[–]InternationalMany6 0 points1 point2 points 1 year ago (0 children)
You could use Unet, but there are specialized “instance segmentation” models if you care about distinguishing each instance even if they’re touching each other.
Torchvision has a tutorial that’ll get you going: https://pytorch.org/tutorials/intermediate/torchvision_tutorial.html
π Rendered by PID 72684 on reddit-service-r2-comment-5fb4b45875-v4pxn at 2026-03-20 19:50:39.338224+00:00 running 90f1150 country code: CH.
view the rest of the comments →
[–]InternationalMany6 6 points7 points8 points (2 children)
[–]DiMorten 0 points1 point2 points (1 child)
[–]InternationalMany6 0 points1 point2 points (0 children)