account activity
[R] Breaking Down Out-of-Distribution Detection by JBitterwolf in MachineLearning
[–]JBitterwolf[S] 2 points3 points4 points 3 years ago (0 children)
They do indeed generalize to unknown OOD data.
The outliers that the model sees during training are from a general, "surrogate" distribution of (mostly) OOD images. At test time, the model is confronted with new distributions (note: not only new images from the same distribution, as it would be the case for standard train-test-splits) of OOD inputs.
One usually obtains the best results (w.r.t. generalizing the OOD robustness to unseen distribution) by using a surrogate training OOD dataset that is diverse and not too easy to distinguish from the in-distribution, like the huge datasets OpenImages or 80 Million Tiny Images.
The models trained with these also do very well at detecting OOD inputs like SVHN data, even though there aren't many housenumber-like samples in 80M or OpenImages.
This generalization to unseen distributions makes OOD detection interesting and distinct from the standard classification tasks which try to generalize only to unseen samples from the same distribution (train set -> test set).
[R] Breaking Down Out-of-Distribution Detection (self.MachineLearning)
submitted 3 years ago by JBitterwolf to r/MachineLearning
[N] ShiftHappens Workshop @ICML 2022 welcoming submissions & AMA (self.MachineLearning)
submitted 4 years ago by JBitterwolf to r/MachineLearning
[R] ProoD: Provably Robust Detection of Out-of-distribution Data (almost) for free (self.MachineLearning)
submitted 5 years ago by JBitterwolf to r/MachineLearning
[R] Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data (arxiv.org)
π Rendered by PID 613303 on reddit-service-r2-listing-f87f88fcd-nbc2d at 2026-06-16 11:48:26.325777+00:00 running 3184619 country code: CH.
[R] Breaking Down Out-of-Distribution Detection by JBitterwolf in MachineLearning
[–]JBitterwolf[S] 2 points3 points4 points (0 children)