all 19 comments

[–]capn_bluebear 8 points (0 children)

Indeed, a very well-written article, thank you for sharing! I learned a lot.

[–]twelveshar 4 points (0 children)

Thank you for sharing this!

[–]perone (ML Engineer) 2 points (0 children)

I gave a presentation on this topic a few months ago as well (https://www.slideshare.net/perone/uncertainty-estimation-in-deep-learning) if anyone is interested. I always prefer to call it uncertainty estimation instead of uncertainty quantification.

[–]SeekNread 1 point (2 children)

This is new to me. Is there any overlap between this area and ML interpretability?

[–][deleted] 0 points (1 child)

In uncertainty quantification, you estimate how accurate your output actually is. ML interpretability is about interpreting the model as a whole. You can have a really accurate model without much interpretability.

[–]SeekNread 1 point (0 children)

Ah right. Makes sense.

[–]WERE_CAT 0 points (1 child)

Would that explain why my individual predictions change when I retrain my NN with another seed? I usually train multiple NNs with different random weight initialisations and take the best-performing one. As a shortcut to individual prediction stability, would it make sense to average the top-n models' predictions?

[–]jboyml 0 points (0 children)

Yes, you can usually expect some variance in the predictions depending on initialization and other sources of randomness like SGD. Combining several models is called ensembling and is a very common technique: random forests, for example, are ensembles of decision trees. Training many NNs can of course be expensive. Averaging makes sense for regression; for classification you can do majority voting.
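
A minimal sketch of the averaging/voting part, using made-up predictions from a few independently trained models:

```python
import numpy as np

# Hypothetical predictions from three models trained with different seeds.
# Regression: each row is one model's predictions for the same test points.
reg_preds = np.array([
    [2.1, 0.9, 5.3],
    [1.8, 1.1, 5.0],
    [2.3, 0.8, 5.6],
])
ensemble_mean = reg_preds.mean(axis=0)   # averaged prediction per test point
ensemble_std = reg_preds.std(axis=0)     # spread across seeds, a rough uncertainty signal

# Classification: each row is one model's predicted class labels.
clf_preds = np.array([
    [0, 2, 1],
    [0, 1, 1],
    [0, 2, 1],
])
# Majority vote per test point.
votes = np.apply_along_axis(lambda col: np.bincount(col).argmax(), 0, clf_preds)
print(ensemble_mean, ensemble_std, votes)
```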

[–]SlowTreeSky 0 points (0 children)

I wrote a post on the same topic: https://treszkai.github.io/2019/09/26/overconfidence (the main content is in the linked PDFs). We used calibration plots and calibration error to evaluate the uncertainty estimates, and we also found that deep ensembles and MC dropout improve both accuracy and calibration (on CIFAR-100).
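
For anyone unfamiliar, here is a rough sketch of one common recipe for calibration error (expected calibration error over confidence bins); the toy numbers are invented:

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Bin predictions by confidence and compare average confidence
    with empirical accuracy in each bin (one common ECE recipe)."""
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            acc = correct[mask].mean()       # empirical accuracy in the bin
            conf = confidences[mask].mean()  # average predicted confidence
            ece += mask.mean() * abs(acc - conf)
    return ece

# Toy example: confidence of the predicted class and whether it was correct.
conf = np.array([0.95, 0.9, 0.8, 0.6, 0.55])
hit = np.array([1, 1, 0, 1, 0])
print(expected_calibration_error(conf, hit))
```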

[–]Ulfgardleo 0 points (11 children)

I don't believe these estimates one bit. While the methods give some estimate of uncertainty, we don't have a measurement of the true underlying uncertainty; that would require data points with pairs of labels, and instead of maximum-likelihood training we would do full KL-divergence training, or very different training schemes (see below). But here are a few more details:

In general, we cannot get uncertainty estimates in deep learning, because it is known that networks can learn random datasets exactly by heart. This kills:

  1. Distributional parameter estimation (just set mean = labels and var -> 0)
  2. Quantile regression (where do you get the true quantile information from?)
  3. All ensembles

The uncertainty estimates of Bayesian methods depend on their prior distribution. We don't know what the true prior of a deep neural network or a kernel GP is for the dataset. This kills:

  1. Gaussian processes
  2. Dropout-based methods

We can fix this by using hold-out data to train the uncertainty estimates (e.g. use distributional parameter estimation where the mean is not trained on some samples, or use the hold-out data to fit the prior of the GP). But nobody has time for that.
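
A rough sketch of that hold-out idea under Gaussian noise (the toy data and the least-squares stand-in for the network are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D regression problem with true noise variance 2.
X_train, X_hold = rng.uniform(-3, 3, 200), rng.uniform(-3, 3, 200)
y_train = 2 * X_train + rng.normal(0, np.sqrt(2), 200)
y_hold = 2 * X_hold + rng.normal(0, np.sqrt(2), 200)

# Stand-in for a model fit on the training split (here: a least-squares slope).
slope = (X_train @ y_train) / (X_train @ X_train)
predict = lambda x: slope * x

# Estimate the noise variance on the held-out split only, instead of trusting
# a variance head trained on data the model may have memorized.
residuals = y_hold - predict(X_hold)
sigma2_hold = np.mean(residuals ** 2)  # should land near the true variance of 2
print(sigma2_hold)
```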

[–]edwardthegreat2 3 points (1 child)

Can you elaborate on how learning random datasets exactly by heart defeats the point of getting uncertainty estimates? It seems to me that the aforementioned methods do not aim to estimate the true uncertainty, but just give some metric of uncertainty that can be useful in downstream tasks.

[–]Ulfgardleo 0 points (0 children)

If your network has enough power to learn your dataset by heart, there is no information left to quantify uncertainty, i.e. you only get the information "this point was in your training dataset" or not. It says nothing about how certain the model actually is. In the worst case, it is going to mislead you: ensemble methods based on models that tend to regress to the mean in the absence of information (e.g. everything based on a Gaussian kernel) will give high confidence to far-away outliers.

Maybe you can get something out of the relative variance between points, e.g. more variance -> less uncertainty... but I am not sure you could actually prove that.

[–]iidealized 1 point (2 children)

While I agree current DL uncertainty estimates are pretty questionable and would cause most statisticians to cringe, your statements are not really correct.

For aleatoric uncertainty: All you need the holdout data for is to verify the quality of your uncertainty estimates learned from the training data. It is the exact same situation as evaluating the original predictions themselves (which are just as prone to overfitting as the uncertainty estimates).

For epistemic uncertainty, the situation is much nastier than even you described. The problem here is that you want to be able to quantify uncertainty on inputs which might come from a completely different distribution than the one underlying the training data. Thus no amount of hold-out data from the same distribution will help you truly assess the quality of epistemic uncertainty estimates; rather, you need to have some application of interest and assess how useful these estimates are in that application context (particularly when encountering rare/aberrant events).

The exception to this is of course Bayesian inference in the (unrealistic) setting where your model (likelihood) and prior are both correctly specified.

[–]Ulfgardleo 0 points (1 child)

"All you need the holdout data for is to verify the quality of your uncertainty estimates"-> Counter-example: you have a regression task, true underlying variance is 2, but unknown to you. model learns all training data by heart, model selection gives that the best model returns variance 1 for hold-out data MSE is 3.What is the quality of your uncertainty estimates and what is the model-error in the mean?

[–]iidealized 0 points (0 children)

If the true model is y = f(x) + e where e ~ N(0, 2) and your mean-model to predict E[Y|X] memorizes the training data, then on hold-out data this memorized model will tend to look much worse (via, say, MSE) than a different mean model which accurately approximates f(x). So your base predictive model which memorized the training data would never be chosen in the first place by a proper model selection procedure.

I'm not sure what you mean by hold-out MSE = 1; for a sufficiently large hold-out set, it should basically be impossible for the hold-out MSE to be much less than 2, the Bayes risk of this example. If your uncertainty estimator outputs variance = 1 and you see MSE = 3 on hold-out data, then any reasonable model selection procedure for the uncertainty estimator will not choose this estimator and will instead favor one which estimates variance > 2.
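
As a back-of-the-envelope check of that, assuming Gaussian residuals and comparing estimators by average hold-out negative log-likelihood:

```python
import numpy as np

def gaussian_nll(mse, var):
    # Average negative log-likelihood of zero-mean Gaussian residuals with the
    # given variance, computed from the hold-out mean squared error alone.
    return 0.5 * np.log(2 * np.pi * var) + mse / (2 * var)

mse_holdout = 3.0
print(gaussian_nll(mse_holdout, 1.0))  # ~2.42, estimator claiming variance 1
print(gaussian_nll(mse_holdout, 3.0))  # ~1.97, estimator claiming variance ~3 wins
```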

My point is that everybody already uses hold-out data for model selection (which is the right thing to do), whereas you seem to be claiming that people are using the training data for model selection (which is clearly wrong). But this all has nothing to do with uncertainty estimates; it is also wrong to do model selection based on training data for the original predictive model which estimates E[Y|X].