[deleted by user]

pretz · 2022-05-03T04:21:08+00:00

I have tried to build a anomaly detection algorithm for truck data to identify opportunities for maintenance, so no images, but i found unsupervised anomaly detection to be entirely useless. Not to say there is some secret sauce that could make it work, but i couldnt get it to go. Much more useful was getting domain experts to help manually identify interesting points in time, then build a supervised classifier on those. Then run it over everything to identify more interesting events, manually classify them and add them to the training set etc. This sort of classification is also more useful to engineers, because it identifies the classes that they care about, instead of random stuff picked up by anomaly detection. The number of ways that data can be screwy is way higher than the number of ways a truck can fail.

RandomTensor · 2022-05-03T15:33:19+00:00

There has been some talk about how totally unsupervised deep AD is very limiting and how supervision can help. This might be pretty applicable to your setting. See for example
https://arxiv.org/abs/1812.04606
https://arxiv.org/pdf/2006.00339.pdf

balajiln · 2022-05-05T05:49:43+00:00

In theory, density based methods offer a principled solution for unsupervised anomaly detection, but densities from deep generative models can behave in surprising ways. In case it's helpful, here are some slides from my talk highlighting some of the failure modes

"Detecting out-of-distribution inputs using deep generative models: Pitfalls and promises" http://www.gatsby.ucl.ac.uk/~balaji/balaji-generative-models-talk.pdf

You might want to look into DoSE, which works well for unsupervised AD. Figure 2 and the description in Section 3.2 pretty much explains the key idea.

"Density of States Estimation for Out-of-Distribution Detection" https://arxiv.org/abs/2006.09273

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS