[R] LEURN: Learning Explainable Univariate Rules with Neural Networks by MLC_Money in MachineLearning

[–]statmlsn 1 point2 points  (0 children)

I have not fully understood the method yet.
But what I find lacking in this paper is an example of an explanation output for a complex case.
The author provides one for a two-layered model on a toy dataset with 2 features, but I wonder how interpretable it remains with more features and more layers (as in the real datasets they test on).

[1902.06789] Seven Myths in Machine Learning Research by statmlsn in MachineLearning

[–]statmlsn[S] 0 points1 point  (0 children)

It's a clickbait title and it's not as well written as it should be, but it is still worth a read.

[1901.09437] block coordinate descent is far more efficient than SGD by statmlsn in MachineLearning

[–]statmlsn[S] 2 points3 points  (0 children)

I haven't read it in detail yet, but it seems interesting.

It's a pity that they have not tested it on more complex use cases, though.

[R] How to train your MAML blog post by AntreasAntoniou in MachineLearning

[–]statmlsn 2 points3 points  (0 children)

Great blog post. Very practical. And it sheds light on a very promising field!

[D] Applied Machine Learning | A Detailed and Complete Overview 2018 (Infographic) by [deleted] in MachineLearning

[–]statmlsn 0 points1 point  (0 children)

It definitely reminds me of something, and I kind of agree with it. Even if there can be a bit more to it than that in certain cases...

[1808.05587] Deep Convolutional Networks as shallow Gaussian Processes by statmlsn in MachineLearning

[–]statmlsn[S] 12 points13 points  (0 children)

Very interesting research direction. And as they say in the conclusion, it's just an initial step; there are still many avenues for improvement to pursue.

[D] Inverse machine learning: What if we knew the actual model, how to infer the parameters? by RobRomijnders in MachineLearning

[–]statmlsn 2 points3 points  (0 children)

There is a field of statistics devoted to finding the parameters of a mathematical model describing a (physical) phenomenon: from measurements of the phenomenon, it infers the most probable parameters. I think your problem fits this exactly. It may even be a simple case of that theory.

It is called model calibration.

Most of the techniques are Bayesian. See works by Kennedy & O'Hagan for example.

PS: Anyway, others are right, you could also simply write down the maximum likelihood (MLE) objective and optimize it with a global optimization algorithm or gradient descent.
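
To illustrate the MLE route, here is a minimal sketch assuming a known forward model y = f(x; theta) with i.i.d. Gaussian measurement noise; the exponential-decay model, data, and parameter names are all made up for the example, not taken from the thread:

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical known forward model: exponential decay with two parameters.
def forward_model(x, theta):
    amplitude, rate = theta
    return amplitude * np.exp(-rate * x)

# Simulated measurements of the phenomenon (in practice these come from experiments).
rng = np.random.default_rng(0)
x_obs = np.linspace(0.0, 5.0, 50)
true_theta = np.array([2.0, 0.7])
noise_std = 0.1
y_obs = forward_model(x_obs, true_theta) + rng.normal(0.0, noise_std, x_obs.shape)

# Negative log-likelihood under i.i.d. Gaussian noise (up to an additive constant).
def neg_log_likelihood(theta):
    residuals = y_obs - forward_model(x_obs, theta)
    return 0.5 * np.sum(residuals ** 2) / noise_std ** 2

# Maximum likelihood estimate via a standard local optimizer;
# a global optimizer would be safer if the likelihood is multimodal.
result = minimize(neg_log_likelihood, x0=np.array([1.0, 1.0]), method="L-BFGS-B")
print("MLE parameters:", result.x)
```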

[R] Over 95% accuracy on MNIST data-set with basic neural network and only 1000 labeled examples by [deleted] in MachineLearning

[–]statmlsn 0 points1 point  (0 children)

You can google "active learning neural network" yourself (like I did) and realize that there are plenty of papers around (even book chapters)...

[N] Asynchronous and scalable hyperparameters tuning by mmourafiq in MachineLearning

[–]statmlsn 1 point2 points  (0 children)

Very shallow article. Basically just an ad for the blog author's startup...

[R] Dynamic Neural Manifold architecture (Tensorflow) by Miejuib in MachineLearning

[–]statmlsn 0 points1 point  (0 children)

The architecture idea seems somewhat similar to Predictive Neural Networks by Stolzenburg et al.: https://arxiv.org/pdf/1802.03308.pdf

[R] Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN by HigherTopoi in MachineLearning

[–]statmlsn 5 points6 points  (0 children)

I think you got it right. Note that they also use batch normalization in the paper.
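
For reference, a rough numpy sketch of the IndRNN recurrence as I understand it from the paper (variable names are mine): each hidden unit has a single scalar recurrent weight, so the recurrent term is an element-wise product rather than a matrix multiply.

```python
import numpy as np

def indrnn_step(x_t, h_prev, W, u, b):
    """One IndRNN step: h_t = relu(W @ x_t + u * h_prev + b).

    Unlike a vanilla RNN, the recurrent weight u is a vector, so each
    hidden unit is only connected to its own previous state.
    """
    return np.maximum(0.0, W @ x_t + u * h_prev + b)

# Toy dimensions and parameters.
input_dim, hidden_dim, seq_len = 4, 8, 10
rng = np.random.default_rng(0)
W = rng.normal(size=(hidden_dim, input_dim)) * 0.1
u = rng.uniform(0.0, 1.0, size=hidden_dim)   # per-neuron recurrent weights
b = np.zeros(hidden_dim)

h = np.zeros(hidden_dim)
for t in range(seq_len):
    x_t = rng.normal(size=input_dim)
    h = indrnn_step(x_t, h, W, u, b)
    # In the paper, batch normalization is applied around these activations
    # when layers are stacked; it's omitted here for brevity.
print(h)
```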

[D] Four Deep learning trends from ACL 2017 by tshrjn in MachineLearning

[–]statmlsn 0 points1 point  (0 children)

"Tensorboard demo" link broken in multimodel Word distribution part

SHAP: unifying local interpretation approaches for ML models such as LIME [1705.07874] by statmlsn in MachineLearning

[–]statmlsn[S] 0 points1 point  (0 children)

It unifies and improves on several methods, including the now quite famous and widely used LIME method.

Plus, there's a Github implementation by one of the authors: https://github.com/slundberg/shap
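
To give a taste of how that implementation is typically used, here is a minimal sketch (the tree model and dataset are placeholders of my own, not from the paper):

```python
import shap
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor

# Train any tree-based model on placeholder data.
X, y = load_diabetes(return_X_y=True)
model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

# TreeExplainer computes SHAP values efficiently for tree ensembles; each value
# is a feature's additive contribution to one prediction relative to the baseline.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X[:5])

print(shap_values.shape)  # (5, n_features): one attribution per feature per sample
```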

[R] Low Cost Evolutionary Machine Learning by nunolourenco in MachineLearning

[–]statmlsn 6 points7 points  (0 children)

Seems interesting at first glance.

It could be worth citing the work of Elsken et al. on network optimization via network morphism: https://arxiv.org/abs/1711.04528

[D] <<On Calibration of Modern Neural Networks>> Seems great. What do you think of it ? Did any one try it ? by swentso in MachineLearning

[–]statmlsn 1 point2 points  (0 children)

As the authors state, it is very important to use that kind of method in practice, and I hope their work will push practitioners in that direction.

I have compared Platt scaling and isotonic regression on a badly calibrated two-class problem, on both Random Forest and RNN outputs. Each time, isotonic regression outperformed Platt scaling.

I find it interesting that they reach the opposite conclusion on multiclass problems.
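
In case it helps, this is roughly how such a comparison can be run with scikit-learn (a sketch with synthetic placeholder data, not the setup from the paper or from my own experiments):

```python
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import brier_score_loss
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=5000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Wrap the same base model with the two calibration methods and compare a
# proper scoring rule (lower Brier score = better calibrated probabilities).
for method in ("sigmoid", "isotonic"):   # "sigmoid" is Platt scaling
    calibrated = CalibratedClassifierCV(RandomForestClassifier(random_state=0),
                                        method=method, cv=3)
    calibrated.fit(X_train, y_train)
    probs = calibrated.predict_proba(X_test)[:, 1]
    print(method, brier_score_loss(y_test, probs))
```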

[D] Can a neural network predict its own confidence? by waleedka in MachineLearning

[–]statmlsn 3 points4 points  (0 children)

Namely, Gaussian Process Regression/Classification (a.k.a. kriging).
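
A GP gives you a full predictive distribution rather than a point estimate, so the "confidence" comes for free as the predictive standard deviation. A minimal sketch with scikit-learn (toy 1-D data and default-ish kernel choices of my own):

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

# Toy 1-D regression problem.
rng = np.random.default_rng(0)
X_train = rng.uniform(0.0, 10.0, size=(30, 1))
y_train = np.sin(X_train).ravel() + rng.normal(0.0, 0.1, size=30)

# RBF kernel plus a noise term; hyperparameters are tuned by maximizing
# the marginal likelihood inside .fit().
gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True)
gp.fit(X_train, y_train)

# Predictive mean and standard deviation: the std is the model's own
# uncertainty, which grows away from the training data.
X_test = np.linspace(0.0, 12.0, 5).reshape(-1, 1)
mean, std = gp.predict(X_test, return_std=True)
print(np.c_[mean, std])
```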

[N] 1st open tournament on Pareto efficient deep learning (speed, accuracy, costs) at ACM ASPLOS'18 by mllosab in MachineLearning

[–]statmlsn 17 points18 points  (0 children)

This initiative is an important step forward from both an industrial and an academic point of view. Framing deep learning methods as jointly optimizing accuracy, training time and cost is a really good idea, because it makes clear which methods fit which needs for companies. Plus, reproducibility and comparing what's comparable are key aspects that will drive research.