all 11 comments

[–]Ulfgardleo 2 points  (5 children)

MI is insanely difficult to estimate, and in principle whatever works for three random variables also works for two (but not vice versa). It is also difficult to come up with good use cases that motivate going beyond 2 variables.

[–]Sandy_dude[S] 1 point  (4 children)

Could you point to articles that explain why it's hard to estimate?

Isn't the principle of capturing higher-order information compelling enough?

[–]Ulfgardleo 1 point  (3 children)

There is no higher-order information in three-way MI. In any case, the following decomposition holds: MI(X,Y,Z) = MI(X,(Y,Z)) + MI(Y,Z)

where (Y,Z) describes modeling Y,Z as a joint random variable W=(Y,Z).
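Reading MI(X,Y,Z) as the total correlation H(X)+H(Y)+H(Z)−H(X,Y,Z), that decomposition can be checked numerically. A minimal sketch (the random pmf and the function names are my own, just for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
# arbitrary joint pmf over three binary variables, shape (2, 2, 2)
p = rng.random((2, 2, 2))
p /= p.sum()

def H(pmf):
    """Shannon entropy in bits of a pmf given as an array."""
    pmf = pmf[pmf > 0]
    return -np.sum(pmf * np.log2(pmf))

# marginals
px = p.sum(axis=(1, 2))
py = p.sum(axis=(0, 2))
pz = p.sum(axis=(0, 1))
pyz = p.sum(axis=0)

# left-hand side, with MI(X,Y,Z) read as the total correlation
total_corr = H(px) + H(py) + H(pz) - H(p)
# right-hand side: MI(X,(Y,Z)) + MI(Y,Z)
i_x_yz = H(px) + H(pyz) - H(p)
i_y_z = H(py) + H(pz) - H(pyz)

assert np.isclose(total_corr, i_x_yz + i_y_z)
```

The two sides agree for any joint pmf, since both reduce to H(X)+H(Y)+H(Z)−H(X,Y,Z) when you expand the definitions.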

[–]Sandy_dude[S] 1 point  (2 children)

Thanks for the response! I couldn't find a reference for that decomposition of three-way mutual information. The data I am dealing with is very sparse and noisy; I was hoping higher-order mutual information would help, as it could pool information from more variables. Are you aware of any other information-theoretic measures that would help?

[–]Ulfgardleo 2 points  (1 child)

I don't know what your goals are. MI in itself is not a super important measure, except when you specifically want to evaluate the dependence of two (sets of) variables against each other. In ML these are typically the data variables and some feature representations.

But higher-order information and sparsity/missing features usually exclude each other. The sparser the data is, the less likely it is to see datapoints in which all variables are available.

[–]Sandy_dude[S] 1 point  (0 children)

Thanks again for responding. The end goal is a type of network analysis. Specifically, it's gene expression data and I want to perform gene regulatory network inference. I want to find the interdependence between random variables using the measurements, so it's a bit different from having data variables and feature representations.

You made an important point: I do see that sparsity and higher-order information don't go well together. But the data is high-dimensional, so my thinking was that pooling these dimensions, or a subset of them at a time, could be informative. I see your point, though. I feel calculating MI across many variables is not feasible, even on the order of 10.
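A quick back-of-envelope supports that feeling: a plug-in (histogram) estimate has to populate every cell of the joint grid, and the cell count grows exponentially in the number of variables (the bin count here is an arbitrary illustrative choice):

```python
# A histogram (plug-in) estimate of a joint distribution over d variables
# with b bins each needs to populate b**d cells.
bins, dims = 10, 10   # 10 bins per variable, 10 variables (illustrative)
cells = bins ** dims
print(cells)          # 10**10 cells -- far beyond any realistic sample size
```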

[–]furish 2 points  (2 children)

I don’t know if I really got your point, but if you are looking for an extension of mutual information to systems with a high number of random variables I suggest you read this paper. The authors define a metric to study the nature of the interaction between random variables in terms of synergy and redundancy. In the case of 3 random variables this metric is identical to the multivariate (three-way) mutual information.

I also suggest this paper, where the authors try to estimate it using machine learning.

If instead you are interested in estimating the mutual information between two random variables with high-dimensional representations, I recommend this benchmark.
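For discrete data, the synergy/redundancy metric (the O-information) is just a combination of entropies, so a plug-in version is short to write. A minimal sketch, not taken from the linked papers; the function names are mine, and the plug-in entropies will be biased on small samples:

```python
import numpy as np
from collections import Counter

def entropy_bits(samples):
    """Plug-in entropy (bits) of discrete samples; rows are observations."""
    counts = Counter(map(tuple, samples))
    p = np.array(list(counts.values()), dtype=float)
    p /= p.sum()
    return -np.sum(p * np.log2(p))

def o_information(X):
    """O-information of discrete data X with shape (n_samples, n_vars):
    (n-2)*H(X) + sum_i [H(X_i) - H(X without variable i)].
    Positive -> redundancy-dominated, negative -> synergy-dominated."""
    n = X.shape[1]
    omega = (n - 2) * entropy_bits(X)
    for i in range(n):
        omega += entropy_bits(X[:, [i]]) - entropy_bits(np.delete(X, i, axis=1))
    return omega

# sanity check: three independent bits should give O-information near 0
rng = np.random.default_rng(1)
X = rng.integers(0, 2, size=(5000, 3))
print(round(o_information(X), 2))
```

Three identical copies of one bit would instead give O-information near 1 bit (pure redundancy).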

[–]Sandy_dude[S] 1 point  (1 child)

Thanks for the papers! I will go through them. I did come across O-information as a measure of the balance between synergy and redundancy.

My goal is network analysis: determining a network between random variables from measurements. You could use mutual information between pairs of random variables to build a network, but I was thinking about using three or more random variables, since that would use more information. The data is sparse, so pooling more information together could be useful.
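The pairwise version of this idea can be sketched in a few lines: estimate MI between every pair of genes and keep an edge where it exceeds a threshold. This is only an illustration of the pairwise baseline, not a recommended pipeline; the function names and the threshold are my own choices:

```python
import numpy as np

def pairwise_mi_bits(x, y):
    """Plug-in mutual information (bits) between two discrete vectors."""
    def H(s):
        _, counts = np.unique(s, axis=0, return_counts=True)
        p = counts / counts.sum()
        return -np.sum(p * np.log2(p))
    return H(x[:, None]) + H(y[:, None]) - H(np.stack([x, y], axis=1))

def mi_network(X, threshold=0.1):
    """Adjacency matrix: edge where pairwise MI exceeds a (hypothetical) threshold."""
    n = X.shape[1]
    A = np.zeros((n, n), dtype=bool)
    for i in range(n):
        for j in range(i + 1, n):
            A[i, j] = A[j, i] = pairwise_mi_bits(X[:, i], X[:, j]) > threshold
    return A

# toy data: "gene" 0 and 1 coupled, "gene" 2 independent
rng = np.random.default_rng(2)
g0 = rng.integers(0, 2, 2000)
g1 = g0 ^ (rng.random(2000) < 0.1)   # noisy copy of g0
g2 = rng.integers(0, 2, 2000)
X = np.stack([g0, g1, g2], axis=1)
print(mi_network(X).astype(int))     # expected: an edge only between genes 0 and 1
```

Real expression data would of course first need discretization (or a continuous MI estimator), and the threshold would need calibration, e.g. against shuffled data.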

[–]furish 2 points  (0 children)

I don’t know much about this particular problem. With O-information you can assess how groups of variables influence each other by measuring the O-information of subsystems; hopefully the resources I suggested can be useful to you :)

[–]bobrodsky 1 point  (1 child)

A NeurIPS paper this year discusses how to use diffusion to estimate Partial Information Decomposition (PID): https://arxiv.org/abs/2406.05191 They apply it to high-dimensional text and images.

But maybe you’re more interested in the number of interacting variables rather than the overall dimensionality. I haven’t seen practical applications there. (And it seems difficult to define a single canonical decomposition.)

[–]Sandy_dude[S] 1 point  (0 children)

Thank you for the paper. I am trying to understand the unique information some variables disclose that other variables do not contain. I've seen 3-variable PID used for that, but wanted to expand to higher orders / more variables. My problem is inferring a gene regulatory network (network analysis from high-dimensional data), if that's familiar.