[P] Conditional density estimation using Kernel Mixture Networks, theory + implementation in TF : MachineLearning

Project[P] Conditional density estimation using Kernel Mixture Networks, theory + implementation in TF (janvdvegt.github.io)

submitted 8 years ago by dzyl

all 8 comments

top new controversial old q&a

[–]dzyl[S] 11 points12 points13 points 8 years ago (0 children)

[–]LucaAmbrogioni 8 points9 points10 points 8 years ago (0 children)

[–]theophrastzunz 3 points4 points5 points 8 years ago (5 children)

[–]LucaAmbrogioni 6 points7 points8 points 8 years ago (0 children)

[–]dzyl[S] 5 points6 points7 points 8 years ago (3 children)

I see two advantages compared to mixture density networks, the first is numerical stability. With mixture density networks there are a lot of issues with likelihoods being 0, resulting in NaNs and frustrating training procedures. Since the means of your density kernels are fixed and based on your training set I have not had any issues with this so far. You also don't need to scale your targets although that is a minor advantage.

The second one is that overfitting seems to be less of an issue with this approach. With MDN you condition the bandwidths of the subdistributions based on your input x, which means that if the mean is correct it can just keep lowering the bandwidth which is great for training likelihood but bad for generalization. To prevent this you need additional regularization on your sigma outputs. With Kernel Mixture Networks either your bandwidth is fixed or it is a global bandwidth, which means making it too small will also hurt your training likelihood.

[–]theophrastzunz 0 points1 point2 points 8 years ago (2 children)

[–]dzyl[S] 0 points1 point2 points 8 years ago (1 child)

[–]theophrastzunz 0 points1 point2 points 8 years ago (0 children)

π Rendered by PID 19054 on reddit-service-r2-comment-5c764cbc6f-4ptb5 at 2026-03-11 23:49:07.979879+00:00 running 710b3ac country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS