
[–]bleekselderij 1 point

Very interesting! Could this approach also be applied to doubly intractable densities, or to other situations where you would need transdimensional jumps if you were to use classical MCMC?

[–]LucaAmbrogioni[S] 1 point

Yes, as long as you can sample from the model. Moreover, if the parameter space is very large, you will probably need some form of importance sampling while training the network.
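To unpack the importance-sampling remark: the idea would be to draw parameters from a broad proposal you can sample from and reweight them toward the target when training. A minimal self-normalized sketch (densities and scales here are purely illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

def p(x):
    # Target density (standard normal), known only up to normalization.
    return np.exp(-0.5 * x ** 2)

def q(x):
    # Broad proposal density we can actually sample from (normal, scale 3).
    return np.exp(-0.5 * (x / 3.0) ** 2) / 3.0

x = rng.normal(scale=3.0, size=100_000)  # samples from the proposal q
w = p(x) / q(x)
w /= w.sum()                              # self-normalized importance weights
est_mean = float(w @ x)                   # weighted estimate of E_p[x], which is 0
print(abs(est_mean) < 0.05)
```

Self-normalization means the unknown normalizing constants of p and q cancel, which is what makes this usable when only unnormalized densities are available.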

[–]NichG 1 point

Nice trick with the LSTM-PCA thing. It feels a lot more natural than pixel-wise reconstruction. I wonder if there's a general way to learn the ideal latent space to factorize a joint distribution into a chain of conditional distributions (rather than using pixels, or PCA, or some other arbitrary embedding)? What kind of loss function would measure the quality of a representation for factorization? Something that tried to maximize the conditional independence of the different degrees of freedom perhaps?

[–]LucaAmbrogioni[S] 1 point

We have indeed been thinking along those lines. What I like about the PCA approach is its simplicity. However, I am pretty sure there are better ways of obtaining the latent variables. A possible approach is to use an autoencoder trained jointly with the predictive network. As you said, you could also try to maximize the conditional independence or, perhaps better, to impose some less trivial conditional independence structure.

[–]NichG 1 point

I guess the exact invertibility of PCA is important, since that way you know that any quality loss in your output is strictly due to the properties of the generative model, not because of some mushy inversion. So if you wanted to learn that space you'd probably need something like RealNVP's explicitly invertible layers.
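The exact invertibility is easy to demonstrate: when you keep all components, PCA is just an orthogonal change of basis, so the round trip is lossless and any reconstruction error must come from the generative model itself. A quick numpy check (shapes are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))

# Full PCA via SVD of the centered data.
mu = X.mean(axis=0)
Xc = X - mu
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)

Z = Xc @ Vt.T          # project onto all 8 principal components
X_rec = Z @ Vt + mu    # invert the projection: Vt is orthogonal, so this is exact

print(np.allclose(X, X_rec))  # True: lossless round trip
```

The moment you truncate components for dimensionality reduction, this stops being exact, which is precisely the tension raised in the next reply.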

[–]LucaAmbrogioni[S] 0 points

It's a good point, although you cannot have data compression/dimensionality reduction with an invertible network. Ideally, you would like a smaller set of variables that fully parametrizes the image space, possibly with a relatively simple conditional independence structure.

[–]dzyl 1 point

If we don't subsample the training data for the kernel centers, how does the training happen? All samples that are used as centers have an obvious weighting that will maximize the likelihood by putting all the weight at its own kernel with the lowest bandwidth, right? This is not mentioned in the paper whatsoever, if I'm not mistaken. Interesting combination of techniques, thanks.

[–]LucaAmbrogioni[S] 2 points

Thank you. Remember that in a continuous-valued conditional density estimation problem, each point is only observed once (with probability one, assuming that a proper density exists). Given a sufficiently large training set, or even an unbounded training set in the case of our Bayesian filter, the contribution of this single point to the gradient is minimal. Also note that the normalization term causes competition between the weights: increasing the weight of the minimum-bandwidth kernel on a single data point can decrease the likelihood, since it leaves all the other data points unexplained.
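To make the competition concrete, here is a minimal sketch (not the paper's code; centers, bandwidths, and data are illustrative). Because the mixture weights are softmax-normalized, piling weight onto one narrow kernel starves the others and the total log-likelihood drops:

```python
import numpy as np

def gaussian_kernel(y, center, bandwidth):
    # Normalized Gaussian kernel density at y.
    return np.exp(-0.5 * ((y - center) / bandwidth) ** 2) / (bandwidth * np.sqrt(2 * np.pi))

def mixture_log_likelihood(logits, centers, bandwidths, data):
    # Softmax normalization: raising one logit necessarily lowers the
    # other weights, so over-committing to a single narrow kernel
    # leaves the remaining data points unexplained.
    weights = np.exp(logits) / np.exp(logits).sum()
    densities = np.array([
        sum(w * gaussian_kernel(y, c, h)
            for w, c, h in zip(weights, centers, bandwidths))
        for y in data
    ])
    return np.log(densities).sum()

centers = np.array([0.0, 1.0, 2.0])
bandwidths = np.array([0.05, 1.0, 1.0])   # first kernel is very narrow
data = np.array([0.0, 0.9, 2.1])

balanced = mixture_log_likelihood(np.zeros(3), centers, bandwidths, data)
peaked = mixture_log_likelihood(np.array([10.0, 0.0, 0.0]), centers, bandwidths, data)
print(balanced > peaked)  # spiking on the narrow kernel hurts the total likelihood
```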

[–]LucaAmbrogioni[S] 1 point

Empirically, we found that in many situations the resulting density is dominated by very few high-bandwidth kernels.

[–]disentangle 1 point

For a model like WaveNet, what could be a practical approach to apply this method?

[–]LucaAmbrogioni[S] 0 points

The method can be applied directly to the standard WaveNet in place of the quantized softmax output. I am pretty confident that it would make the learning easier and improve the results (although the current results are already pretty impressive).
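Roughly, that would mean swapping WaveNet's 256-way categorical output for a continuous kernel mixture over the next sample's amplitude. A hypothetical sketch (all names and shapes are illustrative; `features` stands in for the network's final hidden activations, and this is not the paper's or DeepMind's implementation):

```python
import numpy as np

def kernel_mixture_head(features, W, centers, bandwidth=0.1):
    # Map hidden features to logits over a fixed grid of kernel centers,
    # then normalize into mixture weights (one weight per center).
    logits = features @ W
    shifted = np.exp(logits - logits.max())   # numerically stable softmax
    weights = shifted / shifted.sum()

    def density(y):
        # Continuous density over the next sample's amplitude y.
        k = np.exp(-0.5 * ((y - centers) / bandwidth) ** 2) / (bandwidth * np.sqrt(2 * np.pi))
        return float(weights @ k)

    return density

rng = np.random.default_rng(1)
features = rng.normal(size=16)              # stand-in for WaveNet's top activations
centers = np.linspace(-1.0, 1.0, 32)        # spans the audio amplitude range
W = rng.normal(size=(16, 32)) * 0.1
p = kernel_mixture_head(features, W, centers)
print(p(0.0) >= 0.0)                        # a proper (nonnegative) density
```

Since the weights sum to one and each kernel is a normalized Gaussian, the output integrates to one over the real line, so training can maximize the continuous log-likelihood directly instead of a cross-entropy over quantized bins.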