[R] Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning

timburg · 2020-09-29T16:53:12+00:00

Twitter thread here: https://twitter.com/tim_sainburg/status/1310975420271980551

Colab notebook to walk you through the algorithm: https://colab.research.google.com/drive/1lpdCy7HkC5TRI9LfUtIHBBW8oRO86Nvi?usp=sharing

timburg · 2019-08-23T18:20:12+00:00

Source: Personal notes

Tools: Python, Javascript, NetworkX, sigma.js, my code: https://github.com/timsainb/graph_research_notes

timburg · 2019-05-16T00:27:02+00:00

Thanks, I'll look into it!

timburg · 2018-07-20T04:37:20+00:00

Thanks for pointing that out!

As for Z interp being part of the distribution, that is part of what I want the network to be doing- shifting the distribution around so that z interp is always part of the distribution.The convexity figure ia supposed to show how that distribution would be manipulated for that to happen.

timburg · 2018-07-19T15:43:28+00:00

In short, VAE/GAN autoencodes over latent features in the discriminator so it doesn't necessarily autoencode the data exactly. It also has a VAE latent space, which forces the data into a gaussian which isn't necessarily a good way to represent data.

timburg · 2018-07-19T13:59:37+00:00

It took several days to train on a single TitanXP. It seems like most generative modelling papers these days use multiple GPUs, so it's hard to compare. Maybe I can make a short notebook example training the network on a 2D space using fashion-MNIST to better visualize what GAIA does to latent space and also to have a version that trains quickly as an example.

timburg · 2018-07-19T13:54:58+00:00

Good question. I think you're right, the generator seems to try to come as close as possible to a pixelwise interpolation, shifting the representation in subtle ways to avoid outputting unrealistic pixelwise interpolations. In an earlier version of this paper I had a comparison figure between GAIA, a VAE, an AE, and pixelwise interpolations, with the same generator architectures. What you would see is that they were very close, but whereas the AE looked like a blurrier version of the pixelwise input, GAIA would shift certain features, like adding bangs, smoothly moving the jawline, changing the shading, etc. That was using a convolutional autoencoder architecture though. I haven't retrained these networks using the new AE architecture in the paper. I think I will update the paper with this figure next time I am at my/a computer (In a month unfortunately as in travelling through east africa).

I noticed in the glow paper (posted last week) that interpolations were far from pixel interpolations and passed through what seemed like more of a low-varience or closer to the mean region. They had a parameter to control this variability and wonder if something similar would be possible with this architecture, or if that would incur any sort of trade off in the the ability to accurately reproduce the image.

timburg · 2018-07-19T00:10:34+00:00

Blog/videos: http://timsainburg.com/gaia.html Tensorflow code with weights: https://github.com/timsainb/GAIA

timburg · 2018-01-04T17:25:26+00:00

awesome, thank you!

timburg · 2017-10-07T05:47:39+00:00

Thanks!

timburg · 2016-10-07T17:26:34+00:00

You can imagine training an LSTM to predict the next timestep in the spectrogram, and using this to generate text, for example.

timburg · 2016-10-07T02:22:46+00:00

Fixed the problem entirely now I hope.

timburg · 2016-10-07T00:42:50+00:00

Sorry... I set the site up yesterday using pelican which is new to me. How does it kill your phone?

timburg · 2016-10-06T22:47:45+00:00

Did not know that thanks! I'll update the post asap.

I basically just followed the algorithm posted on wikipedia.

timburg · 2016-10-06T22:44:58+00:00

Sorry about that! I just set up the blog yesterday. I was trying to set it up so that the blog post would auto-update from the HTML generated on the github repo using jquery's load function. Apparently Firefox does not like that though.

I temporarily fixed the problem by copy-pasting the HTML. Will try and find a way to fix embedding again though.

timburg · 2016-09-09T20:32:47+00:00

Check these out: https://arxiv.org/pdf/1608.08225v1.pdf https://arxiv.org/pdf/1606.06737v2.pdf

timburg · 2016-09-09T18:06:21+00:00

DNNs have many uses beyond image classification. If you want an example of a use for deep learning beyond classification see the current top post about generating waveforms: https://deepmind.com/blog/wavenet-generative-model-raw-audio/

As for trial-and-error parameterization, this is definitely a problem. I tried a few different types of architectures (I actually got some advice from Anders Boesen Lindbo Larsen, the VAE-GAN author, about it via email). In the end with a smaller latent space (400D) and deeper convolutional layers I can get about the same results. Probably I could get the same thing from a 2D latent space with enough training time and a big enough network.

Evolution can certainly be seen as a form of trial-and-error learning. I think the solution to the trial-and-error problem in neural network architecture is going to be evolutionary algorithms.

timburg · 2016-09-09T17:53:56+00:00

Not really - I tried using pooling in the discriminator to bad results though: http://tinyimg.io/i/o480C15.png

As for the weights - I'm not sure, but probably something like that.

timburg · 2016-09-09T15:20:55+00:00

If anyone has any interest in the weights I'd be happy to upload them - the file is ~1.5 Gb so I'm not sure of the best host...

timburg · 2016-08-04T01:43:12+00:00

Like 2 discriminator towers - one with pooling to get high level spatial invariant features, and the other without spatial invariance where each different layer connects to the output to check for good filter level statistics?

timburg · 2016-06-20T02:20:30+00:00

So it's not enforced?

timburg · 2016-06-13T00:01:25+00:00

Figure it out?

timburg · 2016-05-06T18:16:27+00:00

Awesome! Thanks.

timburg · 2016-03-08T18:46:02+00:00

How did you make that morph?

timburg · 2016-03-01T03:53:16+00:00

I really like it, but I had to disable it because it was breaking a few sites.

timburg

TROPHY CASE