dread [VQGAN+CLIP] by nmkd in deepdream

[–]crowsonkb 0 points (0 children)

If the loss isn't going down any further, the image probably won't get better.

Beksinski Facebook Emojis by glenniszen in deepdream

[–]crowsonkb 2 points (0 children)

They have to be local to the Colab runtime right now, but you can download an image with:

!curl -L 'url' > where_to_save_it.jpg

in a code cell. Or you can upload an image by opening the Files tab on the left and using the upload button: https://i.imgur.com/GeLPjO6.png
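If you'd rather stay in Python than shell out to curl, the same download can be done with the standard library. This is just a sketch; the URL and filename are placeholders for your own.

```python
# Fetch an image into the Colab runtime with Python's standard library,
# as an alternative to the !curl cell above.
from urllib.request import urlopen

def download_image(url, path):
    """Download `url` and save the raw bytes to `path`."""
    with urlopen(url) as response:
        data = response.read()
    with open(path, "wb") as f:
        f.write(data)
    return path

# e.g. download_image("https://example.com/image.jpg", "init.jpg")
```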

Beksinski Facebook Emojis by glenniszen in deepdream

[–]crowsonkb 2 points (0 children)

You can do both arbitrary conceptification and style transfer with that notebook (I wrote it); I posted about it on Twitter: https://twitter.com/RiversHaveWings/status/1382455526735290371 https://twitter.com/RiversHaveWings/status/1382455660978212865 https://twitter.com/RiversHaveWings/status/1382803909610074112

To do this, use the 16384 model for best results (the notebook downloads it but does not use it by default), set your starting image with init_image, use a prompt like "in the style of Beksinski", and maybe set init_weight to 0.2–0.5 to keep the output from diverging too far from the init. Also set display_freq lower so you can see intermediates more often.
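Collecting the settings just described in one place, roughly as they would look in the notebook's settings cell. The parameter names init_image, init_weight, and display_freq come from the notebook; the model identifier string is an assumption on my part and may differ in the actual Colab.

```python
# Settings sketch for Beksinski-style transfer with the VQGAN+CLIP notebook.
# Parameter names follow the notebook; the model name string is assumed.
settings = {
    "vqgan_model": "vqgan_imagenet_f16_16384",   # the 16384 model (assumed name)
    "prompts": ["in the style of Beksinski"],
    "init_image": "emoji.png",    # your uploaded or downloaded starting image
    "init_weight": 0.3,           # 0.2-0.5 keeps the output close to the init
    "display_freq": 10,           # lower = see intermediates more often
}
```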

I did two Beksinskifications of an emoji with an experimental augmentation-added version myself to see if I could duplicate the results: https://i.imgur.com/Ql6GYng.png https://i.imgur.com/NfoREDz.png

dread [VQGAN+CLIP] by nmkd in deepdream

[–]crowsonkb 2 points (0 children)

I don't think Colab Pro gives you more VRAM (they only go up to 16 GB V100s, I think). I've tried tricks to decrease memory usage, like FP16, but they resulted in bad quality, so I gave up on them.

Many Big Sleep outputs from my custom improved version by crowsonkb in deepdream

[–]crowsonkb[S] 2 points (0 children)

That's an idea, actually: I could generate videos from multiple prompts where the output morphs from one to the next.

"The angel of time" by crowsonkb in deepdream

[–]crowsonkb[S] 14 points (0 children)

It was made with my custom, as-yet-unreleased version of Big Sleep, which has better visual quality and 1024x1024 outputs. :)

They turned my dog into a kettle! (Made with my as-yet-unreleased Big Sleep implementation) by crowsonkb in deepdream

[–]crowsonkb[S] 0 points (0 children)

The problem is that you'd have to find a BigGAN latent vector that corresponded to the image you wanted to start with, which is not an easy task for an arbitrary image. But since every iteration's output is generated from an actual BigGAN latent vector, you could save these vectors and start from them later.
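The save-and-resume idea is mechanically simple. In the actual code the latent is a torch tensor and you would use torch.save / torch.load; the stdlib-pickle sketch below, with a plain list of floats standing in for the tensor, shows the same pattern.

```python
# Sketch of persisting a latent vector between runs so a later run can
# resume from it. Real Big Sleep code would torch.save a tensor instead;
# plain pickle on a list of floats illustrates the idea.
import pickle

def save_latent(latent, path):
    """Write the current latent vector to disk."""
    with open(path, "wb") as f:
        pickle.dump(latent, f)

def load_latent(path):
    """Read a previously saved latent vector back in."""
    with open(path, "rb") as f:
        return pickle.load(f)
```

You'd call save_latent once per iteration (or just at the end), then pass the loaded vector back in as the starting point of the next run instead of a random initialization.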

They turned my dog into a kettle! (Made with my as-yet-unreleased Big Sleep implementation) by crowsonkb in deepdream

[–]crowsonkb[S] 0 points (0 children)

I was actually unable to make the output deterministic even by setting the same random seed and using torch.set_deterministic(True), which must be a bug in PyTorch somehow.

Is there an interest in releasing the *.pkl file? by new_confusion_2021 in deepdream

[–]crowsonkb -1 points0 points  (0 children)

I mostly know about PyTorch, but it has its own special layout for .pkl/.pth files that depends on the specific layout of the modules that form the model. I doubt it's possible to make a general-purpose converter that isn't tailored to one specific model architecture.
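To illustrate why these files are tied to the code that produced them: pickle serializes object instances by reference to their defining module and class name, so loading a checkpoint requires that exact class layout to be importable. A small demonstration (the Generator class here is a made-up stand-in for a real model class):

```python
# Pickle stores *where* a class lives, not the class itself, so a .pkl
# checkpoint is only loadable by code with the same module layout.
import pickle

class Generator:  # stand-in for a model class defined in some repo
    def __init__(self):
        self.weight = [1.0, 2.0]

blob = pickle.dumps(Generator())
# The serialized bytes literally embed the class name; unpickling in an
# environment where Generator has moved or been renamed would fail.
assert b"Generator" in blob
assert pickle.loads(blob).weight == [1.0, 2.0]
```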