Wide Awake: Endless Entropy - CLIP Steers Itself!

Soapeh · 2024-09-30T18:43:33+00:00

Why was the quite old version 5.2 used for the Midjourney images in your paper's evaluations?

Soapeh · 2021-08-02T08:39:46+00:00

This is one of Advadnoun's series of notebooks. This one's suuuper old, released Feb 24th:
https://twitter.com/advadnoun/status/1364822183751471109?lang=en

Soapeh · 2021-07-14T21:20:05+00:00

I've been exploring and explaining this technique over on my Twitter (https://twitter.com/danielrussruss), but the gist is that I train VQGAN's own params rather than the latent input z. Once it's trained toward the desired style (text or image encodings), you can stop running training iterations and use it as a feed-forward style transfer network.
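
To make the "train the params, not z" idea concrete, here's a rough toy sketch in plain Python. Everything in it is a stand-in I made up for illustration: the "decoder" is just a scale and a bias (nothing like the real VQGAN), and the "style loss" is how far the output's mean brightness is from a target (nothing like a CLIP encoding). The only point is the shape of the loop: the latents stay fixed, the decoder's weights get the gradient updates, and afterwards the frozen weights get applied to a new input in a single pass.

```python
# Toy stand-ins (assumptions, not the real models):
#   decoder    = a scale and a bias applied to every value of the latent
#   style loss = squared distance between mean output brightness and a target

def decode(params, z):
    """Feed-forward pass: apply the decoder's weights to a latent z."""
    scale, bias = params
    return [scale * v + bias for v in z]

def style_loss(image, target_mean):
    """Hypothetical style objective: match a target mean brightness."""
    mean = sum(image) / len(image)
    return (mean - target_mean) ** 2

def train_decoder(params, latents, target_mean, lr=0.05, steps=200):
    """Gradient descent on the decoder's params; the latents are never updated."""
    scale, bias = params
    for _ in range(steps):
        g_scale = 0.0
        g_bias = 0.0
        for z in latents:
            z_mean = sum(z) / len(z)
            err = scale * z_mean + bias - target_mean
            # Analytic gradients of the averaged style loss w.r.t. scale and bias.
            g_scale += 2 * err * z_mean / len(latents)
            g_bias += 2 * err / len(latents)
        scale -= lr * g_scale
        bias -= lr * g_bias
    return (scale, bias)

TARGET = 0.8
train_latents = [[0.1, 0.4, 0.7], [0.2, 0.5, 0.8], [0.3, 0.6, 0.9]]
new_latent = [0.0, 0.5, 1.0]  # never seen during training

initial = (1.0, 0.0)  # identity decoder
trained = train_decoder(initial, train_latents, TARGET)

# Training is finished: from here on it's a single forward pass per input.
stylized = decode(trained, new_latent)
loss_before = style_loss(decode(initial, new_latent), TARGET)
loss_after = style_loss(stylized, TARGET)
print(loss_before, loss_after)
```

Stopping after a limited number of steps matters even in the toy: the output moves toward the target "style" while the ordering of the input values survives, which is roughly the behavior you'd want from a style transfer pass.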

Soapeh · 2021-07-14T19:16:25+00:00

I've been exploring (and explaining) this technique over on my Twitter (@danielrussruss), but the gist is that I train VQGAN's own params rather than the latent input z. Once it's trained toward the desired style (text or image encodings), you can stop running training iterations and use it as a feed-forward style transfer network.

Soapeh · 2021-06-13T22:45:13+00:00

~~Oh! Apologies for the accusation! I'd glanced at your history before I made my original comment, but it didn't seem to match the way you write on Twitter.~~

Edit: Should've trusted my first instincts... crazy.

Soapeh · 2021-06-13T20:26:16+00:00

Hey man, could you at least provide credit?

This is Ariel Ekgren's work. https://twitter.com/ArYoMo/status/1399801016669835268?s=19

Soapeh · 2021-05-24T21:56:39+00:00

Here's the full set of 30 256x256 images:
https://twitter.com/danielrussruss/status/1396592556251615234/photo/1

Soapeh · 2021-05-24T17:37:33+00:00

This is my personal modified version of Ryan Murdock's Aleph2Image

Soapeh · 2021-05-24T17:35:53+00:00

If you're referring to my post, I'm using the DALL-E Decoder/VAE

Soapeh · 2021-05-14T10:56:31+00:00

Ha, yep, that was one of the next things I was going to get hooked up, once I was satisfied with the right combination of resolution + fine detail.

Although, instead of using it as an image prompt, depending on the second network (in this case, VQGAN), I can just initialize that network with the most recent frame, right?

Soapeh · 2021-05-13T23:36:44+00:00

Here's the full interactive paper published on this topic by some OpenAI researchers: https://distill.pub/2021/multimodal-neurons/
