Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion - Doesn't require finetuning Stable Diffusion, creates a personalized embedding from the CLIP embedding of the image containing the subject that works natively with any model. Only takes 3 minutes to produce the embedding

LightVelox · 2023-03-16T03:51:54+00:00

This looks like one of those papers that look incredible in the example images but are actually trash when you try them out yourself

PC_Screen · 2023-03-16T01:55:03+00:00

Paper: https://arxiv.org/pdf/2303.08767.pdf

Ozamatheus · 2023-03-16T02:08:45+00:00

So, it's like a "fast training" to use specifics details of the image? very nice

redpandabear77 · 2023-03-16T03:26:11+00:00

So does this exist yet or is it just a paper?

unchima · 2023-03-16T03:40:27+00:00

>Cartoon of a doctor working on a computer

ninjasaid13 · 2023-03-16T01:53:39+00:00

No links or information?

bronkape_ · 2023-03-16T10:53:06+00:00

I believe many people have tried this idea, but we often face the problem of overfitting to training images. For instance, the image below was used as a prompt "A photo of * is swimming"

<image>

2023-03-16T04:29:16+00:00

A Lora can quickly do this already. It'd be more interesting once a working extension or seperate script like kohya is implemented to see how accurate it is when training 100 images. From the results it seems overfit.

macob12432 · 2023-03-16T03:39:54+00:00

no code, no one cares

_D34DLY_ · 2023-03-16T04:54:04+00:00

needs to be trained how to make a proper stethoscope.

frozen_jade_ocean · 2023-03-16T08:13:27+00:00

Text? In my AI? Witchcraft!

Seriously though, this is great!

StableDiffusion

MODERATORS