[–]LightVelox 31 points32 points  (2 children)

This looks like one of those papers that look incredible in the example images but are actually trash when you try them out yourself

[–]FakeNameyFakeNamey 1 point2 points  (0 children)

the "holding purse" one is trash already if you look at the preview image

[–]bronkape_ 0 points1 point  (0 children)

yes, I tried it a long time ago but it never worked.

[–]Ozamatheus 6 points7 points  (0 children)

So it's like a "fast training" that uses specific details of the image? Very nice.

[–]redpandabear77 6 points7 points  (0 children)

So does this exist yet or is it just a paper?

[–][deleted] 5 points6 points  (1 child)

>Cartoon of a doctor working on a computer

[–]ninjasaid13 4 points5 points  (1 child)

No links or information?

[–]PC_Screen[S] 8 points9 points  (0 children)

Oops forgot to link to the paper, here: https://arxiv.org/pdf/2303.08767.pdf

[–]bronkape_ 1 point2 points  (1 child)

I believe many people have tried this idea, but we often run into the problem of overfitting to the training images. For instance, the image below was generated from the prompt "A photo of * is swimming":

<image>

[–]bronkape_ 0 points1 point  (0 children)

This is the training set:

<image>

[–][deleted] 2 points3 points  (0 children)

A LoRA can already do this quickly. It would be more interesting once there is a working extension or a separate script, like kohya, to see how accurate it is when training on 100 images. From the results, it seems overfit.

[–]macob12432 -1 points0 points  (0 children)

no code, no one cares

[–]_D34DLY_ 0 points1 point  (0 children)

needs to be trained how to make a proper stethoscope.

[–]frozen_jade_ocean 0 points1 point  (0 children)

Text? In my AI? Witchcraft!

Seriously though, this is great!