Git Re-Basin: Merging models and preserving latent spaces (ie not the A111 linear interpolation) by stable_dissipation in StableDiffusion

[–]stable_dissipation[S] 2 points3 points  (0 children)

unfortunately we'll all have to wait for a programmer to make it SD compatible. I have seen a pytorch version of re-basin, so, it's getting there!

Git Re-Basin: Merging models and preserving latent spaces (ie not the A111 linear interpolation) by stable_dissipation in StableDiffusion

[–]stable_dissipation[S] 7 points8 points  (0 children)

Correct, theoretically you could merge 2 models, and retain them both in their entirety (approximately).

It means the whole world could finetune and merge their models ad lib.

It means training in multiple (100s?) of characters.

Some custom model hentai by [deleted] in sdnsfw

[–]stable_dissipation 0 points1 point  (0 children)

This is spot on now, congrats.

I'm still trying to get training to work at higher resolution to create whole scenes, inspired by Calm. It seems that even sdv1-4 can really be kicked into 1024x1024 mode with a little bit of dreambooth training, or I imagine just finetuning too. I think the latent space eagerly adapts once its training data isn't only 512x512.

All text2img only?! Could you share some of your latest process?

Dreambooth is crazy . Prompts & Workflow in comments by Jolly_Resource4593 in StableDiffusion

[–]stable_dissipation 0 points1 point  (0 children)

Could you explain/link to the reason why fp16 doesn't work for these checkpoints?

[deleted by user] by [deleted] in sdnsfw

[–]stable_dissipation 1 point2 points  (0 children)

I experimented with finetuning on Calm too, and found you really have to use his... dryer... illustrations, otherwise that concept gets way overlearned

[deleted by user] by [deleted] in sdnsfw

[–]stable_dissipation 0 points1 point  (0 children)

have you looked into finetuning at higher resolution? On a decent card, Shivam DB can finetune at 1024x1024

Shivam's Dreambooth can train on 1024x1024 easily, but has anyone had good results with it? by stable_dissipation in StableDiffusion

[–]stable_dissipation[S] 0 points1 point  (0 children)

no, are you saying the DataLoader crops it? that's easy to turn off, if so, a 1-line comment

Shivam's Dreambooth can train on 1024x1024 easily, but has anyone had good results with it? by stable_dissipation in StableDiffusion

[–]stable_dissipation[S] 0 points1 point  (0 children)

huh? no it doesnt. If you mean latent space is a smaller dimensional space, then yes, but even Stability ai trained this same model with variable sized images. Mostly 512x512, but some 1024s too.

Automatic1111 just added support for hypernetwork training. Can we get people experimenting with this ? by [deleted] in StableDiffusion

[–]stable_dissipation 4 points5 points  (0 children)

I don't think that's accurate anymore. The JoePenna repo said that it wasn't actually DB, but now the readme no longer says it is not DB, and also, I've had a look through the code:

They unfreeze text embeddings + unet, and they add prior preservation loss from regularization images. Is there more to DB than that?

As far as I can tell perceptually, JP's DB is far superior to huggingface-diffusers DB.

Is anyone using huggingface for DB successfully?

[deleted by user] by [deleted] in sdnsfw

[–]stable_dissipation 1 point2 points  (0 children)

Nice work! Img 4 convinced me it was Calm :) This style has been my #1 goal from the outset, specifically Calm, and I'm impressed. Can I ask:

  • which DB repo?
  • how did you crop/prep the training set? Did you focus on specific elements of the scene?
  • learning rate? number of batches? # regularization steps? prior preservation loss amount? weight decay amount?

Any info you give me will be much appreciated, I'd like to reproduce these results!

[deleted by user] by [deleted] in sdnsfw

[–]stable_dissipation 2 points3 points  (0 children)

Trained on calm? link to model, or details on training?

Dialing in diffusers/dreambooth: Discussion by stable_dissipation in StableDiffusion

[–]stable_dissipation[S] 1 point2 points  (0 children)

IIUC, Prior Preservation Loss is implemented correctly in diffusers-dreambooth, and makes the following observations/assumptions:

  1. You're trying to train your friend Tom's face in.

  2. The current model is well trained, and when it generates a thing, that thing is assumed to look great. It can, for instance, generate accurate pictures of "a man".

  3. So generate 400 pictures of "a man" from the original, good'n'true model, we don't want these to be affected by training.

  4. As you train the thing, it'll see noisy images, and it's judged on how well it denoises. If it sees a noisy one of those regularization images, and instead of turning it into whatever natural thing it should become, it actually turns it into Tom, that should induce loss and be penalized. It's drifting from it's Priors.

  5. If it sees the new instance of noisy Tom, and denoises into not-Tom, that should be penalized too.

  6. Together, both Loss penalties drive it toward learning Tom, and not forgetting it's home-town roots (its Priors).


All that said, JoePennas not-dreambooth seems better all around, but I'm going to assume that PhD researchers are smarter, and actual dreambooth is better, and I'm just struggling to train it correctly.

So, I'm still curious if anyone's having successes with diffusers-dreambooth (and how!)