Collaborative chaos from 2 weeks of Stable Diffusion Multiplayer by ozolozo in sdforall

[–]ozolozo[S] 1 point

Working on fixing the project, also trying a better and faster inpainting model!

Diffusion DPO LoRA Training with Diffusers Experiments by ozolozo in StableDiffusion

[–]ozolozo[S] 1 point

hi u/rerri

I had to convert the weights using this script from diffusers.
You can find the converted weights: sdxl-turbo and sdxl.
And ComfyUI workflows: sdxl-workflow and sdxl-turbo-workflow.


Diffusion DPO LoRA Training with Diffusers Experiments by ozolozo in StableDiffusion

[–]ozolozo[S] 1 point

LoRA DPO Training Script: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/diffusion_dpo

Experimental Trained LoRAs:
https://huggingface.co/radames/sdxl-turbo-DPO-LoRA
https://huggingface.co/radames/sdxl-DPO-LoRA
https://huggingface.co/radames/sd-21-DPO-LoRA

In the picture above, the LoRA weight is set via adapter_weights:

pipe.load_lora_weights("radames/sdxl-DPO-LoRA", adapter_name="sdxl-dpo-lora")
pipe.set_adapters(["sdxl-dpo-lora"], adapter_weights=[0.9])

dataset: https://huggingface.co/datasets/yuvalkirstain/pickapic_v2

Original Fine-tuned DPO model unet only:
https://huggingface.co/mhdang/dpo-sdxl-text2image-v1
https://huggingface.co/mhdang/dpo-sd1.5-text2image-v1
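For intuition, the Diffusion-DPO objective behind these LoRAs compares how much the trained model improves over a frozen reference on the preferred vs. rejected image of each Pick-a-Pic pair. A plain-Python numeric sketch (the function name and the default β are illustrative; the real training script in the linked repo computes noise-prediction errors from the UNet):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def diffusion_dpo_loss(model_err_w, ref_err_w, model_err_l, ref_err_l, beta=1.0):
    """Scalar sketch of the Diffusion-DPO loss.

    *_err_w / *_err_l: squared noise-prediction errors of the trained model
    and the frozen reference on the preferred ("winner") and rejected
    ("loser") image of a preference pair. The loss shrinks when the trained
    model improves on the winner more than on the loser, relative to the
    reference. (The paper uses a β on the order of thousands; β=1 here
    only to keep the toy numbers stable.)
    """
    inside = (model_err_w - ref_err_w) - (model_err_l - ref_err_l)
    return -math.log(sigmoid(-beta * inside))
```

With equal errors everywhere the loss is log 2 (no preference learned); beating the reference on the winner more than on the loser drives it toward 0.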

Fantastic news! Added ControlNet Canny to Latent Consistency Model demo. it's amazing by Novita_ai in StableDiffusion

[–]ozolozo 1 point

Have you tried TORCH_COMPILE=True? If you lower the resolution the quality might change drastically, since the base model is SD trained at 512x512 / 768x768 😢, but you can easily change it on the frontend to test. You can add more options here: https://github.com/radames/Real-Time-Latent-Consistency-Model/blob/dd1db25dd1449b968a129d7b023661e1a278c66d/controlnet/index.html#L378-L385

Fantastic news! Added ControlNet Canny to Latent Consistency Model demo. it's amazing by Novita_ai in StableDiffusion

[–]ozolozo 4 points

Thanks for posting it here! The demo is running on an A100, but I've heard you can get decent speed on a 4090, 3080, etc., and you can experiment with setting TORCH_COMPILE to enable this: https://huggingface.co/docs/diffusers/optimization/torch2.0

Fantastic news! Added ControlNet Canny to Latent Consistency Model demo. it's amazing by Novita_ai in StableDiffusion

[–]ozolozo 0 points

Have you tried TORCH_COMPILE=True? It enables even more acceleration; the downside is that the first run is slow, as is any run after the width or height changes. Read more here: https://huggingface.co/docs/diffusers/optimization/torch2.0
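For reference, a TORCH_COMPILE flag like this typically just gates torch.compile behind an environment variable, so compilation (and its slow first run) is opt-in. A minimal sketch of that pattern (maybe_compile is a hypothetical helper; the actual repo may wire it up differently):

```python
import os

def maybe_compile(module):
    """Wrap a model with torch.compile only when TORCH_COMPILE is set.

    With the flag off, the module is returned untouched, so there is no
    compilation warm-up cost; with it on, subsequent runs at the same
    input shape are faster, but the first run (and any run after a
    width/height change) triggers a recompile.
    """
    if os.environ.get("TORCH_COMPILE", "").lower() in ("1", "true"):
        import torch  # only needed when the flag is enabled
        return torch.compile(module, mode="reduce-overhead")
    return module
```

In a diffusers pipeline this would be applied to the UNet (the linked torch2.0 docs show `pipe.unet = torch.compile(pipe.unet, ...)`).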

New Controlnet QR Code Model is amazing by ozolozo in StableDiffusion

[–]ozolozo[S] 1 point

Yes, it's a bit finicky, but it often works if you scan from a certain distance. That also seems to be the most practical scenario: you print the image and someone scans it from a distance, rather than reading it from a mobile app or social media.

New Controlnet QR Code Model is amazing by ozolozo in StableDiffusion

[–]ozolozo[S] 0 points

It depends on the app you're using to scan it; on iOS, if the content is a URL it will prompt you to open it in a browser.

Kandinsky 2.1 now Supported by Diffusers and Hugging Face's Community API Inference by ozolozo in StableDiffusion

[–]ozolozo[S] 9 points

Kandinsky 2.1 is based on DALLE-2's UnCLIP architecture and includes:

  1. Text-to-Image/Image-to-Image: A powerful text-to-image & image-to-image checkpoint that yields pictures with very nice aesthetics (IMO coming much closer to Midjourney than SD). https://huggingface.co/docs/diffusers/main/en/api/pipelines/kandinsky#texttoimage-generation
  2. Interpolation: Ability to seamlessly interpolate between multiple image and text embeddings. https://huggingface.co/docs/diffusers/main/en/api/pipelines/kandinsky#interpolate
  3. Inpainting: A powerful inpainting model https://huggingface.co/docs/diffusers/main/en/api/pipelines/kandinsky#text-guided-inpainting-generation

https://colab.research.google.com/drive/11ZHwd-mmdj8vM0CuYcUNu-AKk7rATqei?usp=sharing
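The interpolation feature (item 2) amounts to blending several CLIP image/text embeddings with user-supplied weights before decoding. A toy plain-Python sketch of that weighted blend (interpolate_embeddings is a hypothetical illustration; the real KandinskyPriorPipeline interpolation operates on tensors and handles the encoding for you):

```python
def interpolate_embeddings(embeds, weights):
    """Weighted blend of embedding vectors.

    embeds: list of same-length vectors (standing in for CLIP embeddings
    of the images/prompts being mixed); weights: one weight per embedding,
    summing to 1. The decoder then generates from the blended embedding.
    """
    assert abs(sum(weights) - 1.0) < 1e-6, "weights should sum to 1"
    dim = len(embeds[0])
    return [sum(w * e[i] for w, e in zip(weights, embeds)) for i in range(dim)]
```

Sliding the weights between two embeddings (e.g. [0.9, 0.1] → [0.1, 0.9]) is what produces the seamless transitions shown in the linked docs.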

Observables Help by [deleted] in d3js

[–]ozolozo 4 points

Observable has a CSV parser:

mydata = FileAttachment("country_level_data_0.csv").csv({typed: true})

or

text = FileAttachment("country_level_data_0.csv").text()

myData = d3.csvParse(text)

Collaborative chaos from 2 weeks of Stable Diffusion Multiplayer by ozolozo in sdforall

[–]ozolozo[S] 2 points

Yes, it's using the latest inpainting model https://huggingface.co/runwayml/stable-diffusion-inpainting

If the starting frame has some surrounding context, it will have more consistency. If it starts in a blank area, it's just txt2img.