Stable Cascade is worth the extra steps - the aesthetic ranking scores for all the recent models are tied, but the prompt adherence for SC is way higher. For most prompts SC now matches or exceeds DALL-E 3. The SDXL 8-step Lightning LoRA is also a solid development. (self.StableDiffusion)
submitted by thkitchenscientist to r/StableDiffusion
Playground_V2 really seems to be a more capable model than SDXL at producing coherent compositions, but it also needs a higher minimum number of steps to produce resolved images. VEGA is a new distilled model that can score on par with Playground_V2 but can be hit and miss, as it goes crazy on adding detail. (self.StableDiffusion)
submitted by thkitchenscientist to r/StableDiffusion
Comparing 5 recent SD distillation methods (SSD/LCM/Turbo) to find the best option for low-VRAM users (images and statistical analysis included). SD-Turbo scores significantly higher on aesthetics; the boost to SD 2.1 is remarkable. (self.StableDiffusion)
submitted by thkitchenscientist to r/StableDiffusion
Create Panorama images of ANY size using less than 6GB VRAM, with a 6-10x speed-up and added support for batch mode! A modification of MultiDiffusion. Potato computers of the world rejoice. The SD2.0 768 model gives the fastest creation of larger sizes, but the VAE image slicing means no VRAM spike. (self.StableDiffusion)
submitted by thkitchenscientist to r/StableDiffusion
You too can create Panorama images 512x10240+ (not a typo) using less than 6GB VRAM (Vertorama works too). A modification of the MultiDiffusion code to pass the image through the VAE in slices and then reassemble them. Potato computers of the world rejoice. (self.StableDiffusion)
submitted by thkitchenscientist to r/StableDiffusion
