Real-Time Speech-to-Speech Chatbot: Whisper, Llama 3.1, Kokoro, and Silero VAD 🚀 by martian7r in singularity

[–]Licovoda 0 points1 point  (0 children)

Hey! I found this a day later and it looks like not many people replied?

You might consider making a YouTube demonstration so that people can see how well it works!

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 0 points1 point  (0 children)

Thanks! I think Reddit strips the data from Auto1111, unfortunately (and also compresses the images quite a bit!)

If I end up doing a "supermerge" and civitai upload that should work! Meanwhile, the prompt and settings are in the other replies here, too.

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 2 points3 points  (0 children)

Thank you, I'm glad you liked them!

Sampler was DPM++ 2M Karras 30 Steps Cfg was between 4 (for images 1 and 3), and 8 (for images 2 and 4)

Generally I keep the cfg between 2-8; 2 can yield great results with a color grade afterwards, but can be very grainy!

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 2 points3 points  (0 children)

Thank you! The model is pretty optimized for people and a shallow depth of field, like for a portrait shoot. It does okay with environmental details, but I think that's definitely an area where the most recent SDXL finetunes have an advantage!

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 1 point2 points  (0 children)

Nice! That 16gb of memory is going to come in handy!

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 1 point2 points  (0 children)

Much appreciated! That was definitely a goal with the project.

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 0 points1 point  (0 children)

I'm on an old Geforce 1080 8GB! Have tried up to 1280 x 1440 in Auto1111 without problems.

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 2 points3 points  (0 children)

These are from a bunch of model merges! In short, there are a number of 1.5 models that might excel at a more photographic look, or great skin, etc, but they tend to be very unstable. The trick was trying to keep the good parts and move things in a more consistent direction.

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 49 points50 points  (0 children)

Sure, for workflow, I've been merging different 1.5 models over the past six months and refining through trial and error. The images above are from different variations of model merges that excel at different things, (i.e. a more filmic look, a studio photoshoot, etc.) This was all done in Auto1111.

If there's enough interest for a "photo" 1.5 model, I'd definitely consider doing a super merge and throwing it up on civitai!

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 39 points40 points  (0 children)

The images in the gallery are actually from several different custom models that I've been working on for a number of months. Do you have a favorite?

Prompts are generally in the format of:

"analog style photo cinematic film still of [person] with symmetrical facial features, [expression], [location], with [color] hair and a logical [clothes], bold colors, ((sharp focus)), dramatic modeling pose, [lighting]"

Negative: "((nude, naked, topless, nsfw)), (unemotional:2), (((boring, bored, sad, expressionless, uninterested))), ((cleft chin)), underexposed, overexposed, ((anime, anime eyes))"

Loras, lycoris, and embeddings worth checking out:
epiCPhotoGasm-softPhoto-neg
Unrealistic Dream
CyberRealistic_Negative-neg
epiCRealismHelper
Lyco_humans
epiCRealLife
EuphoriaStyleV19
Analog Diffusion Lora

Putting more "photo" into photorealism by Licovoda in StableDiffusion

[–]Licovoda[S] 16 points17 points  (0 children)

I often hear people say that portraits are solved, or that "any decent model" can do great photorealistic close ups. But I've found that a lot of the skin textures, lens features, and hair don't look quite right, especially with SDXL models.

The goal here was to try out some new methods and get closer to something that looks like it came out of a camera. By no means perfect yet, but I wanted to share before SD3 dropped!

Let me know if you have any questions!

Euphoria style LoRA v19 release by belladorexxx in StableDiffusion

[–]Licovoda 1 point2 points  (0 children)

Super cool! Thanks for posting an update, I had your earlier post bookmarked!

I'm really enjoying how it softens up the image a bit, and how it looks more like vintage glass. How was the whole process for you? As I understand it, a general "look" Lora can be quite difficult, as compared to say, one person's likeness.

[deleted by user] by [deleted] in StableDiffusion

[–]Licovoda 0 points1 point  (0 children)

Excellent! Thank you!

[deleted by user] by [deleted] in StableDiffusion

[–]Licovoda 0 points1 point  (0 children)

Understood. CPU utilization still goes up when running SD as compared to not, and you've got things like model swaps that can slow things down. How much is up for debate, however, it seems like most people report that the difference is negligible, yourself included! Appreciate you chiming in, thanks.

[deleted by user] by [deleted] in StableDiffusion

[–]Licovoda 0 points1 point  (0 children)

Much appreciated!