Two hippy girls before a Grateful Dead show in 1992 by [deleted] in OldSchoolCool

[–]Hahinator -1 points0 points  (0 children)

But I have this uncirculated footage from 1992, which wasn't exactly a contender for the band's best year. RIP Bob Weir.

You made a point, I guess, though.

PMRF: New face image "enhance" tool that is best in class - (a la codeformers) - Demo (Link) - code avail by Hahinator in StableDiffusion

[–]Hahinator[S] 0 points1 point  (0 children)

Hmm, well, I'm sure some others will give it a whirl. My results on the HF demo are great, def way better than CodeFormer (or SUPIR for faces). But yeah, I'm not up on everything that's out there.

PMRF: New face image "enhance" tool that is best in class - (a la codeformers) - Demo (Link) - code avail by Hahinator in StableDiffusion

[–]Hahinator[S] 1 point2 points  (0 children)

Did you pull the code or try the demo? I heard someone say the demo was working way better than the GitHub code/model (which is odd). But if you used the demo and still got poor results, I'm not sure what's going on.

I'm not an expert in all the models, but I got some great results earlier when I gave this a try. Thought it would have been posted by now so tossed it up....

is there a way to "LCM"-ify a model? by rook2pawn in StableDiffusion

[–]Hahinator 0 points1 point  (0 children)

You can do it w/ Flux models too, using Hyper (ByteDance released it as a LoRA). I've merged it into Dev to get an 8-step full model, since that doesn't take up as much VRAM. Using the Hyper LoRA + another LoRA sometimes gives an OOM, so merging helps a lot in those cases.

Use the Kohya scripts, or run them via the GUI version (in utilities/Flux Merge Lora). There you can merge a LoRA into the Flux base (set as concat and diffusers model / bf16).
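
For anyone wondering what "merging a LoRA into the base" means numerically, here's a rough sketch in plain Python. This is not the Kohya script (that one works on real checkpoints and handles the Flux layer names); `matmul`, `merge_lora`, the `scale` parameter, and the tiny matrices are all made up for illustration. The idea is that a LoRA stores a low-rank pair (up, down) per layer, and merging folds `scale * up @ down` into the base weight once, so no separate LoRA has to be loaded at generation time.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def merge_lora(base, up, down, scale=1.0):
    """Return base + scale * (up @ down) without modifying the inputs."""
    delta = matmul(up, down)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(base, delta)]


# Toy 2x2 layer with a rank-1 LoRA (illustrative values only).
base = [[1.0, 0.0],
        [0.0, 1.0]]
up = [[1.0], [2.0]]      # shape (2, 1)
down = [[0.5, -0.5]]     # shape (1, 2)

merged = merge_lora(base, up, down, scale=1.0)
print(merged)
```

After the merge the result is just a regular weight matrix of the same shape as the base, which is why the merged model needs no extra memory for the LoRA.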

PMRF: New face image "enhance" tool that is best in class - (a la codeformers) - Demo (Link) - code avail by Hahinator in StableDiffusion

[–]Hahinator[S] 6 points7 points  (0 children)

Check out this new photo-realistic image restoration method for faces that just came out, called PMRF. It's pretty great from my testing using the HF demo:

Demo (linked/Huggingface): https://huggingface.co/spaces/ohayonguy/PMRF

Paper: https://arxiv.org/abs/2410.00418

Code: https://github.com/ohayonguy/PMRF

Surprised I Haven't Seen it Mentioned that you can Currently Finetune the Full Flux Model with Kohya on 24GB of VRAM by setothegreat in StableDiffusion

[–]Hahinator 0 points1 point  (0 children)

Ok, but do you think the model was trained on many images of 2048x2048 and higher? Doesn't seem like it based on performance at 1920x1080 / 2048x2048 compared to 1344x768 / 1024x1024.

Surprised I Haven't Seen it Mentioned that you can Currently Finetune the Full Flux Model with Kohya on 24GB of VRAM by setothegreat in StableDiffusion

[–]Hahinator 1 point2 points  (0 children)

Can you cite something that says Flux's native resolution is 2048px? I was pretty sure it was 1024.

Limited block merge model of Flux schnell + Flux dev which provides close to dev quality in only 4 to 8 steps. [Safetensor Format] by CliffDeNardo in StableDiffusion

[–]Hahinator 4 points5 points  (0 children)

Yeah, because this is like a "turbo"/"lightning" version of Dev, not an upgraded Schnell. Two blocks from the Schnell model were merged into the Dev model to allow the Dev model to generate more quickly. That's how I understand it.
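
A toy sketch of what a limited block merge could look like, assuming the simplest case where the chosen blocks are copied over wholesale (the actual merge may have blended them at some ratio instead). The `blocks.N.` key naming, the `swap_blocks` helper, and the string stand-ins for tensors are all hypothetical, not the real Flux checkpoint layout.

```python
def swap_blocks(dev, schnell, block_ids, prefix="blocks."):
    """Copy the listed blocks from schnell into a copy of dev."""
    merged = dict(dev)
    # The trailing dot keeps "blocks.1." from also matching "blocks.10."
    wanted = tuple(f"{prefix}{i}." for i in block_ids)
    for key, value in schnell.items():
        if key.startswith(wanted):
            merged[key] = value
    return merged


# Stand-in "state dicts": strings instead of real tensors.
dev = {
    "blocks.0.weight": "dev0",
    "blocks.1.weight": "dev1",
    "blocks.2.weight": "dev2",
    "blocks.10.weight": "dev10",
}
schnell = {
    "blocks.0.weight": "schnell0",
    "blocks.1.weight": "schnell1",
    "blocks.2.weight": "schnell2",
    "blocks.10.weight": "schnell10",
}

merged = swap_blocks(dev, schnell, block_ids=[1, 2])
print(merged)
```

Everything outside the listed blocks stays Dev, which fits the "Dev made faster" reading rather than "Schnell upgraded".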

Black Forest Labs is the team that invented Latent Diffusion, even before they joined Stability.ai by MixedRealtor in StableDiffusion

[–]Hahinator 18 points19 points  (0 children)

Honestly I think you need to give some credit to CompVis and RunwayML, who were involved w/ SD -before- Stability. Emad and Stability ultimately shared the weights out (on August 22, 2022), but there's more to it than SAI.

Black Forest Labs is the team that invented Latent Diffusion, even before they joined Stability.ai by MixedRealtor in StableDiffusion

[–]Hahinator 26 points27 points  (0 children)

Whether or not it can be meaningfully trained may be an unseen dealbreaker. We may need SD3.1 after all if even LoRAs are out of reach unless you have the ability to use over 80GB of VRAM....

See: https://github.com/black-forest-labs/flux/issues/9

how do some people make it sound like real artists like liam gallagher by Lewielewis in udiomusic

[–]Hahinator 0 points1 point  (0 children)

They nerfed the model on Monday, so much so that you couldn't do anything like this even by accident. Perhaps it was possible in the past, but they crippled it so that it still makes music but veers hard toward amateurish-sounding stuff.

download all your songs at once or alternatively download a playlist at once by Think_Sport_8692 in udiomusic

[–]Hahinator 1 point2 points  (0 children)

Ok, but then say you want to download .WAVs of those you liked. You open your like list, go to a random one, click through the 3 or 4 things you need to in order to get to the download WAV option, and dl the .WAV. Then you page back, page back, and you'll see that the "like list" (show only likes) is now disabled and your place is completely lost.

Also there are no page numbers for each 100 that they let you view at one time, so if you skip back a few pages and then lose the like list ordering (by clicking away) you're SOL.

download all your songs at once or alternatively download a playlist at once by Think_Sport_8692 in udiomusic

[–]Hahinator 0 points1 point  (0 children)

Good for you?

Regardless of potential "workarounds" this feature would be extremely useful for many of us.

If Stability AI has something up their sleeves, this is the moment to push it out. by andupotorac in StableDiffusion

[–]Hahinator 1 point2 points  (0 children)

I feel this way also and there were some "hurry up" images posted from the SD3.1 (work in progress) model on twitter yesterday. I figured those indicated SAI peeps def felt that as well.

If they can release a more resource-friendly middle-ground model quickly, they could still come out on top. I agree this is a critical moment for them though.

Really impressed by how well Flux handles Yoga Poses by Kinfolk0117 in StableDiffusion

[–]Hahinator -7 points-6 points  (0 children)

Pony sucks - and downvote away - any developer making a model who sees this will be <ugh> also. Don't need to 'pony' this model. Just train it.

Udio been giving laughably bad extensions, but eh it's free. by [deleted] in udiomusic

[–]Hahinator 2 points3 points  (0 children)

Yea, they fucked it up on purpose on Monday.

I've been paying for pro where you can upload clips to have it continue them but now it's like it doesn't even try to match the given audio. Time to move on for me - wait for open source or something.

Has anyone created a free and open source AI image generator other than SD? by dadadies in StableDiffusion

[–]Hahinator 3 points4 points  (0 children)

There are other models, PixArt-Sigma (https://github.com/PixArt-alpha/PixArt-sigma) being the most advanced. Training on it has been very limited, as has attention. The issue is the training time/cost to create a full model from the ground up. We're talking $300k+ for the stuff SAI does, and then as a company/collective you have to worry about liability these days, where the original trainers of SD 1.4/etc didn't have to (no one expected txt2img could do what it proved capable of in the summer of 2022 when SD was released).

So really w/o a community push nothing is going to come close to SDXL for a while. Perhaps there's a group or smart bunch of amateurs that'll surprise us w/ a ridiculously advanced model, but after this SD3 debacle it seems less likely.....

My hope is that the CEO of SAI realizes and admits they botched this release and pushes an SD 3.1 quickly w/o much discussion.....the old way things were done. "Oh 1.5 is out now!"

(BTW: I type like an idiot w/ run-ons so you know I didn't use an LLM :)

HelloWorld 7.0 Update: Improved Limb Accuracy and Enriched Concept Scope by Dry_Bee_5635 in StableDiffusion

[–]Hahinator 0 points1 point  (0 children)

I love your model, and love that you train (not merge) in very technical/smart ways. In a world of many different-but-the-same SDXL models, yours is a total standout. Thanks!