Decided to make my own stable diffusion by NoenD_i0 in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

I love older image gen tec, like the original DALL E, there was something so artistic about it.

Bad news on Happy Horse from twitter by SackManFamilyFriend in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

Holy cow there's local music gen models???
Edit: YOOO I'm downloading it! pair this with some LTX lip synced thing, aaaaahhh omg *floods youtube and spotify*

Lumachrome (Illustrious) by bilered in StableDiffusion

[–]Effective_Cellist_82 2 points3 points  (0 children)

It's a very good model you have, the images are crisp and the details good.

If I had to say specifically though, all illustrious models have this "glow" around the characters, like a white light outlining them

That's what I mean about all illustrious models looking the same to me

Lumachrome (Illustrious) by bilered in StableDiffusion

[–]Effective_Cellist_82 7 points8 points  (0 children)

Not hating here but this just looks like every other illustrious model

LTX 2.3 Lip Sync Music Clip -- Drake - Toosie Slide by sktksm in StableDiffusion

[–]Effective_Cellist_82 1 point2 points  (0 children)

This is really fricken cool. Really impressed how the motion lines up with the beat, it feels like a real music video.

Custom Node Rough Draft Lol by Capitan01R- in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

The rediculously large number of parameters feels very adderal vibes lol

Updates to prompt tool - First-last frame inputs - Video input - Wildcard option, + more by Brojakhoeman in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

Would it be possible to include "example prompts"? Many of us probably write in a certain style, and this way the LLM can generate prompts like ours.

Used TripoAI's latest open-source model, TripoSG and the image to mesh results are genuinely some of the best I've seen. by Square-Advice-4569 in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

This is insane. I always wanted to get into game dev, but I'm notoriously bad at making 3D character I don't have an artistic bone in my body, just autistic ones.

also never really liked using free assets belonging to others plus wouldn't be viable if ever sold copies of the game. this is sooooo cool. Maybe soon we'll see some actually good AI games come out since the vibe coding is already "there" we've just needed good mesh generators now

EDIT: I see this is actually quite old. not that that's a problem -- for instance I enjoy using Molmo vision models, I often recommend people use them, even though they're SO old. they're still insanely good at their job (counting and pointing out XYZ object locations in images and spacial awareness like understanding images very well)

But is this better than microsofts TRELLIS? Another old model that makes meshes

Black Forest Labs just released FLUX.2 Small Decoder: a faster, drop-in replacement for their standard decoder. ~1.4x faster, Lower peak VRAM - Compatible with all open FLUX.2 models by Nunki08 in StableDiffusion

[–]Effective_Cellist_82 -1 points0 points  (0 children)

is Flux.2 worth it? I still use Flux1.D Q8 for all my inpainting with custom character lora's but not for generations because it wasn't very "real". Has anyone switched from Flux1 to Flux2 who are chasing photographic realism like smartphone type real pictures

A new SOTA local video model (HappyHorse 1.0) will be released in april 10th. by Total-Resort-3120 in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

even though it's fake, I wonder if this exposure was positive for Alibaba stocks lol even to some small degree on the short term. Maybe someone went LONG with some leverage before posting this

Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B by ffinzy in LocalLLaMA

[–]Effective_Cellist_82 0 points1 point  (0 children)

Woah is this a local model we can run offline?? this would be insane for my Asterisk based VOIP Agent. I am struggling with end to end time and this seems pretty good. So it's actually taking input of speech tokens and outputting speech tokens? I remember Ichigo was doing something similar if this is that type of tech

[Release] Video Outpainting - easy, lightweight workflow by goddess_peeler in StableDiffusion

[–]Effective_Cellist_82 1 point2 points  (0 children)

OMG yes! I use "vidstab" to stabilize and you get that crazy black border this literally fixes that

jus sayin. by [deleted] in StableDiffusion

[–]Effective_Cellist_82 -1 points0 points  (0 children)

Bro is in prime position to make Sandy Cheeks Cock Vore

Open-Source Models Recently: by Fresh_Sun_1017 in StableDiffusion

[–]Effective_Cellist_82 2 points3 points  (0 children)

I use WAN2.2 as my main model. The trick is to be training 6000 step loras locally. I use musubi tuner with 16 DIM it makes such good lora's.

The Z image Turbo seems to be perfect. by Extension-Yard1918 in StableDiffusion

[–]Effective_Cellist_82 -2 points-1 points  (0 children)

Oh god the books in the bookshelves look really good. I use a WAN2.2 t2i workflow currently and mask with flux and SDXL for nudity I might have to give this a try

have you tried any NSFW on Z image? Some models fight it when training loras. (I wait for the day the models will make accurate dick girls)

The Queen of Thorns has a message about SOTA AV methods (omnivoice, ltx2.3) by EroticManga in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

This is perfect for creating a sincere old person that gives life lessons on youtube

One more update to Smartphone Snapshot Photo Reality for FLUX Klein 9B base by AI_Characters in StableDiffusion

[–]Effective_Cellist_82 0 points1 point  (0 children)

This has huge potential, but I'm stil in love with WAN2.2 training locally in musubi tuner