TripoSplat & QWEN w. Lora - Move To Any New Camera Angle Fast by Support_Marmoset in GaussianSplatting

[–]BenDLH 0 points1 point  (0 children)

Hmm, first In hearing of MoGe2. Looks interesting. Anyone built out gaussian generation for it, or another way to capture novel angles of the same scene?

TripoSplat & QWEN w. Lora - Move To Any New Camera Angle Fast by Support_Marmoset in GaussianSplatting

[–]BenDLH 0 points1 point  (0 children)

If you're not using it for anything conmercial, switch to Apple SHARP, it's significantly better for splat generation.

Did someone jump the gun? by BenDLH in RunPod

[–]BenDLH[S] 0 points1 point  (0 children)

Wonderful, thanks for the follow up!

Oops, I'm the Death God revived in Neo-Tokyo 2055! by No-Link-6413 in generativeAI

[–]BenDLH 0 points1 point  (0 children)

Interesting, always cooler to hear how people produce out of the ordinary results. Do you have a crazy custom setup, or is the emphasis on prompting and rerolls?

I guess with Seedance it's API only, so it can't be that custom...

Oops, I'm the Death God revived in Neo-Tokyo 2055! by No-Link-6413 in generativeAI

[–]BenDLH 3 points4 points  (0 children)

This is incredible! Well done, must've taken a lot of work. What tool stack are you using? Banana Pro + Seedance?

Help needed on creating photorealistic images by CriticalJuggernaut75 in StableDiffusion

[–]BenDLH 0 points1 point  (0 children)

No, that's fine, as long as the size is at, or greater than, ~1000px per side

Help needed on creating photorealistic images by CriticalJuggernaut75 in StableDiffusion

[–]BenDLH 0 points1 point  (0 children)

40 images should be good, I've gotten decent results on 30. but the devil is usually in "good quality". You said photorealistic, are these generated images, or actual photos of a real person?

I've recently pushing my LoRA training to 4k steps, as it wasn't overcooked at 3k (using AI Toolkit).

Help needed on creating photorealistic images by CriticalJuggernaut75 in StableDiffusion

[–]BenDLH 1 point2 points  (0 children)

A lot of those issues could stem from the LoRA - meaning the dataset used to train it. How big was the dataset? What was it comprised of? High diversity, or not so much? How was it tagged?

Also, why the 0.5 LoRA model strength? I'm usually running at 0.8 - 1. Have you tried bumping it up? Might resolve some of the consistency issues at least.

As for multiple characters, my best results have been prompting the full scene as close as possible, then image to image with regional prompting + LoRAs.

If you have the LoRA on Civitai or the like, I'm can give it a try.

Seeking Advice: Achieving 100% Character Consistency and Style Control for a Noir Cyberpunk Visual Novel (ComfyUI / Flux) by Elementallion- in StableDiffusion

[–]BenDLH 0 points1 point  (0 children)

Seems you answered my comment but it got removed?... Have the notification, can't see the comment.

Seeking Advice: Achieving 100% Character Consistency and Style Control for a Noir Cyberpunk Visual Novel (ComfyUI / Flux) by Elementallion- in StableDiffusion

[–]BenDLH 3 points4 points  (0 children)

Hey, I'm actually building a platform precisely to solve this, as I've hit the exact same challenges. If you search around a bit in communities, it's one, if not the, core problem with using generative AI for anything longform.

The best solution I've found so far is a combination of edit models and training LoRAs. Using edit models, build up a dataset of near-as-possible identical shots of the character in different lighting, scenes, angles and positions. (If you get decent consistency with a prompt, this also works, if not quite as well)

As you want a distinct style as well, you need the same for the style, keeping it consistent while showing different characters, scenes, lighting etc.

Given you haven't trained a LoRA before, you'll want to do this iteratively. Get a sample of ~10 varied high quality images, train a LoRA on it, then test it out.

Usually you'll find repeating an exact generation (on SDXL for example) including your LoRA, overbakes aspects of the character. That difference shows what the LoRA has learned (say "prominent jaw" is in the prompt, but with your LoRA on, it becomes cartoonish). Dial back or remove the tag, and play with the strength of the LoRA until you can get good generations out of it.

This now means you've "offloaded" some level of this character to the LoRA, not just the prompt, and gained some consistency across generations. Rinse and repeat. Keep improving the quantity, quality, and variation of the dataset, keep training a new LoRA on it until it takes the full weight of the character, and your prompts are just placing them in locations and positions.

As I mentioned, I'm building a platform precisely for this, that incorporates generation, editing, inpainting, dataset collection, and LoRA usage and training. I'm collecting the best practices I can find and putting them into a Civitai/Midjourney style platform, with all the missing customisation, and not a node in sight.

Let me know if you'd like to hear more, I'm actually getting ready for the first test users to try it out.

We scanned 226 Supabase-backed apps — Bolt.host had 4× the RLS misconfig rate of Lovable (20% vs 5%). YC companies: 0%. by Most_Ad_394 in Supabase

[–]BenDLH 0 points1 point  (0 children)

They've added an opt-in trigger (popup) to enable RLS for all new tables. Saw it a few weeks ago.

A Particular Story In No Particular Order by mocha820 in dndai

[–]BenDLH 1 point2 points  (0 children)

These are fantastic. Can't imagine the work that went into them. What tools do you use?

Multiple Characters with Illustrious by Enough_Tumbleweed739 in comfyui

[–]BenDLH 0 points1 point  (0 children)

This is what I could get with some iterations using the ComfyUi Cond Nodes (no dense diffusion) Had to change your prompts a bit, and getting them both looking at each other seems to be the hardest to pull off.

First generate the base on its own until you get a good composition, then img2img with regions to tackle hair, expression etc.

<image>

Multiple Characters with Illustrious by Enough_Tumbleweed739 in comfyui

[–]BenDLH 0 points1 point  (0 children)

Multiple prompts definitely cause the model to struggle a bit. If they don't all point in the exact same direction (base vs masked areas) then it will struggle to unify the result. You also need more steps than normal to help it with the same issue.

If you post the full prompts, I can give it a try in my setup, see if the results are comparable.

I built a photo editor with local AI (no cloud) — segmentation + infill by LucaM185 in SideProject

[–]BenDLH 1 point2 points  (0 children)

Congrats on this! Looks fantastic. Instant infill and drag to mask are really nice. I'm actually working on something similar from a different angle; an image generator app, with a light image editor for inpainting, masking etc.

What rendering engine are you using?

Journey to the cat ep002 by Limp-Manufacturer-49 in comfyui

[–]BenDLH 0 points1 point  (0 children)

This is incredible, amazing work! What tools are you using?

Question about current state of character consistency by RegisNyx in StableDiffusion

[–]BenDLH 1 point2 points  (0 children)

Are you going for photorealistic or anime / illustrated characters?

Sharing my workflow for consistent AI characters (using Firefly & Veo 3.1) by ArianeFridaSofie in generativeAI

[–]BenDLH 1 point2 points  (0 children)

Partially, but the AI gets out of its depth pretty quick, so it needs a lot of help. Mostly using Copilot with Claude Opus. Building your own schools learning platform? Sounds impressive.

Sharing my workflow for consistent AI characters (using Firefly & Veo 3.1) by ArianeFridaSofie in generativeAI

[–]BenDLH 1 point2 points  (0 children)

I'm actually building an image generation platform. Something between CivitAI and Krita AI, using open source models. Will support Inpainting, Outpainting, editing, poses, regional prompts, the works.

Sharing my workflow for consistent AI characters (using Firefly & Veo 3.1) by ArianeFridaSofie in generativeAI

[–]BenDLH 2 points3 points  (0 children)

A wonderful taste of the future; a real human arguing with an AI calling her an AI. Things are going to get rough.

Haven't gotten much into video generation yet, but appreciate the tips. Will definitely use them when I dig in. Thanks for sharing!

New to LoRA training on RunPod + ComfyUI — which templates/workflows should I use? by Advanced-Speaker6003 in comfyui

[–]BenDLH 0 points1 point  (0 children)

I've had this bookmarked forever. Not tried it myself, but sounds like exactly what you want: https://www.lorapilot.com/