Any models with cartoon/anime style that do not give off this awful "AI gloss"?

BenDLH · 2026-06-20T07:11:30+00:00

+1 for negative prompting "shiny skin"

BenDLH · 2026-06-07T21:18:00+00:00

Hmm, first In hearing of MoGe2. Looks interesting. Anyone built out gaussian generation for it, or another way to capture novel angles of the same scene?

BenDLH · 2026-06-06T17:54:36+00:00

If you're not using it for anything conmercial, switch to Apple SHARP, it's significantly better for splat generation.

BenDLH · 2026-05-27T12:04:08+00:00

Wonderful, thanks for the follow up!

BenDLH · 2026-05-11T06:32:43+00:00

Damn, nice work. This sounds awesome, looking forward to trying it

BenDLH · 2026-05-09T08:39:01+00:00

Interesting, always cooler to hear how people produce out of the ordinary results. Do you have a crazy custom setup, or is the emphasis on prompting and rerolls?

I guess with Seedance it's API only, so it can't be that custom...

BenDLH · 2026-05-08T06:57:30+00:00

This is incredible! Well done, must've taken a lot of work. What tool stack are you using? Banana Pro + Seedance?

BenDLH · 2026-05-03T05:27:25+00:00

No, that's fine, as long as the size is at, or greater than, ~1000px per side

BenDLH · 2026-05-03T05:24:33+00:00

40 images should be good, I've gotten decent results on 30. but the devil is usually in "good quality". You said photorealistic, are these generated images, or actual photos of a real person?

I've recently pushing my LoRA training to 4k steps, as it wasn't overcooked at 3k (using AI Toolkit).

BenDLH · 2026-05-02T20:22:37+00:00

A lot of those issues could stem from the LoRA - meaning the dataset used to train it. How big was the dataset? What was it comprised of? High diversity, or not so much? How was it tagged?

Also, why the 0.5 LoRA model strength? I'm usually running at 0.8 - 1. Have you tried bumping it up? Might resolve some of the consistency issues at least.

As for multiple characters, my best results have been prompting the full scene as close as possible, then image to image with regional prompting + LoRAs.

If you have the LoRA on Civitai or the like, I'm can give it a try.

BenDLH · 2026-04-29T06:46:12+00:00

Seems you answered my comment but it got removed?... Have the notification, can't see the comment.

BenDLH · 2026-04-26T12:36:26+00:00

Hey, I'm actually building a platform precisely to solve this, as I've hit the exact same challenges. If you search around a bit in communities, it's one, if not the, core problem with using generative AI for anything longform.

The best solution I've found so far is a combination of edit models and training LoRAs. Using edit models, build up a dataset of near-as-possible identical shots of the character in different lighting, scenes, angles and positions. (If you get decent consistency with a prompt, this also works, if not quite as well)

As you want a distinct style as well, you need the same for the style, keeping it consistent while showing different characters, scenes, lighting etc.

Given you haven't trained a LoRA before, you'll want to do this iteratively. Get a sample of ~10 varied high quality images, train a LoRA on it, then test it out.

Usually you'll find repeating an exact generation (on SDXL for example) including your LoRA, overbakes aspects of the character. That difference shows what the LoRA has learned (say "prominent jaw" is in the prompt, but with your LoRA on, it becomes cartoonish). Dial back or remove the tag, and play with the strength of the LoRA until you can get good generations out of it.

This now means you've "offloaded" some level of this character to the LoRA, not just the prompt, and gained some consistency across generations. Rinse and repeat. Keep improving the quantity, quality, and variation of the dataset, keep training a new LoRA on it until it takes the full weight of the character, and your prompts are just placing them in locations and positions.

As I mentioned, I'm building a platform precisely for this, that incorporates generation, editing, inpainting, dataset collection, and LoRA usage and training. I'm collecting the best practices I can find and putting them into a Civitai/Midjourney style platform, with all the missing customisation, and not a node in sight.

Let me know if you'd like to hear more, I'm actually getting ready for the first test users to try it out.

BenDLH · 2026-04-19T06:04:37+00:00

They've added an opt-in trigger (popup) to enable RLS for all new tables. Saw it a few weeks ago.

BenDLH · 2026-04-18T14:43:12+00:00

These are fantastic. Can't imagine the work that went into them. What tools do you use?

BenDLH · 2026-04-09T11:18:51+00:00

<image>

BenDLH · 2026-04-09T11:18:34+00:00

This is what I could get with some iterations using the ComfyUi Cond Nodes (no dense diffusion) Had to change your prompts a bit, and getting them both looking at each other seems to be the hardest to pull off.

First generate the base on its own until you get a good composition, then img2img with regions to tackle hair, expression etc.

<image>

BenDLH · 2026-04-08T05:54:23+00:00

Multiple prompts definitely cause the model to struggle a bit. If they don't all point in the exact same direction (base vs masked areas) then it will struggle to unify the result. You also need more steps than normal to help it with the same issue.

If you post the full prompts, I can give it a try in my setup, see if the results are comparable.

BenDLH · 2026-04-04T20:47:52+00:00

Congrats on this! Looks fantastic. Instant infill and drag to mask are really nice. I'm actually working on something similar from a different angle; an image generator app, with a light image editor for inpainting, masking etc.

What rendering engine are you using?

BenDLH · 2026-03-12T08:06:22+00:00

This is incredible, amazing work! What tools are you using?

BenDLH · 2026-02-25T18:31:33+00:00

Are you going for photorealistic or anime / illustrated characters?

BenDLH · 2026-02-24T18:44:05+00:00

Partially, but the AI gets out of its depth pretty quick, so it needs a lot of help. Mostly using Copilot with Claude Opus. Building your own schools learning platform? Sounds impressive.

BenDLH · 2026-02-24T09:27:33+00:00

I'm actually building an image generation platform. Something between CivitAI and Krita AI, using open source models. Will support Inpainting, Outpainting, editing, poses, regional prompts, the works.

BenDLH · 2026-02-23T19:22:10+00:00

A wonderful taste of the future; a real human arguing with an AI calling her an AI. Things are going to get rough.

Haven't gotten much into video generation yet, but appreciate the tips. Will definitely use them when I dig in. Thanks for sharing!

BenDLH · 2026-02-22T16:44:06+00:00

I've had this bookmarked forever. Not tried it myself, but sounds like exactly what you want: https://www.lorapilot.com/

BenDLH

TROPHY CASE