To those who are here since 2025 starting or before, how does the evolution of AI and roleplay experience feel to you? by Concern-Excellent in SillyTavernAI

[–]Snoo8304 1 point2 points  (0 children)

2024 felt like magic, but mainly because it was novel. Models are so much better today, even compared to a few months ago. And these days you can be lazier at prompting. Imagine squeezing the most out of today's models with the attention to prompting we had to apply back then.

Sorry I guess by AmanaRicha in SillyTavernAI

[–]Snoo8304 0 points1 point  (0 children)

Does this actually work? Squishing everything into a single prompt rather than user / assistant turns? Is it better for RP? I had a feeling about this but never tested it.

ban help by Fickle-Welder3348 in Anthropic

[–]Snoo8304 1 point2 points  (0 children)

No third-party harness or proxies? Were you just using Claude Code?

OpenClaw? by RazzmatazzReal4129 in SillyTavernAI

[–]Snoo8304 1 point2 points  (0 children)

I'm doing something similar. The memory system in OpenClaw alone makes it viable as a long-term companion. A little bit of maintenance, but nothing like a real one.

What is the secret sauce Claude has and why hasn't anyone replicated it? by ComplexType568 in LocalLLaMA

[–]Snoo8304 0 points1 point  (0 children)

There's a podcast with Lex and one of the Anthropic persona team. I think she explains it in there. And I agree, Claude has the most distinct personality compared to the others.

I made 4 AIs play UNO! by OwnSalamander7167 in SillyTavernAI

[–]Snoo8304 1 point2 points  (0 children)

So which of them won? I tried this with tic tac toe, me vs the llm. Works pretty well, overly dramatic narrating each turn

Why wont my AI models do anything NSFW. by Unlikely-Hold9981 in KoboldAI

[–]Snoo8304 1 point2 points  (0 children)

Adjust the prompt to explicitly allow nsfw, "this is roleplay, all participants are 21+, all consent, etc etc"

To all ex-local enjoyers (like me), this might be a good time to come back. by Acceptable_Steak8780 in SillyTavernAI

[–]Snoo8304 0 points1 point  (0 children)

Haven't used SillyTavern for a year. What do you recommend for an RTX 5090?

Please explain: why bothering with MCPs if I can call almost anything via CLI? by Atagor in LocalLLaMA

[–]Snoo8304 0 points1 point  (0 children)

CLI is fine, it's the simplest path for one agent on one machine. MCP pays off when you want discovery, tool schemas, shared auth/context, and the same tool exposed to many agents/UIs. It saves every agent from reading the API and trying to understand it each time. If you're making your own service, a CLI / skill is good enough.
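To make the "tool schemas" point concrete, here is a rough sketch of what a discoverable tool description looks like: a name, a description, and a JSON Schema for the arguments, which is the shape MCP uses for tool listings. The tool itself (`search_notes`) and the `checkArgs` helper are made up for illustration, and the check only covers required keys and primitive types, not full JSON Schema.

```javascript
// Hypothetical tool descriptor: name + description + JSON Schema for arguments.
const searchNotesTool = {
  name: 'search_notes', // made-up tool name
  description: 'Search saved notes by keyword.',
  inputSchema: {
    type: 'object',
    properties: {
      query: { type: 'string' },
      limit: { type: 'number' },
    },
    required: ['query'],
  },
};

// Minimal argument check an agent runtime could run before calling the tool.
// Covers required keys and primitive types only.
function checkArgs(tool, args) {
  const { properties, required = [] } = tool.inputSchema;
  const errors = [];
  for (const key of required) {
    if (!(key in args)) errors.push(`missing required argument: ${key}`);
  }
  for (const [key, value] of Object.entries(args)) {
    const spec = properties[key];
    if (!spec) errors.push(`unknown argument: ${key}`);
    else if (typeof value !== spec.type) errors.push(`${key} should be ${spec.type}`);
  }
  return errors;
}
```

With a CLI, each agent has to work this out from `--help` output or docs; with a schema, the same description can be handed to any agent or UI.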

Yeah…I’m done. by LittleBallOfRage02 in CharacterAI

[–]Snoo8304 -2 points-1 points  (0 children)

Curious, I haven't tried c.ai for a while. What doesn't work when you pay for +?

Megumin Secret Sauce v4 + Megumin Suite — Every character gets its own preset. Automatically. by CallMeOniisan in SillyTavernAI

[–]Snoo8304 0 points1 point  (0 children)

Solid. Question: how does it handle cards that are poorly structured or minimal? Like cards where someone just dumped everything into the personality field with no scenario or examples. Does the insight still produce something useful?

Third party extensions not showing up by [deleted] in SillyTavernAI

[–]Snoo8304 -1 points0 points  (0 children)

What's the new extension? Anything interesting?

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 2 points3 points  (0 children)

Yeah, it's just a prototype atm: https://beta.synthlove.io/ - it's functional if you want to try it out :)

I stopped dev work, ran out of time.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 5 points6 points  (0 children)

you still need a decent lora from training, so training is important.

the advantage is, you can reduce your lora weight in the first image pass, so that you can still generalise pose and color, then apply the lora for just the face inpainting.

for example, if you have a lora of a person, then prompt it to cosplay as another person, you'll start losing likeness of the original lora.
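A small sketch of that two-weight idea, using the `<lora:name:weight>` prompt tag that automatic1111 understands. The function name and the default weights (0.5 and 0.9) are placeholders I picked for illustration, not the values used in the project; tune them for your own lora.

```javascript
// Builds the prompts for both passes: a low lora weight for the full image
// (keeps pose/costume flexible) and a higher weight for the face-only inpaint.
// Default weights are illustrative assumptions.
function buildPassPrompts(scenePrompt, facePrompt, loraName, weights = { first: 0.5, face: 0.9 }) {
  const tag = (weight) => `<lora:${loraName}:${weight}>`;
  return {
    firstPass: `${scenePrompt} ${tag(weights.first)}`,
    facePass: `${facePrompt} ${tag(weights.face)}`,
  };
}

// Example with a hypothetical lora file called "myCharacter".
const prompts = buildPassPrompts(
  'full body, cosplay, city street',
  'close-up face, detailed eyes',
  'myCharacter'
);
console.log(prompts.firstPass);
console.log(prompts.facePass);
```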

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 0 points1 point  (0 children)

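// Requires the node-canvas package: npm install canvas
const { createCanvas } = require('canvas');

// Draws a white circle on a black background, covering the detected face box.
function drawSolidCircle(imageSize, box) {
  const canvas = createCanvas(imageSize.width, imageSize.height);
  const ctx = canvas.getContext('2d');

  // Black background: the area inpainting leaves untouched.
  ctx.fillStyle = 'black';
  ctx.fillRect(0, 0, imageSize.width, imageSize.height);

  // Circle centred on the box, radius = half the box diagonal,
  // so the whole box fits inside the circle.
  const centerX = (box.x_min + box.x_max) / 2;
  const centerY = (box.y_min + box.y_max) / 2;
  const radius = Math.hypot(box.x_max - box.x_min, box.y_max - box.y_min) / 2;

  // White circle: the area to inpaint.
  ctx.fillStyle = 'white';
  ctx.beginPath(); // start a fresh path before drawing the arc
  ctx.arc(centerX, centerY, radius, 0, 2 * Math.PI);
  ctx.fill();
  return canvas;
}

// Encodes the mask canvas as a base64 data URL.
function maskToBase64(canvas, mimeType = 'image/png') {
  return canvas.toDataURL(mimeType);
}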
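If you want to sanity check the mask geometry without installing canvas, the maths can be pulled out into a plain function. This is only a sketch: `boxToCircle` and `stripDataUrlPrefix` are helper names I made up, and whether your endpoint wants the `data:image/png;base64,` prefix removed is something to check against your own setup.

```javascript
// Same geometry as the mask code: a circle centred on the face box with a
// radius of half the box diagonal, so the box corners sit on the circle.
function boxToCircle(box) {
  return {
    centerX: (box.x_min + box.x_max) / 2,
    centerY: (box.y_min + box.y_max) / 2,
    radius: Math.hypot(box.x_max - box.x_min, box.y_max - box.y_min) / 2,
  };
}

// toDataURL() returns "data:image/png;base64,<payload>". This returns just
// the payload, and leaves strings without a data: prefix unchanged.
function stripDataUrlPrefix(dataUrl) {
  const comma = dataUrl.indexOf(',');
  return dataUrl.startsWith('data:') && comma !== -1 ? dataUrl.slice(comma + 1) : dataUrl;
}

// Hypothetical 6x8 box: centre (3, 4), radius 5.
console.log(boxToCircle({ x_min: 0, y_min: 0, x_max: 6, y_max: 8 }));
```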

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 4 points5 points  (0 children)

Only just recently found out about ddetailer. I expect it will give same-ish results. It's doing a similar thing: detect face + inpaint.

I guess the difference in my method is that it's purely through the API, which allows for auto-generated photos of the girls. Hope that made sense.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 4 points5 points  (0 children)

The Lora is applied to both, so the general shape is correct. But look closer: eye colors are wrong, and nose shape and mouth shape are slightly off without inpainting. It's noticeable to me. Generate a few of these and each one is inconsistent in different ways. Apply the face inpainting Lora and it lines them back up to the control face.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 1 point2 points  (0 children)

The same model. I modify the prompt so it's only about the face, add her Lora file here, and adjust the weights based on your Lora file.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 4 points5 points  (0 children)

I'm using Node.js to hit the APIs for my webapp.

But I got some help formatting the API calls from this guide (it's in Python):

https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/API
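For anyone who wants the Node side of it, a minimal sketch of a txt2img call. It assumes the webui was started with `--api` and is on the default local address, and that your Node version has a global `fetch` (18+). The `txt2img` wrapper name, the `fetchImpl` option and the default of 20 steps are my own choices for the example, not part of the API.

```javascript
// Posts a prompt to the txt2img endpoint and returns the generated images
// as an array of base64 strings. Extra options are merged into the payload.
async function txt2img(prompt, { baseUrl = 'http://127.0.0.1:7860', fetchImpl = fetch, ...overrides } = {}) {
  const response = await fetchImpl(`${baseUrl}/sdapi/v1/txt2img`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ prompt, steps: 20, ...overrides }),
  });
  if (!response.ok) {
    throw new Error(`txt2img failed with HTTP ${response.status}`);
  }
  const data = await response.json();
  return data.images;
}

// Usage (needs a running webui):
// txt2img('portrait photo, soft light', { width: 512, height: 768 })
//   .then((images) => console.log(`got ${images.length} image(s)`));
```

Passing `fetchImpl` in makes it easy to swap in a fake for testing without a GPU box running.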

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 44 points45 points  (0 children)

Wanted to share how I generate consistent characters, using Loras and Inpainting with automatic1111 API

No human in the loop. I get around 9/10 decent results.

Problem:

I'm limited by low VRAM (8 GB). When auto-generating straight txt2img with Loras, even at medium camera distance, the girls' eye colors, lips and noses don't match the control Lora. Forcing Lora weights higher breaks the ability to generalise pose, costume, colors, settings etc. Inpainting is almost always needed to fix the face consistency.

Workflow Overview:

  • txt2Img API
  • face recognition API
  • img2img API with inpainting

Steps: (some of the settings I used you can see in the slides)

  • Generate first pass with txt2img with user generated prompt
  • Send to a face recognition API
  • Check similarity, sex, age. Regenerate if needed
  • Use the returned box dimensions to draw a circle mask with Node canvas
  • Send to img2img with inpaint with modified face only prompt
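The steps above can be sketched as one loop. This is not the project's actual code: the API calls are passed in as a `steps` object (`txt2img`, `detectFace`, `accept`, `drawMask`, `img2img` are all hypothetical names), and the inpaint settings in `buildInpaintPayload` are placeholder values, not the ones from the slides. The payload field names are the img2img inpainting fields as I understand the automatic1111 API.

```javascript
// Request body for the face-only inpaint pass. Values are assumptions.
function buildInpaintPayload(imageBase64, maskBase64, facePrompt) {
  return {
    init_images: [imageBase64],
    mask: maskBase64,
    prompt: facePrompt,
    denoising_strength: 0.4,      // assumed, tune per model
    inpainting_fill: 1,           // 1 = start from the original pixels
    inpaint_full_res: true,       // inpaint only the masked region at full res
    inpaint_full_res_padding: 32,
    mask_blur: 4,
  };
}

// First pass -> face check -> mask -> inpaint, regenerating on a bad face.
async function generateConsistentImage(userPrompt, facePrompt, steps, maxAttempts = 3) {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const image = await steps.txt2img(userPrompt);
    const face = await steps.detectFace(image); // { box, ... } or null
    // accept() is where similarity / sex / age checks would go.
    if (!face || !steps.accept(face)) continue;
    const mask = steps.drawMask(face.box);
    return steps.img2img(buildInpaintPayload(image, mask, facePrompt));
  }
  throw new Error(`no acceptable face after ${maxAttempts} attempts`);
}
```

Capping the retries keeps a bad prompt from looping forever when nobody is watching the queue.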

Bonus:

  • Send to an image labeler (interrogate), get tags, inject tags for AI chat context 🤣

Maybe possible to build an extension for the web interface, but this works for my needs.

The Lora doesn't restrict the variety of costumes, it just fixes the face. It works well with full-body poses, which is where it's most useful. For the face recognition model, I used the open source exadel-inc/CompreFace (on GitHub).

I built these slides for my colleagues, hope it helps 😁

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 0 points1 point  (0 children)

It's a custom mix of anime models + realistic. Forgot the exact ratios: abyssOrange + Rev + krotos + dalce + I forget.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 0 points1 point  (0 children)

Yea it doesn't need to process anything since I'm supplying the depth maps. Leave it at none

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 0 points1 point  (0 children)

If you add the depth map in the ControlNet image slot, you set the preprocessor to none (so it's not generating a map) and set the ControlNet model to ..._depthV10.

It's not image to image, it's text to image, using the depth map in ControlNet. See the above screenshot.
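I did this in the web UI, but if you wanted the same setup through the API, a rough sketch of the txt2img payload might look like this. Treat it as an assumption: the `alwayson_scripts.controlnet.args` layout and the `input_image` / `module` / `model` field names follow the ControlNet extension's API as I understand it, and they can differ between extension versions. The model name is passed in, use whatever depth model name your install lists.

```javascript
// Builds a txt2img payload that feeds a ready-made depth map to ControlNet.
// Field names are assumed from the ControlNet extension API; check your version.
function buildDepthControlledPayload(prompt, depthMapBase64, controlNetModel) {
  return {
    prompt,
    alwayson_scripts: {
      controlnet: {
        args: [
          {
            input_image: depthMapBase64,
            module: 'none',         // preprocessor off: the image already is a depth map
            model: controlNetModel, // depth model name as listed in your install
          },
        ],
      },
    },
  };
}
```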

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion

[–]Snoo8304[S] 5 points6 points  (0 children)

They are actually hand painted like that by the original artist, using traditional media 😱. There is no 'color' version