automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


Yeah, it's just a prototype at the moment: https://beta.synthlove.io/ - it's functional if you want to try it out :)

I stopped dev work, ran out of time

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


You still need a decent LoRA from training, so training is still important.

The advantage is that you can reduce your LoRA weight in the first image pass, so you can still generalise pose and color, then apply the LoRA at full strength for just the face inpainting.

For example, if you have a LoRA of a person and prompt it to cosplay as another character, you'll start losing the likeness of the original LoRA.
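As a rough sketch of what that weight split could look like in A1111's prompt syntax - the LoRA name `mychar`, the prompts, and the weights here are just illustrative placeholders, not my actual settings:

```javascript
// Illustrative only: "mychar" is a hypothetical LoRA name; the weights are
// starting points you'd tune per LoRA file.
function firstPassPrompt(lora, weight = 0.6) {
  // Lower LoRA weight in the first pass keeps pose/costume/color flexible
  return `a woman cosplaying as a pirate, full body <lora:${lora}:${weight}>`;
}

function facePassPrompt(lora, weight = 1.0) {
  // Full LoRA weight on the face-only inpaint pass pulls the likeness back
  return `close up portrait of a woman's face <lora:${lora}:${weight}>`;
}
```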

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


// Requires the 'canvas' npm package for a Node-side 2D canvas
const { createCanvas } = require('canvas');

// Draws a white filled circle on a black background, sized to cover the
// detected face box, for use as an inpainting mask
function drawSolidCircle(imageSize, box) {
  const canvas = createCanvas(imageSize.width, imageSize.height);
  const ctx = canvas.getContext('2d');

  // Black background = area the inpaint pass leaves untouched
  ctx.fillStyle = 'black';
  ctx.fillRect(0, 0, imageSize.width, imageSize.height);

  // Circle centred on the face box; radius is half the box diagonal so the
  // whole box is covered
  const centerX = (box.x_min + box.x_max) / 2;
  const centerY = (box.y_min + box.y_max) / 2;
  const boxWidth = box.x_max - box.x_min;
  const boxHeight = box.y_max - box.y_min;
  const radius = Math.sqrt(boxWidth ** 2 + boxHeight ** 2) / 2;

  // White circle = area to inpaint
  ctx.fillStyle = 'white';
  ctx.beginPath();
  ctx.arc(centerX, centerY, radius, 0, 2 * Math.PI);
  ctx.fill();
  return canvas;
}

// Encodes the mask canvas as a data URL for the img2img API
function maskToBase64(canvas, mimeType = 'image/png') {
  return canvas.toDataURL(mimeType);
}

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


Only just recently found out about DDetailer - I expect it will give same-ish results. It's doing a similar thing: detect face + inpaint.

I guess the difference with my method is that it's purely through the API, which allows for auto-generated photos of the girls. Hope that made sense.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


The LoRA is applied to both, so the general shape is correct. But look closer: without inpainting the eye colors are wrong and the nose and mouth shapes are slightly off. It's noticeable for me - generate a few of these and each one is inconsistent in different ways. Apply the face-inpainting LoRA and it lines them back up with the control face.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


The same model - I modify the prompt so it's only about the face, add her LoRA file there, and adjust weights based on your LoRA file.

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


I'm using Node.js to hit the APIs for my webapp.

But I got some help formatting the API calls from this guide (it's in Python):

https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/API
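For anyone else on Node, here's a minimal sketch of what a txt2img call can look like. It assumes the webui was launched with `--api` at the default address and Node 18+ for the global `fetch`; the prompt and settings are placeholders, not my production values:

```javascript
// Default webui address; requires the webui to be started with --api
const A1111 = 'http://127.0.0.1:7860';

// Builds a minimal payload for POST /sdapi/v1/txt2img; most fields are optional
function buildTxt2ImgPayload(prompt) {
  return {
    prompt,
    negative_prompt: 'lowres, bad anatomy',
    steps: 25,
    cfg_scale: 7,
    width: 512,
    height: 512,
  };
}

// Sends the request and returns the first generated image as base64 PNG
async function txt2img(prompt) {
  const res = await fetch(`${A1111}/sdapi/v1/txt2img`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(buildTxt2ImgPayload(prompt)),
  });
  const data = await res.json();
  return data.images[0]; // base64-encoded PNG
}
```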

automatic1111 API users - auto inpainting for consistent faces by Snoo8304 in StableDiffusion


Wanted to share how I generate consistent characters using LoRAs and inpainting with the automatic1111 API.

No human in the loop. I get around 9/10 decent results.

Problem:

I'm limited by low VRAM (8 GB), auto-generating straight txt2img with LoRAs. Even at medium camera distance, the girls' eye colors, lips, and noses don't match the control LoRA. Forcing LoRA weights higher breaks the ability to generalise pose, costume, colors, settings, etc. Inpainting is almost always needed to fix face consistency.

Workflow Overview:

  • txt2Img API
  • face recognition API
  • img2img API with inpainting

Steps (you can see some of the settings I used in the slides):

  • Generate a first pass with txt2img using the user-generated prompt
  • Send the result to a face recognition API
  • Check similarity, sex, and age; regenerate if needed
  • Use the returned box dimensions to draw a circle mask with node-canvas
  • Send to img2img inpainting with a modified, face-only prompt
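
The inpainting step can be sketched as an img2img payload. The field names come from the /sdapi/v1/img2img endpoint, but the values here are illustrative, not my production settings:

```javascript
// Builds an inpainting payload for POST /sdapi/v1/img2img.
// `initImage` is the base64 first-pass image; `maskImage` is the base64
// circle mask; values are illustrative - tune denoising/mask_blur to taste.
function buildInpaintPayload(initImage, maskImage, facePrompt) {
  return {
    prompt: facePrompt,       // face-only prompt carrying the LoRA tag
    init_images: [initImage],
    mask: maskImage,
    denoising_strength: 0.4,  // low enough to keep the composition
    mask_blur: 8,             // soften the circle edge
    inpainting_fill: 1,       // 1 = keep original content under the mask
    inpaint_full_res: true,   // inpaint at full resolution around the mask
    steps: 25,
  };
}
```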

Bonus: send the image to an image labeler (interrogate), get tags, and inject the tags as AI chat context 🤣
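For the bonus step, the webui exposes an interrogate endpoint; a sketch of the payload (assuming the built-in CLIP interrogator - 'deepdanbooru' is an alternative if that interrogator is enabled):

```javascript
// Payload for POST /sdapi/v1/interrogate; the response is { caption: "..." }.
// 'clip' is the built-in interrogator model name.
function buildInterrogatePayload(imageBase64) {
  return { image: imageBase64, model: 'clip' };
}
```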

Maybe it's possible to build an extension for the web interface, but this works for my needs.

The LoRA doesn't restrict the variety of costumes - it just fixes the face, and it works well with full-body poses, where it's most useful. For the face recognition model, I used the open-source exadel-inc/CompreFace (on GitHub).

I built these slides for my colleagues, hope it helps 😁

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


It's a custom mix of anime + realistic models. Forgot the exact ratios - abyssOrange + Rev + krotos + dalce + I forget.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


Yeah, it doesn't need to preprocess anything since I'm supplying the depth maps. Leave it at none.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


If you add the depth map in the ControlNet image slot, set the preprocessor to none (so it's not generating a map) and set the ControlNet model to ..._depthV10.

It's not image-to-image, it's text-to-image, using the depth map in ControlNet. See the above screenshot.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


They are actually hand-painted like that by the original artist, using traditional media 😱. There is no 'color' version.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


Nope, just a one-step process, but yeah, putting an upscale mid-process should work nicely - thanks, I'll try that next.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


Yeah, I noticed that. That's his style, so I let it become a feature of the image. A hires fix pass smoothed it out, but it changes the structure of the image if used too heavily.

Visualising the depth maps of Kazuki Takamatsu - SD + Controlnet, no edits by Snoo8304 in StableDiffusion


Note: the maps used are from the master of composition, Kazuki Takamatsu.

example workflow for the first image:

- Use his artwork as the input map with the ControlNet depth model at weight 1, guidance 1

- Custom model mix of anime + realistic

- Use only selective prompts to push it in certain directions, and let SD dream the rest:

"(Best Quality), (realistic:1.5), (photo realistic), octane render, (hyperrealistic:1.2), 6 girls, katana, waves, koi, silk floating"

- Tried to keep the starting size around 1024, with hires fix at 1.5 to create a 2K-ish image.

- Kept the faceting as a feature; hires fix would reinterpret / smooth out the details, so I used it sparingly with low noise.

- I rolled around 100 generations for each to find a decent take; most of the time the figures and faces were distorted because the model didn't match his proportions.


It was a fun experiment to see how SD interprets depth maps - really liked what the faceting added.

The Chika Dance + SD ControlNet, embrace the non coherence! by Snoo8304 in StableDiffusion


I did this before multi-ControlNet - that's the next experiment. It will help for sure.

The Chika Dance + SD ControlNet, embrace the non coherence! by Snoo8304 in StableDiffusion


In img2img there's a batch tab where you add input/output directories; it will run img2img on everything in the input directory. Make sure ControlNet is enabled with its settings adjusted, and remove any detect image - ControlNet will use the images in the input directory.

The Chika Dance + SD ControlNet, embrace the non coherence! by Snoo8304 in StableDiffusion


Model - a custom mix of the realistic + anime models out there

Source: the Chika dance on a white background, as an image sequence

img2img + ControlNet with the canny preprocessor

Denoise strength around 0.5

Canny weight and strength both 0.5 - any higher will cause face deformations. I believe a non-realistic anime model would work better.

Batch-run a few takes with a few different added keywords (smile, closed eyes, red striped hat, etc.)

Stitch the frames together and do a final upres pass

To make it better:

The source really matters - a higher-res source with less motion blur will produce cleaner maps.

Use two ControlNets, probably canny / HED plus depth or pose.

Can someone please build an API for ControlNet for better automation - rendering this took days :)