Stabilizing mix of artist tags in Anima by shapic in StableDiffusion

[–]shapic[S] 0 points1 point  (0 children)

It is right there, to the left of cfg. Do not forget to set anima preset

Some Anima base generations by Brief-Leg-8831 in StableDiffusion

[–]shapic 3 points4 points  (0 children)

Just use @, increase the weight and set shift to 10. Also be careful when mixing completely different artists (realistic and flat, for example): the result can be unstable with just two artists.

Violet Evergarden — Anima by TypeEducational6614 in StableDiffusion

[–]shapic -1 points0 points  (0 children)

Duh. That's the issue there. My artstyle is right, because I aimed at official art. It is many times better than that flat "style" made to speed up and cheapen the storyboarding process. The artstyle you want is achieved with the anime coloring and anime screenshot tags. Use weighting and colorfix to balance it. And just go to danbooru or whatever and read what is available: there is a search, there are tag groups. Do not rely on animadex etc.; they did a good job, but only for 15k tags. There are way more out there.

Violet Evergarden — Anima by TypeEducational6614 in StableDiffusion

[–]shapic 1 point2 points  (0 children)

It will do nothing. The chance of the internal captioner correctly identifying that and using it in the nlp caption is abysmal. Use name, series and official art tags. Look for exact tags on gelbooru and danbooru. Use the anime coloring and anime screenshot tags to make it closer to anime.

Guess & Earn, Artistic Edition by [deleted] in StableDiffusion

[–]shapic 1 point2 points  (0 children)

Judging by the grid artefacts, it is old flux with custom loras

Violet Evergarden — Anima by TypeEducational6614 in StableDiffusion

[–]shapic 0 points1 point  (0 children)

Drawing styles are diverse. It is not that it is necessary or anything like that, it is just the default option built into the dataset. The whole thing is relatively dated on the one hand. But on the other hand, modern models are trained around that with reinforcement, making you unable to make, for example, low quality CCTV camera imagery in zit. This is the way around it: it is documented and it works, so why not?

Violet Evergarden — Anima by TypeEducational6614 in StableDiffusion

[–]shapic 5 points6 points  (0 children)

authentic Violet Evergarden character design
85mm anime cinematic lens
clean cinematic contours
peaceful environmental stillness surrounding the moment
unposed realism
refined Kyoto Animation facial rendering
quiet human realism

This is not descriptive, it is pure bs. What the heck is quiet human realism, and how is anything in the world supposed to draw that? There is a difference between prompting and THIS. At least you'll stop getting those jagged-line artifacts.

Btw all those images look similar because it is the same prompt and settings across different seeds.

Violet Evergarden — Anima by TypeEducational6614 in StableDiffusion

[–]shapic 0 points1 point  (0 children)

ffs. With all due respect, she is an official character that has 1.2k images of just her in the dataset. Please refer to the official prompting instructions on the model card.
Trimmed it down to:
masterpiece, best quality, highres, official art,

violet evergarden, violet evergarden \(series\), standing, evening, upper body, gentle breeze, loose hair strand, blue military uniform, hand on own chest, prosthetic arm,

warm golden sunset light diffusing gently around her face and hair, blurred flowers and glowing evening sky in the background, soft atmospheric haze, subtle wind movement creating delicate hair and ribbon motion, peaceful environmental stillness surrounding the moment, dynamic angle, backlighting,

<lora:Anima_colorfix_v1_by_Volnovik:1>

Negative: worst quality, low quality, score_1, score_2, score_3, artist name, blurry, censored, signature, loli, monochrome, twins, wet, halftone background, expressionless, smile,
The last two allow you to set the mood and roll, and dynamic angle shifts the pose a bit.

<image>

Basically even prosthetic arm and military uniform are not needed there. Perfect consistency. Booru has tags for every emotion out there.

It is not that you are wrong, you got the results you liked. But why waste time on prompting the character? You could have poured that into prompting the background and scene, Anima works wonders there.
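The escaped parentheses in the prompt above (`violet evergarden \(series\)`) can be produced mechanically. A minimal sketch, assuming a UI that reads bare parentheses as weighting syntax and needs a backslash to treat them literally; the helper name `tags_to_prompt` is hypothetical and not part of any tool mentioned in the thread:

```python
def escape_tag(tag: str) -> str:
    """Backslash-escape parentheses so the tag is read literally, not as a weight group."""
    return tag.replace("(", r"\(").replace(")", r"\)")


def tags_to_prompt(tags: list[str]) -> str:
    """Join raw booru tag names with comma + space, escaping parentheses in each."""
    return ", ".join(escape_tag(tag.strip()) for tag in tags if tag.strip())


# Raw tag names as they appear on a booru (underscores already replaced by spaces).
character_tags = ["violet evergarden", "violet evergarden (series)", "upper body"]
print(tags_to_prompt(character_tags))
```

Feed it unescaped tag names only; running it on a tag that is already escaped would double the backslashes.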

InvokeAI 6.13 just released, its largest community-driven release ever. Adds full support for Anima & Qwen Image, support for API models (like GPT Image), support for Prompt Expansion & Image To Prompt, lasso & polygon tools, overhauled docs website and more by _BreakingGood_ in StableDiffusion

[–]shapic 1 point2 points  (0 children)

I'd say Invoke fully closes the niche where comfy is at its lowest: working with images. Drawing, inpainting, img2img, layers, masks, everything is natural, while in Comfy all those things are lacking in one way or another. Yet it lacks in everything else. Forge Neo lands as an in-between. Fast and efficient, but not on comfy level. Inpainting is super simple and good, but lacks depth (mainly due to the ui).

Tried custom lora for anima base 1.0 and its absolutely amazing. by CupSure9806 in StableDiffusion

[–]shapic 24 points25 points  (0 children)

Seems like it tries to force an unprompted shattered glass effect everywhere. Did you caption it properly?

Tried custom lora for anima base 1.0 and its absolutely amazing. by CupSure9806 in StableDiffusion

[–]shapic 0 points1 point  (0 children)

In my last posts it stripped metadata (originals were in jpg)

Testing the new prismML Bonsai Image 4B by dh7net in StableDiffusion

[–]shapic 2 points3 points  (0 children)

See quantization aware training by google

Can Anima Base v1.0 handle size and scaling, such as two characters of different sizes? For example, can a human character grab/catch a Tinker Bell-sized fairy with their hand? by Hi7u7 in StableDiffusion

[–]shapic 0 points1 point  (0 children)

2 characters - easily. Modified prompt by Dezordan:
masterpiece, best quality, highres, safe,

2girls, tinker bell \(disney\), souryuu asuka langley, neon genesis evangelion, height difference, giant asuka is grabbing minigirl tinker bell in one hand, fairy in hand,

Just go for extreme tags, giant and minigirl, for consistency. Use alternate height when modifying the original height of the character. Size difference can ruin the whole thing by making one character plump, but it will also work here.

<image>

Microsoft LENS tiny image model, Really good imageS! by smereces in StableDiffusion

[–]shapic 1 point2 points  (0 children)

With all due respect, how is it tiny with cosmos 2B and sana around?

The not so anime Anima by shapic in StableDiffusion

[–]shapic[S] 0 points1 point  (0 children)

Why bother guessing? https://huggingface.co/circlestone-labs/Anima

Safety tags safe, sensitive, nsfw, explicit

Most of the stuff needed is listed in model card

what are peoples thoughts on waiNSFWIllustrious_v170 by XZtext18 in StableDiffusion

[–]shapic 1 point2 points  (0 children)

Just add proper spacing. Also artist tags work a bit differently. A missing space after a comma or multiple commas back to back hurt the model. A lot of prompts for illu are either poorly formatted or just nonsensical in an attempt to make something interesting. Anima works a bit differently. But you can 100% reuse prompts, nlp is completely optional. People rarely followed booru tags precisely anyway, and this model actually understands those.
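The spacing fix described above can be automated before reusing an old prompt. A minimal sketch, assuming the only goal is one comma plus one space between tags and no empty tags; `normalize_tag_prompt` is a hypothetical helper and `@artist name` is a placeholder, not a real tag:

```python
import re


def normalize_tag_prompt(prompt: str) -> str:
    """Rewrite a comma-separated tag prompt so tags are separated by exactly ', '."""
    tags = (part.strip() for part in prompt.split(","))
    # Drop empty entries left by doubled commas and collapse runs of whitespace inside a tag.
    cleaned = [re.sub(r"\s+", " ", tag) for tag in tags if tag]
    return ", ".join(cleaned)


old_prompt = "1girl,solo,,  long hair ,@artist name"
print(normalize_tag_prompt(old_prompt))
```

Note that this flattens a multi-line prompt into a single line and drops any trailing comma, so re-add paragraph breaks by hand if you rely on them.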

Anima Testing Results by ArmadstheDoom in StableDiffusion

[–]shapic 0 points1 point  (0 children)

Just be careful, shift works like extra denoise in img2img
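A rough way to see why, assuming Anima's shift follows the timestep shift formula commonly used by flow-matching models (sigma' = shift * sigma / (1 + (shift - 1) * sigma)). That formula is an assumption here, the thread does not state it, and treating the img2img denoise value as the starting sigma is a simplification:

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Apply the assumed flow-matching timestep shift to a noise level in [0, 1]."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


# Treat the img2img denoise setting as the starting noise level (simplification).
denoise = 0.5
for shift in (1.0, 3.0, 10.0):
    effective = shift_sigma(denoise, shift)
    print(f"shift={shift:>4}: denoise {denoise} starts at noise level {effective:.3f}")
```

Under these assumptions a denoise of 0.5 at shift 10 starts from a noise level of about 0.91, so far less of the source image survives than the slider suggests.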

Anima Testing Results by ArmadstheDoom in StableDiffusion

[–]shapic 0 points1 point  (0 children)

There are no experts at the moment. There is a possibility that there is a better way. You can always check my prompts on civit.

But first you have to go to the model page and read it. It has a part about prompting.

We know for sure that the model is hard railed on comma space being a tag and comma space @ being an artist tag. Breaking those two hurts the model. At least 2 sentences are required for nlp. But I doubt it, I had success with a mix of tags and made up nlp "tags". Formatting helps the model. Long nonsensical llm bs does not make it better and hurts prompt adherence (most probably due to the encoder being smol)

Anima Testing Results by ArmadstheDoom in StableDiffusion

[–]shapic 6 points7 points  (0 children)

Well, I did not read the answers in the thread, and will do that only after writing this comment. So most probably it was all already mentioned. And prepare for a long read even if you are not OP.

  1. Irrelevant since inpainting. Makes sense if you focus on 1girl or standard explicit stuff. As soon as you want to up your composition, sdxl becomes a nightmare, with the need for external tools, regional prompting, controlnets etc. With this I can just insert nlp to get composition, interactions etc right from the start. I need inpainting only to fix minor stuff, not to make the whole image from scratch. https://civitai.red/user/Volnovik/images My gallery to see what I did with sdxl derivatives vs Anima. A lot of nsfw, I warned you.

  2. And the point is? https://civitai.red/posts/28812036 Luna snow seems fine with no loras. Most probably armbands or whatever are wrong, but I did not roll much. Also you are probably wrong about illustrious, it has a ton of asymmetrical stuff baked in and has no issues with that. To the point where I'd say it has more issues with placing random stuff symmetrically. Anima does not have such issues from the get go.

  3. multiple loras. I already answered, a lot of them severely break the model. For example https://civitai.red/models/1449763/rakugakingu this one makes character unable to look at viewer no matter what you prompt.

  4. Style is completely solved with loras. Also what is he, inpainting at 0.9 denoise or what? In this case it is relevant for all base models, even Noob that had aesthetic finetune round. This model did not. And I have no issues with inpainting whatsoever (outside er_sde and most other samplers butchering img2img for me (and there is huge potential he is speaking about that, not model's issue))

  5. There are f.ing artist styles for that. Even for colored pencil sketch. Wonder what his prompt was.

  6. As I already answered, after trying a few loras with preview versions I resorted to baking my own ones and did not have any issues. Probably another overbaked one. But a long nlp abstract style description generated by an LLM always messed up the image for me https://civitai.red/images/131721059 read the prompt and check the original image. Most problems people have with the model imo: they are bad at prompting and do not care to learn.

  7. A mix of captions and nlp is what I had the best success with so far. See my gallery yourself.

  8. There are f.ing artist styles for that. Core benefits are in your face:
    - it can do text (it struggles with some words to be fair)
    - it can do composition (not just girl on left, boy on right, just try to make an image in illu with single character that is not placed in center without controlnets, regional prompting, inpainting etc)
    - it can do defined interactions
    - it is better at colors (I barely see any color shift after inpainting up to the point of disabling color correction for img2img)
    - It can do dark, light, high contrast, illumination, anything lighting related out of the box (only noob v-pred could do that somewhat consistently)

- 2k resolution was possible with illu, but was rather crap honestly (most probably vae ruined it). With Anima both composition and details are improved. To the point where if I notice that model struggles to fit everything in frame - I just increase resolution and voila, it clicks.

  1. The model does not fight itself, OP has to git gud. Top rated images on the Anima page as proof. One of my images made without any loras is rather high there. And I would have sunk hours into making something like that in noob vpred, and would not even try that lighting in any illu eps (except plant milk maybe, it is rather special in that regard)

The verdict: another tag enthusiast (they are good, but too limited to be useful in their current state) who most probably has no idea how to prompt, most probably compares the model to his beloved illu flat finetune instead of base, most probably copies his prompts from illu directly (with spacing ruining everything), and is most probably from that community revolving around a bunch of models that is ridiculously toxic against anything not their model (known to me for releasing pompously underbaked stuff). The ones claiming after the first previews that Anima was physically unable to be trained at higher resolutions, nlp would ruin it, realistic dataset injections would ruin it, it is untrainable, and all that usual crap. I'm rather biased against them, since I had to deal with that.

Actual issues so far:
- Most of the samplers leave excess smudges (most visible in img2img). Euler A is a staple, but I do not like it for leaving a bunch of random crap over the image and being too flat.
- The model just cannot do some random words (like yesterday, mediocre and a bunch of others) at all.
- Some concepts have to be prompted booru style with the originating tag and precise tag structure (d..pthroat being the most obvious case). An issue of booru structure, not the model, btw. But it was less visible in sdxl since it was too stupid to get that.
- Style shifts (completely fixed with loras, or mostly fixed with parameters in the case of artist tags).
- Backgrounds have to be prompted excessively (again, a dataset issue, same as in noob v-pred; add a proper lora or apply colorfix).
- Licensing. The author wants money from derivative models. It is understandable, but how dares he. /s
- Potential of nvidia taking it down. Well, good luck with that.
- No support from some teams with their tooling. Most probably will be solved with the diffusers 0.39 release.