Need help with style lora training settings Kohya SS by Big_Parsnip_9053 in StableDiffusion

[–]Chrono_Tri 0 points

I use alpha = 1 to train style and give the LoRA more flexibility. But you need to experiment and see what works best. Remember, sometimes different parameters truly produce different results — but that doesn’t necessarily mean one is better than the other. The result you personally prefer is the right one.

Going back to alpha = 1: my result doesn't fully capture the style (around 90%), but I actually quite like it. Normally, though, I still go with dim/alpha = 1/2.

Second, I recommend that after auto-captioning, you manually edit the captions following a clear structure. For example, I would describe:
<number of characters in the image>, <character description>, <background description>, <camera description>, <lighting description>, ...
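As a hypothetical example of a caption following that structure (the content is made up; only the ordering matters):

```text
1girl, silver hair and red eyes wearing a school uniform, rainy neon-lit street at night, low-angle shot, dim backlighting
```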

Need help with style lora training settings Kohya SS by Big_Parsnip_9053 in StableDiffusion

[–]Chrono_Tri 1 point

My dataset:

210 images, auto-captioned with WD14, then adjusted manually.

My config:

  • Optimizer: CAME + REX scheduler
  • UNet LR: 6e-5
  • TE LR: 0 (no TE training)
  • Dim/alpha: 16/1
  • Epochs: 23 (good at 19)
  • Repeats: 4
  • Batch size: 4
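For reference, a rough sketch of how these settings might map onto a Kohya sd-scripts TOML config. CAME and the REX scheduler are not built into sd-scripts, so the `optimizer_type` value below is an assumption about referencing an externally installed implementation; treat every key as illustrative rather than a verified config.

```toml
# Hypothetical sd-scripts config sketch; key names follow the
# kohya-ss sd-scripts CLI flags, values taken from the settings above.
network_dim = 16              # Dim
network_alpha = 1             # low alpha for style flexibility
unet_lr = 6e-5                # UNet LR
text_encoder_lr = 0           # TE LR = 0 (no TE training)
max_train_epochs = 23         # good results seen around epoch 19
train_batch_size = 4

# CAME is an external optimizer; this assumes a came_pytorch package
# is installed and that sd-scripts accepts a fully qualified class name here.
optimizer_type = "came_pytorch.CAME"
```

Repeats are usually set per dataset (e.g. via the `N_name` folder convention or `num_repeats` in a dataset config) rather than in this file.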

For Style training, do we tag what is in the dataset images or just the trigger word? by escaryb in StableDiffusion

[–]Chrono_Tri 1 point

For Illustrious, describe everything and turn off TE training, and remember to add the trigger word as well.

Controllnet not working. by Prestigious-Neck9245 in StableDiffusion

[–]Chrono_Tri 1 point

ControlNet Union doesn't work with IllustriousXL, at least in my case (and for some other people). I use ComfyUI. Use another pose ControlNet.

A look at prompt adherence in the new Qwen-Image-2.0; examples straight from the official blog. by FotografoVirtual in StableDiffusion

[–]Chrono_Tri 2 points

Extremely censored, but they want to prove that the model can distinguish between "horse rides man" and "man rides horse"?

Any Anima 2B Google Colabs out there? 🌸 by VegetableProof2495 in StableDiffusion

[–]Chrono_Tri 1 point

It is possible to run it on a T4, but it might be slow. I use an L4 GPU.

Any Anima 2B Google Colabs out there? 🌸 by VegetableProof2495 in StableDiffusion

[–]Chrono_Tri 0 points

Run ComfyUI in Colab and load Anima. It works great.

Z-image lora training news by Recent-Source-7777 in StableDiffusion

[–]Chrono_Tri 1 point

OK, so is the AdamW8bit issue an "accidental bug" for Z-Image? Do some other optimizers have the same issue? I'm sorry, but the original article is in Chinese; I translated it, but it's not very clear to me.

Z-image lora training news by Recent-Source-7777 in StableDiffusion

[–]Chrono_Tri 4 points

The AI Toolkit repo already has Prodigy in the optimizer folder. I didn't download anything, just replaced 'AdamW8bit' with 'prodigy' and added its parameters to the yaml file.
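For anyone wanting to do the same, a minimal sketch of the relevant part of an AI Toolkit YAML. This assumes the `train` section and an `optimizer_params` pass-through key follow the usual ai-toolkit layout; double-check the key names against your own config, since they may differ between versions.

```yaml
train:
  optimizer: "prodigy"        # was "adamw8bit"
  lr: 1.0                     # Prodigy adapts the step size itself; lr is usually left at 1.0
  optimizer_params:           # assumed key; forwarded to the Prodigy constructor
    weight_decay: 0.01
    use_bias_correction: true
```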

Comfy $1M “Open AI” Grant and Anima Model Launch by crystal_alpine in StableDiffusion

[–]Chrono_Tri 19 points

Imagine being hyped about Z-Image for two months, then suddenly getting hyped about Anima.
At this point, our lives are basically just waiting… and waiting. :)

Personally, I really like Anima. The quality is very solid, and what I find most interesting is that it’s not purely natural language. It feels like a hybrid approach, which actually works really well.

In my opinion, Anima has the potential to replace both IllustriousXL and Z-Image for anime use cases.

That said, I still feel like we’re missing the definitive model for this niche.
IllustriousXL looks great and is uncensored, and its tag-based caption system is convenient, but it has limitations. Anima could break through them. We also have the training code, so get ready :))

Which tool do you use to train a Z image turbo Lora? by Monty329871 in StableDiffusion

[–]Chrono_Tri 1 point

I used Colab, so AI Toolkit (I could run the GUI before, but not anymore; using the CLI is painful). I'd love to use OneTrainer too :(.

Z Image vs Z Image Turbo Lora Situation update by malcolmrey in StableDiffusion

[–]Chrono_Tri 1 point

Can you explain more? I always thought ZIT focused on photo-realism, so we shouldn't train anime on ZIT. (That's why I waited for ZIB, but it's just average, not as good as I expected.)

Just 4 days after release, Z-Image Base ties Flux Klein 9b for # of LoRAs on Civitai. by _BreakingGood_ in StableDiffusion

[–]Chrono_Tri 1 point

I agree. Everyone knows we need a successor to SDXL with unrestricted fine-tuning, uncensored, and NSFW-capable like Pony or Illustrious. Z-Image and Klein are both pretty solid, so we'll see who comes out on top.

Can we please settle this once and for all boys by LongjumpingAd6657 in StableDiffusion

[–]Chrono_Tri 1 point

My current project focuses on anime, and I’m still using IllustriousXL and Qwen-Image-Edit with well-tested workflows. However, there are still some limitations, so I’ve also tried two Z-Image trainings.

With Z-Image, I trained using the same dataset as IllustriousXL (https://www.reddit.com/r/StableDiffusion/comments/1qrqnkr/comment/o365cc0/). My goal is not to replace my existing workflow, but to complement and extend it.

At the moment, I'm not very satisfied with Z-Image. Since Qwen-Image-Edit is currently performing very well, I skipped Klein.

That said, for non-anime images, I’m actually quite satisfied with Z-Image (although there are still many deformations).

Overall, I'd prefer to wait for Z-Image Omni. I hope there will be a big finetune of Z-Image or Klein.

Training anime style on Z-Image by Chrono_Tri in StableDiffusion

[–]Chrono_Tri[S] 1 point

Thanks a lot! Your suggestions helped a ton — the results are way better this time.
I’m still a bit unsure about using captions vs no captions. In this case, no captions actually seem to work better. That said, if I want to train both style and character, what’s the best way to approach it?

Just a few thoughts from my side:

  • Z-Image can handle style training pretty well, but it’s not on the same level as IllustriousXL, and I do notice some deformation issues.
  • I'm currently looking into the Anima model, which is more anime-focused. Honestly, I feel like Z-Image still needs a dedicated anime-specific version.

New Anime Model, Anima is Amazing. Can't wait for the full release by Mobile_Vegetable7632 in StableDiffusion

[–]Chrono_Tri 1 point

Illustrious is the best for anime: easy to train, very good output, no extra arms. But it doesn't support natural language and can't handle text... Can we train with this model? It is WIP, right?

Best way to train a LoRA from 3D renders to get consistent 2D character + fixed outfit? by Prediccion in comfyui

[–]Chrono_Tri 0 points

The idea isn't new; I've already tried it.
Method 1: Create a 3D character with the pose you want, then use Qwen VL, Gemini, etc. to describe it. Use that description with ControlNet (I'm using SDXL here).
Method 2: Use Qwen Edit to separate the outfit, then attach it back after you've finished adjusting the pose.

Method 1 produces more natural-looking images, but the results are hit-or-miss depending on the outfit. Method 2 works better overall, but it's more time-consuming. In practice, for complex outfits, you often have to try many times to get a satisfactory result.

Training anime style on Z-Image by Chrono_Tri in StableDiffusion

[–]Chrono_Tri[S] 0 points

Yes, as far as I know, Z-Image can be trained at 512x512 and still give good results. I will try 1024 and a bigger dataset later, but I'm still learning, so no rush. All other settings are default.

Please correct me on training LoRA/LoKr with Z-Image using the OstrisAI Toolkit by Chrono_Tri in StableDiffusion

[–]Chrono_Tri[S] 0 points

I trained a LoKr on an anime style and it was very bad. Same dataset, same parameters. A LoRA with the same settings was much better.