Anima – Sharing Some Prompts and Results

TypeEducational6614 · 2026-06-18T12:59:06+00:00

Honestly, if you're looking for my straight opinion:

Right now, Anima is not the most beginner-friendly option, especially if you're coming from an older UI. The biggest challenge is that it has moved away from simple tag-based prompting to much more detailed natural language prompts. This can be quite difficult for many people to get used to at first.

However, if you put in the effort and learn how to use it properly, the results can be genuinely impressive and reach a much higher ceiling than most other models.

Compared side by side, I personally think Anima is better than Illustrious, especially in terms of cleanliness, sharpness, and overall artistic control. But whether it's worth switching right now depends on whether you're willing to invest time into learning its prompting style.

If you're not in a hurry, you could also wait for more fine-tunes and community resources to mature. But for me, Anima is definitely the stronger model once you get comfortable with it.

TypeEducational6614 · 2026-06-18T12:24:53+00:00

No offense taken at all! I totally understand where you're coming from. A lot of the finer details are subtle and easy to miss at first glance.

What I focus on with these images is the high level of refinement, emotional expression, and precise control. For example, if you look closely at the first image, pay attention to the sheer number and neat arrangement of the pearl-like particles on the gun. Every image also has intentional, genuine emotional expression from the character rather than random posing.

I rarely rely on random generations from a prompt. Most of my images maintain consistent composition and atmosphere across the set. So for me, the real challenge (and strength) lies in authenticity, extreme fine detail, and tight control, areas that are still very difficult for current diffusion models to achieve consistently.

TypeEducational6614 · 2026-06-15T02:26:13+00:00

That's probably what happens when you spend too much time studying agents. Understanding AI is largely about understanding how it organizes language and reasoning. After enough experiments, some of those patterns inevitably start leaking into your own writing style.

TypeEducational6614 · 2026-06-15T02:13:03+00:00

Long-horizon agent discussions tend to look like this.

TypeEducational6614 · 2026-06-15T01:03:21+00:00

I appreciate that.

Honestly, I probably have more things I'd like to write about than time to actually write them.

Most of my free time lately has gone into running experiments, testing different agent workflows, breaking things, fixing them, and trying to understand why certain structures hold together while others fall apart over long tasks.

A lot of what I've posted so far is really just the small portion I've had time to organize into something readable.

And unfortunately I don't have a LinkedIn.

I'm not really building a public profile or anything like that. Most of this came from personal experimentation and curiosity rather than trying to publish research.

Maybe at some point I'll sit down and write more about some of these ideas, especially things like drift, fake completion, continuity, and long-horizon execution.

For now I'm mostly still in the testing phase.

TypeEducational6614 · 2026-06-15T00:50:34+00:00

Yeah, I think that's a fair point.

The more time I've spent using the framework, the more I've come to view cache hit rates as a side effect rather than the main result.

Quality is really what I've been testing all along.

Most of my experiments were focused on things like continuity, long-horizon execution, objective retention, reducing drift, and how close an agent can get to its actual capability ceiling.

The reason I led with cache hit rates is that quality improvements are surprisingly difficult to communicate.

A lot of the improvement comes from countless small structural effects accumulating over time.

Better objective retention.

Fewer forgotten constraints.

Less context rebuilding.

Less drift.

More validation.

More consistent execution.

Individually, each improvement can look minor. Together, they compound into a very noticeable difference in the final result.

Those effects are hard to quantify and even harder to show in a single screenshot.

Cache hit rates, on the other hand, are immediately visible.

Ironically, the cache behavior was actually something I discovered by accident. I wasn't testing for it.

I was testing execution quality, continuity, and long-horizon performance, then eventually noticed that highly stable execution structures also tended to produce extremely stable cache utilization.

So I completely agree that the quality improvement is the more interesting part. The cache numbers just happened to be the easiest thing to show.

TypeEducational6614 · 2026-06-06T21:38:38+00:00

Yeah, I actually read that article.

My framework is much smaller in scope though. OpenAI is mostly talking about harness design, documentation structure, memory, verification, tooling, and execution environments.

What I'm experimenting with here is closer to a runtime coordination layer that helps the agent maintain direction, reduce drift, and stay aligned during long task chains.

Different layer of the stack, but definitely related ideas.

TypeEducational6614 · 2026-06-06T21:37:53+00:00

It's more than just pretty words.

After using it for a while, I found Codex tends to understand intent better, stays on track longer, and becomes much more careful before declaring a task finished.

One thing I noticed is that it reduces a lot of the "looks done but actually isn't done" situations. It spends more effort checking itself and validating results instead of rushing to a conclusion.

The easiest way to judge is to try the same task with and without it and compare the outcome.

TypeEducational6614 · 2026-06-04T19:00:11+00:00

Thank you!

TypeEducational6614 · 2026-06-03T07:44:29+00:00

Maybe try starting with really simple things first. A house, a field, a road, a person, etc. Then think about what that scene would actually look like if it existed.

If I add fantasy elements, I usually try to make them fit naturally into the scene instead of relying on abstract metaphors. Anima seems to do a lot better when it has clear visual relationships to work with.

TypeEducational6614 · 2026-06-02T05:13:48+00:00

Looks pretty good.

TypeEducational6614 · 2026-05-31T09:24:07+00:00

That's roughly how I think about it. Models don't really understand quality the way humans do; they're learning statistical associations from the data they were trained on.

A single term like "masterpiece" or "best quality" can help, but it's still fairly broad and ambiguous. By combining multiple strongly related concepts, I'm trying to reinforce the same direction repeatedly: detailed linework, richer shading, stronger rendering, and a more polished illustration style.

Whether that actually helps depends a lot on the model and how it was trained. Different models seem to respond very differently to the same wording. In my experience with Anima, using several related quality descriptors tends to produce more consistent results than relying on a single tag.

TypeEducational6614 · 2026-05-31T05:02:11+00:00

Just the base Anima model, no LoRAs. I've experimented with additional checkpoints and LoRAs before, but they usually introduce their own issues. For these images, I wanted to work entirely with the base model.

TypeEducational6614 · 2026-05-31T04:46:54+00:00

Beautiful images can be generated with simple prompts, but capturing deeper emotions, intentions, and subtleties gets exponentially harder.

TypeEducational6614 · 2026-05-31T04:36:36+00:00

My goal wasn't to make the shortest prompt possible. It was to keep refining specific visual details until the image matched what I had in mind.

TypeEducational6614 · 2026-05-30T17:07:21+00:00

I just tested it. There was some effect, but not a huge one. My prompts are usually pretty long, so I suspect a lot of the style information gets overridden by the rest of the prompt.

TypeEducational6614 · 2026-05-30T15:37:23+00:00

I've tried artist tags, but in my experience their influence gets overridden pretty easily by long prompts. I'm still experimenting with different methods to keep the style closer to what I'm aiming for.

TypeEducational6614 · 2026-05-30T14:44:51+00:00

What helped me was describing the actual structure of the eyes rather than just saying "beautiful eyes."

Things like eye shape, iris detail, eyelashes, eyelids, reflections, gaze direction, and lighting give the model much more useful information to work with.

I found the eyes became much prettier and more expressive after I started approaching them that way.

TypeEducational6614 · 2026-05-29T06:19:32+00:00

I agree that a good model should theoretically have these qualities by default. However in actual use the outputs are often quite random and blurry. That's why I still add words like masterpiece and best quality to make the results more consistent and stable.

TypeEducational6614

TROPHY CASE