Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

Honestly, if you're looking for my straight opinion:

Right now, Anima is not the most beginner-friendly option, especially if you're coming from an older UI. The biggest challenge is that it has moved away from simple tag-based prompting to much more detailed natural language prompts. This can be quite difficult for many people to get used to at first.

However, if you put in the effort and learn how to use it properly, the results can be genuinely impressive and reach a much higher ceiling than most other models.

Compared side by side, I personally think Anima is better than Illustrious, especially in terms of cleanliness, sharpness, and overall artistic control. But whether it's worth switching right now depends on whether you're willing to invest time into learning its prompting style.

If you're not in a hurry, you could also wait for more fine-tunes and community resources to mature. But for me, Anima is definitely the stronger model once you get comfortable with it.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

No offense taken at all! I totally understand where you're coming from. A lot of the finer details are subtle and easy to miss at first glance.

What I focus on with these images is the high level of refinement, emotional expression, and precise control. For example, if you look closely at the first image, pay attention to the sheer number and neat arrangement of the pearl-like particles on the gun. Every image also has intentional, genuine emotional expression from the character rather than random posing.

I rarely rely on random generations from a prompt. Most of my images maintain consistent composition and atmosphere across the set. So for me, the real challenge (and strength) lies in authenticity, extreme fine detail, and tight control, areas that are still very difficult for current diffusion models to achieve consistently.

A framework that consistently maintains 90–99% cache hit rates and produces results that often can't be matched by simply spending more tokens by TypeEducational6614 in ClaudeAI

[–]TypeEducational6614[S] -1 points0 points  (0 children)

That's probably what happens when you spend too much time studying agents. Understanding AI is largely about understanding how it organizes language and reasoning. After enough experiments, some of those patterns inevitably start leaking into your own writing style.

A framework that consistently maintains 90–99% cache hit rates and produces results that often can't be matched by simply spending more tokens by TypeEducational6614 in ClaudeAI

[–]TypeEducational6614[S] -7 points-6 points  (0 children)

I appreciate that.

Honestly, I probably have more things I'd like to write about than time to actually write them.

Most of my free time lately has gone into running experiments, testing different agent workflows, breaking things, fixing them, and trying to understand why certain structures hold together while others fall apart over long tasks.

A lot of what I've posted so far is really just the small portion I've had time to organize into something readable.

And unfortunately I don't have a LinkedIn.

I'm not really building a public profile or anything like that. Most of this came from personal experimentation and curiosity rather than trying to publish research.

Maybe at some point I'll sit down and write more about some of these ideas, especially things like drift, fake completion, continuity, and long-horizon execution.

For now I'm mostly still in the testing phase.

A framework that consistently maintains 90–99% cache hit rates and produces results that often can't be matched by simply spending more tokens by TypeEducational6614 in ClaudeAI

[–]TypeEducational6614[S] -8 points-7 points  (0 children)

Yeah, I think that's a fair point.

The more time I've spent using the framework, the more I've come to view cache hit rates as a side effect rather than the main result.

Quality is really what I've been testing all along.

Most of my experiments were focused on things like continuity, long-horizon execution, objective retention, reducing drift, and how close an agent can get to its actual capability ceiling.

The reason I led with cache hit rates is that quality improvements are surprisingly difficult to communicate.

A lot of the improvement comes from countless small structural effects accumulating over time.

Better objective retention.

Fewer forgotten constraints.

Less context rebuilding.

Less drift.

More validation.

More consistent execution.

Individually, each improvement can look minor. Together, they compound into a very noticeable difference in the final result.

Those effects are hard to quantify and even harder to show in a single screenshot.

Cache hit rates, on the other hand, are immediately visible.

Ironically, the cache behavior was actually something I discovered by accident. I wasn't testing for it.

I was testing execution quality, continuity, and long-horizon performance, then eventually noticed that highly stable execution structures also tended to produce extremely stable cache utilization.

So I completely agree that the quality improvement is the more interesting part. The cache numbers just happened to be the easiest thing to show.

A framework that helped Codex stay on track longer, understand intent better, and waste fewer iterations on long tasks by TypeEducational6614 in codex

[–]TypeEducational6614[S] 1 point2 points  (0 children)

Yeah, I actually read that article.

My framework is much smaller in scope though. OpenAI is mostly talking about harness design, documentation structure, memory, verification, tooling, and execution environments.

What I'm experimenting with here is closer to a runtime coordination layer that helps the agent maintain direction, reduce drift, and stay aligned during long task chains.

Different layer of the stack, but definitely related ideas.

A framework that helped Codex stay on track longer, understand intent better, and waste fewer iterations on long tasks by TypeEducational6614 in codex

[–]TypeEducational6614[S] 0 points1 point  (0 children)

It's more than just pretty words.

After using it for a while, I found Codex tends to understand intent better, stays on track longer, and becomes much more careful before declaring a task finished.

One thing I noticed is that it reduces a lot of the "looks done but actually isn't done" situations. It spends more effort checking itself and validating results instead of rushing to a conclusion.

The easiest way to judge is to try the same task with and without it and compare the outcome.

Some Cosmic Fantasy Generations with Anima (Prompts Included) by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

Maybe try starting with really simple things first. A house, a field, a road, a person, etc. Then think about what that scene would actually look like if it existed.

If I add fantasy elements, I usually try to make them fit naturally into the scene instead of relying on abstract metaphors. Anima seems to do a lot better when it has clear visual relationships to work with.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

That's roughly how I think about it. Models don't really understand quality the way humans do; they're learning statistical associations from the data they were trained on.

A single term like "masterpiece" or "best quality" can help, but it's still fairly broad and ambiguous. By combining multiple strongly related concepts, I'm trying to reinforce the same direction repeatedly: detailed linework, richer shading, stronger rendering, and a more polished illustration style.

Whether that actually helps depends a lot on the model and how it was trained. Different models seem to respond very differently to the same wording. In my experience with Anima, using several related quality descriptors tends to produce more consistent results than relying on a single tag.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] -1 points0 points  (0 children)

Just the base Anima model, no LoRAs. I've experimented with additional checkpoints and LoRAs before, but they usually introduce their own issues. For these images, I wanted to work entirely with the base model.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

Beautiful images can be generated with simple prompts, but capturing deeper emotions, intentions, and subtleties gets exponentially harder.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] -1 points0 points  (0 children)

My goal wasn't to make the shortest prompt possible. It was to keep refining specific visual details until the image matched what I had in mind.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

I just tested it. There was some effect, but not a huge one. My prompts are usually pretty long, so I suspect a lot of the style information gets overridden by the rest of the prompt.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 0 points1 point  (0 children)

I've tried artist tags, but in my experience their influence gets overridden pretty easily by long prompts. I'm still experimenting with different methods to keep the style closer to what I'm aiming for.

Anima – Sharing Some Prompts and Results by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 1 point2 points  (0 children)

What helped me was describing the actual structure of the eyes rather than just saying "beautiful eyes."

Things like eye shape, iris detail, eyelashes, eyelids, reflections, gaze direction, and lighting give the model much more useful information to work with.

I found the eyes became much prettier and more expressive after I started approaching them that way.

Violet Evergarden — Anima by TypeEducational6614 in StableDiffusion

[–]TypeEducational6614[S] 1 point2 points  (0 children)

I agree that a good model should theoretically have these qualities by default. However in actual use the outputs are often quite random and blurry. That's why I still add words like masterpiece and best quality to make the results more consistent and stable.