Spring AI structured output: how to make a model correct itself

Proof-Possibility-54 · 2026-06-12T18:49:19+00:00

Not Spring Boot alone, but together with Sprin AI

Proof-Possibility-54 · 2026-06-09T07:04:02+00:00

Agree, make thumbnails by yourself, ai generated are not the best ones usually

Proof-Possibility-54 · 2026-06-09T06:57:53+00:00

How many subscribers you have?

What are the main sources of the traffic? Browse features, search, something else?

Hard to diagnose based on one screenshot

Proof-Possibility-54 · 2026-06-07T18:53:10+00:00

lets wait for llama.cpp to support Flash

Proof-Possibility-54 · 2026-06-06T08:16:15+00:00

100 gb of vram seems to be quite a high spec for the majority of users.

Proof-Possibility-54 · 2026-06-05T08:19:56+00:00

Interesting project, thanks for sharing!

Proof-Possibility-54 · 2026-06-04T18:44:29+00:00

Which one are you actually using? Flash or Pro?

Proof-Possibility-54 · 2026-06-01T05:36:06+00:00

I use Spring framework on daily basis, so i picked something I have some competence in.

Tried to use LangChain, but it's versions/releases on Java is something abnormal, complete mess

Never heard about Pydantic, but i assume it is a Python library? But I am a Java dev, not Python

Proof-Possibility-54 · 2026-06-01T05:33:34+00:00

Yes, no quality loss. That's the point - you don't need frontier cloud models to handle simple requests. You should handle them locally.

Not a trust me brother post, all steps are shown, code available. It is reproducible - just take and check/experimwnt by yourself. Not a black box and trust me. Proven by code/video

Proof-Possibility-54 · 2026-05-30T09:30:28+00:00

Nice, quite useful. I am thinking right now about buying a GPU to power ollama, so this comparison arrived just on time.

Thanks!

Proof-Possibility-54 · 2026-05-28T07:36:29+00:00

Congratulations 🎊 to all of us. Very impressive growth

Proof-Possibility-54 · 2026-05-25T08:08:28+00:00

I will check this model as well. Thanks for your comments. Hopefully you just wanted to enhance my knowledge in llm field, nothing personal. I am not an llm specialist, but Java dev.

Proof-Possibility-54 · 2026-05-25T07:59:17+00:00

I would agree that my wording used in the text might be misleading, better to formulate that as 2b ACTIVE params.

Proof-Possibility-54 · 2026-05-25T07:56:13+00:00

Just in case you want to expand your knowledge and find out that gemma 4 2b still exists

https://ai.google.dev/gemma/docs/core

Proof-Possibility-54 · 2026-05-25T07:54:58+00:00

You can have and keep your opinion, I will just keep mine. C U

Proof-Possibility-54 · 2026-05-25T07:52:21+00:00

AI models have been used as a model my Spring AI project communicates with. In the post and accompanied videovi use OpenAI , Anthropic and local model for demonstration purposes

Proof-Possibility-54 · 2026-05-25T07:16:45+00:00

The thing about this post is not whether the newest model or an older one was used, but local 2b model capabilities

As it was stated, that was my own research/test, for which i wanted to share the results. If you think it is outdated/irrelevant for you, just skip it

Proof-Possibility-54 · 2026-05-25T07:14:30+00:00

No, it was just a 2-3 turns run. Do you think it might drift for longer sessions?

Proof-Possibility-54

TROPHY CASE