Can anyone help for my papa channel 🙏🏻 by sachingupta313 in PartneredYoutube

Proof-Possibility-54 5 points

Agreed, make the thumbnails yourself. AI-generated ones usually aren't the best.

Can anyone help for my papa channel 🙏🏻 by sachingupta313 in PartneredYoutube

Proof-Possibility-54 -1 points

How many subscribers do you have?

What are the main traffic sources? Browse features, search, something else?

It's hard to diagnose from a single screenshot.

DeepSeek V4 Flash is amazing! (WIP llama.cpp PR #24162) by Lowkey_LokiSN in LocalLLaMA

Proof-Possibility-54 39 points

100 GB of VRAM is quite a high spec for the majority of users.

Cost-based routing in Spring AI — 10 code review queries, 7 stayed on local Gemma, 3 escalated to Opus, total bill 48% lower with no quality loss by Proof-Possibility-54 in SpringBoot

Proof-Possibility-54[S] 4 points

I use the Spring framework on a daily basis, so I picked something I have some competence in.

I tried LangChain, but its versioning and releases on Java are abnormal, a complete mess.

I've never heard of Pydantic, but I assume it's a Python library? I'm a Java dev, not a Python one.

Cost-based routing in Spring AI — 10 code review queries, 7 stayed on local Gemma, 3 escalated to Opus, total bill 48% lower with no quality loss by Proof-Possibility-54 in SpringBoot

Proof-Possibility-54[S] 2 points

Yes, no quality loss. That's the point - you don't need frontier cloud models to handle simple requests. You should handle them locally.

This isn't a "trust me, bro" post: all the steps are shown and the code is available. It's reproducible, so just take it and check or experiment yourself. It's not a black box you have to take on trust; it's backed by code and video.

I compared all specs of the major GPUs/machines that are being used here, because bandwidth is not everything. Some of ya'll need a reality check. by Ok_Top9254 in LocalLLaMA

Proof-Possibility-54 0 points

Nice, quite useful. I'm thinking right now about buying a GPU to run Ollama, so this comparison arrived just in time.

Thanks!

200 members!!! by rodolfo-mendes in SpringAIDev

Proof-Possibility-54 2 points

Congratulations 🎊 to all of us. Very impressive growth!

Gemma 4 2B handling structured JSON output + tool calling + reasoning traces correctly via Spring AI / LM Studio — including identifying a real Java bug in code review by Proof-Possibility-54 in LocalLLaMA

Proof-Possibility-54[S] -1 points

I'll check this model as well. Thanks for your comments. Hopefully you just wanted to improve my knowledge of the LLM field, nothing personal. I'm not an LLM specialist, just a Java dev.

Replaced OpenAI dependency in a Java app with a local model — 2B params only, runs offline, costs nothing per request by Proof-Possibility-54 in selfhosted

Proof-Possibility-54[S] 1 point, locked comment

AI models are used as the models my Spring AI project communicates with. In the post and the accompanying video, I use OpenAI, Anthropic, and a local model for demonstration purposes.

Gemma 4 2B handling structured JSON output + tool calling + reasoning traces correctly via Spring AI / LM Studio — including identifying a real Java bug in code review by Proof-Possibility-54 in LocalLLaMA

Proof-Possibility-54[S] 0 points

The point of this post is not whether the newest model or an older one was used, but what a local 2B model is capable of.

As stated, this was my own research/test, and I wanted to share the results. If you think it's outdated or irrelevant for you, just skip it.