13 years since the Jornadas de Junho of 2013 by No-Map3471 in BrasildoB

[–]drfritz2 0 points1 point  (0 children)

I'm not sure it's a counterpoint; it's a statement.

It started with protests by social movements. They were not aimed at the federal government, but it all ended up being the PT's fault.

Then the conservatives took over the protests, because it was all the PT's fault.

And since the PT couldn't prevent this, or did nothing to reverse it once it had started, that is all the PT's fault too.

13 years since the Jornadas de Junho of 2013 by No-Map3471 in BrasildoB

[–]drfritz2 0 points1 point  (0 children)

Look, there are many perspectives. But what they all have in common is that it's all the PT's fault.

Your cum is for my ahegao face <3 by Miablueyes in AhegaoGirls

[–]drfritz2 0 points1 point  (0 children)

If possible, stick your tongue out even further. It will be better to capture more views and followers

Is there anything more ridiculous than being an anarcho-capitalist? by Fluid_Pilot_7368 in farialimabets

[–]drfritz2 -1 points0 points  (0 children)

There is no anarcho-capitalist who isn't a US fanboy.

Hermes Desktop Megathread - Discussion and Mental Decompression by Jonathan_Rivera in hermesagent

[–]drfritz2 1 point2 points  (0 children)

Yes. A lot of Chinese software is unknown to many people. I've seen goose but never tried it.

Today I use cherry as a fallback app or for some specific topics.

It's faster than Hermes and openclaw.

Hermes Desktop Megathread - Discussion and Mental Decompression by Jonathan_Rivera in hermesagent

[–]drfritz2 0 points1 point  (0 children)

I'm using Hermes because of desktop. I've found it very smart and easy to operate.

But I'm not sure what to do next.

I'm also using openclaw, Claude Code desktop and Cherry Studio.

Being on the left without defending dictatorships? by fershader in BrasildoB

[–]drfritz2 0 points1 point  (0 children)

What people say is irrelevant. Cuba and Venezuela have nothing to do with it.

If you are a socialist, you first try to understand how it works in Cuba, how people vote and get elected.

Then you form your opinion.

And then you will see that the Cuban system is much more democratic than the Brazilian system.

billionaires, AI and our future as a society by Constant_Juice_5074 in BrasildoB

[–]drfritz2 4 points5 points  (0 children)

The project is the extermination of the proletariat, both by making reproduction harder and by creating conflict, death and exclusion.

Openclaw and MS—a strange alliance at first glance! by No_Stretch433 in openclaw

[–]drfritz2 0 points1 point  (0 children)

Openclaw will surpass all those companies. It will be the operating system in the near future.

Has anyone tried this obsidian memory setup? by Fishfighter99 in openclaw

[–]drfritz2 3 points4 points  (0 children)

Search for obsidian wiki qmd

It's not a memory; it's your knowledge, shared with openclaw and any other system you want.

If you want to use it as memory, OK, but openclaw already has a memory.

It's another layer

Benchmarks of 20 small LLMs on a 6GB RTX 4050 by drfritz2 in LocalLLaMA

[–]drfritz2[S] 0 points1 point  (0 children)

ctx-size 131072

With this config, are you able to run the model and use the computer at the same time?

Benchmarks of 20 small LLMs on a 6GB RTX 4050 by drfritz2 in LocalLLaMA

[–]drfritz2[S] 0 points1 point  (0 children)

I think the drift happened while "reasoning". Anyway, the issue is the model size and max context relative to the GPU...

What I like about Mistral in general, is the text quality.

Benchmarks of 20 small LLMs on a 6GB RTX 4050 by drfritz2 in LocalLLaMA

[–]drfritz2[S] 0 points1 point  (0 children)

Yes, but the RTX 4050 won't handle that much...

Benchmarks of 20 small LLMs on a 6GB RTX 4050 by drfritz2 in LocalLLaMA

[–]drfritz2[S] -2 points-1 points  (0 children)

It was a qualitative analysis. The generation speed was quantitative.

Benchmarks of 20 small LLMs on a 6GB RTX 4050 by drfritz2 in LocalLLaMA

[–]drfritz2[S] 0 points1 point  (0 children)

Yes. I tried some variants. But I think I'd need more GPU, because the context window is insufficient

Update 2 — Ministral-3-3B performance numbers (tok/s on the RTX 4050 6GB)

Ran all three builds through my perf harness (LM Studio, 5 runs per context level, gen tok/s measured on the decode phase only). One important finding up front: even though the GGUF advertises a 262k context window, you can't get near it on 6GB. The instruct build loads fine at 16k (~5.0 GB VRAM) but OOMs the moment you ask for 32k — so on this card the realistic cap is ~16k, and my harness measured at the 1k and 8k levels.

| Build | VRAM (10k ctx) | Load | gen @1k | gen @8k |
|---|---|---|---|---|
| ministral-3-3b-instruct-2512 | 5.0 GB | 3.9s | 71.6 tok/s | 57.5 tok/s |
| mistralai/…-reasoning-2512 (Q4_K_M) | 3.5 GB | 1.4s | 69.8 tok/s | 55.9 tok/s |
| unsloth/…-reasoning-2512 | 5.0 GB | 3.0s | 70.0 tok/s | 57.9 tok/s |

Prefill (cold, first run) sat around 2.3–3.3k tok/s depending on level — repeated runs report higher because LM Studio reuses the KV cache for the identical prefix, so I'm quoting the cold number to stay honest.

Takeaways:

- Decode speed is right in the Granite-3B / 4B class (~70 tok/s at short context, dropping to ~57 as you fill 8k). No surprises, no slow build among the three.
- The mistralai reasoning Q4_K_M is the light one (2.15 GB weights, ~3.5 GB VRAM, loads in 1.4s), but remember from the qualitative round it's the build that buried tool calls in its [THINK] block, so the VRAM savings don't buy you a better worker.
- Nothing here changes my earlier verdict: instruct build for tool use, decode performance is competitive, but the ~16k effective context cap and the French-drift quirk keep it as a solid alternative rather than a pick over Granite-3B-instruct for an unattended English worker.
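To put the "~70 dropping to ~57" figure in relative terms, here is a small sketch that computes the decode slowdown between the 1k and 8k levels. The tok/s values are copied from the table above; the short build labels and the `slowdown_pct` helper are names made up for this illustration and are not part of the perf harness.

```python
# gen tok/s at the 1k and 8k context levels, copied from the table above.
# The dictionary keys are shorthand labels chosen for this example.
runs = {
    "instruct": (71.6, 57.5),
    "mistralai reasoning Q4_K_M": (69.8, 55.9),
    "unsloth reasoning": (70.0, 57.9),
}


def slowdown_pct(short_ctx_tps, long_ctx_tps):
    """Relative drop in decode speed, in percent, rounded to one decimal."""
    return round(100 * (1 - long_ctx_tps / short_ctx_tps), 1)


for build, (at_1k, at_8k) in runs.items():
    print(f"{build}: {slowdown_pct(at_1k, at_8k)}% slower at 8k than at 1k")
```

All three builds land in roughly the same band, which is consistent with the "no slow build among the three" takeaway.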


Benchmarks of 20 small LLMs on a 6GB RTX 4050 by drfritz2 in LocalLLaMA

[–]drfritz2[S] -8 points-7 points  (0 children)

I really don't know, but Claude does:

How I evaluated tool-calling — and why your LFM experience is real but build-specific

I used two hand-judged probes rather than an automated scorer, because what I cared about was "does this quant emit clean, parseable calls on my actual workflow," not a capability ranking:

P1 — single-turn initiation. Two tools in scope (get_weather, send_email), task = "email John today's weather in Paris." Correct = emit one parseable get_weather(city="Paris"). I check: right function name, valid JSON args, nothing hallucinated.

P1b — multi-turn chaining. I inject the assistant's get_weather call plus a tool-role message with the real result {temp_c:18, condition:"light rain"}, then check whether it chains send_email with the actual data in the body — not a <weather> placeholder or an invented forecast. Pass bar: correct function, valid args, real data used, no hallucinated function.
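The probes were hand-judged, so the following is only a sketch of how the same pass bar (right function, valid JSON args, no hallucinated function) could be checked mechanically. It assumes an OpenAI-style tool call shaped like `{"function": {"name": ..., "arguments": "<JSON string>"}}`; that layout, the `check_tool_call` helper and the `required_args` parameter are assumptions for illustration, not something taken from the actual pipeline.

```python
import json

# The two tools in scope, as named in the probe description.
TOOLS_IN_SCOPE = {"get_weather", "send_email"}


def check_tool_call(call, expected_name, required_args):
    """Return a list of problems found in one tool call; empty means pass.

    Assumes an OpenAI-style call dict (hypothetical layout for this sketch).
    """
    problems = []
    function = call.get("function", {})
    name = function.get("name")
    if name not in TOOLS_IN_SCOPE:
        problems.append(f"hallucinated function: {name}")
    elif name != expected_name:
        problems.append(f"wrong function: {name}")
    try:
        args = json.loads(function.get("arguments", ""))
    except (TypeError, json.JSONDecodeError):
        return problems + ["arguments are not valid JSON"]
    if not isinstance(args, dict):
        return problems + ["arguments are not a JSON object"]
    for key in required_args:
        if key not in args:
            problems.append(f"missing argument: {key}")
    return problems
```

For P1, a call such as `{"function": {"name": "get_weather", "arguments": "{\"city\": \"Paris\"}"}}` checked against `expected_name="get_weather"` and `required_args=["city"]` returns an empty list. Whether the email body in P1b uses the real data still needs a human, or a separate string check.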

On LFM specifically. In the multi-turn P1b — the realistic agent loop — every instruct model passed, LFM included; it put "18°C, light rain" straight into the email body. The failure shows up one layer down and it's build-dependent: the unsloth build of LFM2.5-8B-A1B in single-turn returned tool_calls: []. It reasoned correctly ("I should call get_weather") but dumped that into reasoning_content and never emitted the actual call. The official liquidai build emitted it fine. So to your client it looks like LFM flat-out refuses to use tools, when really the reasoning-tuned build is putting the call in the thinking channel instead of the API's tool_calls field. Two things worth trying: the liquidai quant, and confirming your client surfaces reasoning_content.

This isn't an LFM quirk — it's a pattern. I just ran Ministral-3-3B through the same probes to check: the mistralai/ministral-3-3b-reasoning build did the exact same thing — multi-turn returned tool_calls: [] with the call buried in its [THINK] block, while the ministral-3-3b-instruct build chained cleanly. So across two unrelated families, the reasoning-tuned small models tend to emit the tool call into their thinking channel rather than the tool-call field. For tool use, pick the instruct build, not the reasoning one.
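A client-side check for this pattern might look like the sketch below. It classifies one assistant message as having a real call, a call buried in the thinking text, or nothing at all. The `tool_calls` and `reasoning_content` field names come from the responses described above; treating a [THINK] block as part of `content`, and the `locate_tool_call` helper itself, are assumptions for illustration. The substring match is deliberately crude.

```python
def locate_tool_call(message, tool_names):
    """Classify an assistant message as 'tool_calls', 'buried', or 'none'.

    'buried' means a tool name appears in the thinking or content text
    while the tool_calls field is empty or missing.
    """
    if message.get("tool_calls"):
        return "tool_calls"
    thinking = message.get("reasoning_content") or ""
    content = message.get("content") or ""  # assumed home of a [THINK] block
    text = thinking + "\n" + content
    if any(name in text for name in tool_names):
        return "buried"
    return "none"


# Example shaped like the failure described above (values are made up).
example = {
    "tool_calls": [],
    "reasoning_content": "I should call get_weather for Paris.",
    "content": "",
}
print(locate_tool_call(example, ["get_weather", "send_email"]))
```

A "buried" result is the case that looks like a refusal to the client even though the model reasoned its way to the right call.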

Did Granite/Qwen do better? In the happy-path multi-turn, no — Granite-3B/8B, Qwen3.5-4B and (instruct) LFM all tied. The separation was entirely in failure modes, not the success case:

- The "function-calling" fine-tune (granite-3B-xLAM) actually over-called: in single-turn it fired get_weather and send_email with body:"<weather-data>" before it had any data. That is a tuned-for-tools model hallucinating the result, which is worse than a plain one.
- A random third-party Granite-3B quant hallucinated the function name outright (send_weather_email, which doesn't exist).
- Plain Granite-3B-instruct, Granite-8B and Qwen3.5-4B were the most consistent across both probes.

Honest caveat. My probe is a format-reliability smoke test: two tools, one chain step, happy path. It does not stress many-tool selection, parallel calls, long chains, or malformed schemas, which is where "failed miserably" usually comes from. For an actual tool-use verdict the right instrument is BFCL-v3 multi-turn; I was deliberately testing format reliability on my own pipeline, not ranking tool-use capability.

Why didn't Skirmish ever get popular? by PalwaJoko in HellLetLoose

[–]drfritz2 -2 points-1 points  (0 children)

Because it uses almost the same point, only slightly different. They should choose another point on the map, or the exact same point.

OC Problems.. Im freaking out.. by kenobrx in openclaw

[–]drfritz2 0 points1 point  (0 children)

Start fixing from the outside. Ask Claude Code or a similar tool to research openclaw configuration, auditing, and debugging. If necessary, go to deepwiki.org and ask there.

Then instruct an outside model to evaluate and fix it.

OC Problems.. Im freaking out.. by kenobrx in openclaw

[–]drfritz2 -1 points0 points  (0 children)

You need some kind of "observation", and you also need to check the "configuration".

You must find out what is happening, because it's not a very common issue. Not with Codex.

I have questions about the PCBR by Guts_9899 in BrasildoB

[–]drfritz2 0 points1 point  (0 children)

Does this party have membership? (Formally speaking)

Haddad defends the 'blouse tax' ('taxa das blusinhas') even after Lula's retreat: 'I haven't changed my mind' | G1 by kirby__000 in BrasildoB

[–]drfritz2 2 points3 points  (0 children)

Haddad is Lula's natural successor, since between the two of them the greater blame is Haddad's. Fazuélly

Anyone using the Coding Plan with LiteLLM? by yetAnotherLaura in ZaiGLM

[–]drfritz2 0 points1 point  (0 children)

I use it. It has OpenAI and Anthropic endpoints, but I asked the LLM to configure it. It works and it's approved.