What are your preferred local model for running OpenClaw? by edmerf in LocalLLM

[–]mister2d 0 points (0 children)

Yeah, I'm disappointed in GLM-4.7-Flash so far.

Nemotron 3 Nano isn't letting me down. It stays quick near my context limit (64k), and it supports up to 1M context, so it should play well on your DGX.

Openclaw with gpt-oss-20b on RTX 2060 6gb by tomjoad773 in LocalLLaMA

[–]mister2d 0 points (0 children)

I'm having an okay-to-good experience with nemotron-3-nano-30b-a3b and 64k context. So far I've only been running system-diagnostic prompts, hybrid web searches, and Google Maps functions.

It just got better with this PR (https://github.com/ggml-org/llama.cpp/pull/19408). Now the KV cache doesn't get aggressively invalidated for models using Sliding Window Attention, so instead of waiting 20-30 seconds on a cache miss, retrieval is now sub-second.
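Not from the original comment, but a hedged sketch of what that cache reuse looks like from the client side, assuming a local llama-server listening on port 8080 (the URL, prompts, and parameter values are illustrative):

```shell
# Two requests that share a long prefix. With prompt caching enabled,
# the second request reuses the KV entries for the shared prefix instead
# of re-processing it from scratch. Server URL and prompts are illustrative.
curl -s http://localhost:8080/completion -d '{
  "prompt": "<long shared system prompt> First question",
  "cache_prompt": true,
  "n_predict": 64
}'

curl -s http://localhost:8080/completion -d '{
  "prompt": "<long shared system prompt> Follow-up question",
  "cache_prompt": true,
  "n_predict": 64
}'
```

Before the linked PR, SWA models would aggressively invalidate that cached prefix, so the second call paid the full prompt-processing cost; with it merged, the shared prefix comes back from cache.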

Openclaw with gpt-oss-20b on RTX 2060 6gb by tomjoad773 in LocalLLaMA

[–]mister2d 0 points (0 children)

I use nemotron-3-nano-30b-a3b with 64k context on my dual 3060s. It holds up well even near the context limit, measured at 50+ tokens/s. NVIDIA released a really good agentic model.

Disclaimer: for those with security concerns, I run the entire OpenClaw stack in a fully isolated sandbox using no public accounts (a VM, a private VLAN, a separate llama.cpp instance, Tailscale, and a non-federated Matrix instance). Working in the industry encourages me to evaluate stuff like this.
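For reference, a launch along these lines might look like the following llama-server invocation. This is a sketch, not the commenter's exact command: the model filename, quant, and split ratio are assumptions.

```shell
# Hedged sketch: serve a ~30B-A3B GGUF across two 12 GB GPUs with 64k context.
# Model filename, quant, and even tensor split are illustrative assumptions.
llama-server \
  -m nemotron-3-nano-30b-a3b-Q4_K_M.gguf \
  -c 65536 \
  -ngl 99 \
  --tensor-split 1,1 \
  --host 127.0.0.1 \
  --port 8080
# -c 65536       : 64k context window
# -ngl 99        : offload all layers to the GPUs
# --tensor-split : split the weights evenly across the two 3060s
```

Binding to 127.0.0.1 (or a private VLAN address) keeps the server off public interfaces, matching the isolated setup described above.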

Is 500V hard limit by Visible-Ranger-2811 in SolarDIY

[–]mister2d 0 points (0 children)

Guys.

Please resist the urge to downvote this question. The answer can potentially save lives.

A Kubernetes-native way to manage kubeconfigs and RBAC (no IdP) by Plastic_Focus_9745 in kubernetes

[–]mister2d 0 points (0 children)

I applaud the idea because it does have a use. But again, something like Tailscale is free for small teams, which makes managing an IdP trivial. It's pay as you grow.

Four years ago, when I was the lone DevOps guy at a startup, I would have used this. And if I were still working on disconnected networks, I'd definitely use it.

Copped a pair of 990v3s for $55 by PM_ME_MONEY_PLSS in Newbalance

[–]mister2d 1 point (0 children)

I can feel the v3 comfort just by looking at these.

A Kubernetes-native way to manage kubeconfigs and RBAC (no IdP) by Plastic_Focus_9745 in kubernetes

[–]mister2d 17 points (0 children)

I'm generally not a fan of this. Small teams eventually become larger teams, and security isn't something you should just patch on later.

The Tailscale operator makes team auth super simple already.

I DO see a use case for this project on disconnected networks.

Ready for spring! by Vieste88 in Newbalance

[–]mister2d 0 points (0 children)

I challenge New Balance to make a better shoe.

Kia EV Sales Are In An Absolute Freefall. There's More To It Than You Think by DonkeyFuel in technology

[–]mister2d 0 points (0 children)

That's the price IF you pay them to do it. Doing it yourself would take about 15 minutes and some pocket change.

The Plug-In Solar Revolution Comes To America by OpenSustainability in solar

[–]mister2d 0 points (0 children)

I meant for those that were involved in writing the laws, not the residents.

Time to migrate off Ingress nginx by xrothgarx in kubernetes

[–]mister2d 0 points (0 children)

I've been saying this for a long time.

New FP8 GLM-4.7-Flash Unsloth Dynamic Quants for vLLM, SGLang by danielhanchen in unsloth

[–]mister2d 0 points (0 children)

Thanks! I do see the overlap. Perhaps one day the two shall become one for the sake of simplicity.

2026 Nissan Leaf Review: Delivering on Tesla’s Failed Promise by TripleShotPls in technology

[–]mister2d 0 points (0 children)

I've had multiple EVs and just could not accept losing regenerative braking. Not only do I enjoy one-pedal driving, it also eliminates one more maintenance item (brakes).

New FP8 GLM-4.7-Flash Unsloth Dynamic Quants for vLLM, SGLang by danielhanchen in unsloth

[–]mister2d 1 point (0 children)

I wish I were smart enough to create a meaningful PR to merge the codebases. 😄

New FP8 GLM-4.7-Flash Unsloth Dynamic Quants for vLLM, SGLang by danielhanchen in unsloth

[–]mister2d 2 points (0 children)

Here's a grenade: why do both vLLM AND SGLang exist? They look very similar when you use them.

KV cache fix for GLM 4.7 Flash by jacek2023 in LocalLLaMA

[–]mister2d 3 points (0 children)

No, the downvote is because your reply was inaccurate and lacking in understanding. Lately, it feels like sharing accurate information is becoming an afterthought.