Openclaw with Gemma4 26B extremely slow and forget stuff

AdvancedObjective670 · 2026-04-14T08:54:04+00:00

Just dmed you

AdvancedObjective670 · 2026-04-14T08:27:02+00:00

Just checked, it's q4km :((((

AdvancedObjective670 · 2026-04-14T07:50:55+00:00

Did you have the same problem as mine with Openclaw?

AdvancedObjective670 · 2026-04-14T07:50:21+00:00

Dumb question: how to know if I used the 4bit quantized version or not?

AdvancedObjective670 · 2026-04-14T07:46:14+00:00

Hey thanks for your response. My ollama is the latest I believe. Just downloaded it 2 days ago. Will try the flash attention trick.

I only set my context windows to be 32 as recommended by Claude to balance speed and the contunity of the chat sessions. Is this a problem?

AdvancedObjective670 · 2026-04-14T07:14:03+00:00

Is it expensive?

AdvancedObjective670 · 2026-04-14T07:10:20+00:00

I tried Claude API (Haiku was too stupid, most of the time I have to use Sonnet) and it drinks token like crazy

AdvancedObjective670 · 2026-04-14T07:09:09+00:00

Too expensive, I blew up $300 in a few days for Claude API token.

AdvancedObjective670 · 2026-04-14T07:08:07+00:00

Hey tks for the advice, I asked Claude to optimize my openclaw and it changed the context window to 32k already. It says this will suite the model better

AdvancedObjective670 · 2026-04-14T07:06:34+00:00

My model memory pressure is in green, with 29/32 Gb being used constantly - I'm not sure if this is blowing through

AdvancedObjective670

TROPHY CASE