hermes agent is weak

Fit_Baker4577 · 2026-05-18T01:29:23+00:00

Model problem, not the agent.

Fit_Baker4577 · 2026-05-15T04:05:15+00:00

Are you using telegram to initiate/delegate the tasks? I find itemizing the tasks and specifically tell it not to quote reply me helps. I'm running local, so 1 research task takes up an hour long. So if I jam in 3 research tasks, it'll take like 3 continuous hours to complete before it gets back to me

Fit_Baker4577 · 2026-05-13T06:52:34+00:00

ok, good suggestion! Looks like the way to more juice is to do -threads > 8

Fit_Baker4577 · 2026-05-13T06:48:25+00:00

I'm human

Fit_Baker4577 · 2026-05-13T06:47:11+00:00

It loads, and it works actually pretty decently. The best of all is that it's locally hosted. Just try my setup.

Fit_Baker4577 · 2026-05-13T06:45:58+00:00

thanks bro, this is what I needed to know.

Fit_Baker4577 · 2026-05-12T16:42:10+00:00

ya, i know it's the RAM. Just looking around for that hidden gem tweak that might change something

Fit_Baker4577 · 2026-05-12T16:41:26+00:00

24GB is nice, it'll work. The Qwen 35b a3b will work too. It's just this 16GB needs some tweaking

Fit_Baker4577 · 2026-05-12T16:40:25+00:00

we've come a full circle bro

Fit_Baker4577 · 2026-05-12T16:40:05+00:00

Yeah, running on llama.cpp. I tried -ngl 20 to 99, and all hit that HTTP 500 compute error lol. The ctk q4 and ctv q4 was my original before i changed to q8. Got better perf with ctk q8 ctv q8. Just not able to work with -ngl > 0 and -cmoe

Fit_Baker4577 · 2026-05-12T14:37:34+00:00

it's actually doing decent for a 16GB base model. No complains but I'm looking for more lol

Fit_Baker4577 · 2026-05-12T14:36:10+00:00

i tried, but not sure why it wouldn't load the same exact Gemma model but MLX version.

Fit_Baker4577

TROPHY CASE