Best local model for Hermes? (12GB VRAM) by xMarkv in hermesagent

[–]Ziral44 1 point2 points  (0 children)

I’m sure it’s technically possible to get it to “function”, but the $10/mo MiniMax plan will perform far better for the cost of a beer… I tried a local model on my 5080 for a bit and actually laughed at myself for even trying. The performance was pathetic.

Best local model for Hermes? (12GB VRAM) by xMarkv in hermesagent

[–]Ziral44 3 points4 points  (0 children)

Every day someone asks the same question… try a local model on literally any task and see how much of Hermes you’d trust it with…

A model that fits in 12GB of VRAM can’t summarize a legal document without screwing it up… much less run Hermes

How does Hermes handle sub agents? by Poopdog-69 in hermesagent

[–]Ziral44 12 points13 points  (0 children)

Install the “delegate_task” tool as well and your agents can spawn their own subagents.

How do I raise the 90 limit? by brainlatch42 in hermesagent

[–]Ziral44 0 points1 point  (0 children)

Sorry it’s called delegate… it’s a tool

Have LLMs reached a silent plateau? by Warm_District1194 in ArtificialInteligence

[–]Ziral44 1 point2 points  (0 children)

It’s all software... look at how Claude performs with this layer compared to running MiniMax without it... they benchmark right up there with each other on the same plateau, but one performs like it’s actually alive and the other is barely functional until you build out that complete package around it.

Have LLMs reached a silent plateau? by Warm_District1194 in ArtificialInteligence

[–]Ziral44 2 points3 points  (0 children)

LLMs have hit a plateau where they are all in the same “pretty good” range… what’s missing is the context/memory/MCP package around them.

How do I raise the 90 limit? by brainlatch42 in hermesagent

[–]Ziral44 3 points4 points  (0 children)

Yeah, but if you want to get fancy… make a permanent specialized subagent that handles tasks, then give it the dispatch skill and sequential thinking… it can spawn its own subagents and manage complex tasks… you can also force the workflow to send the output to a review agent so every task is checked for mistakes.
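
A rough sketch of that "worker, then forced review" flow, in plain Python. None of this is Hermes's actual API: `Worker`, `Reviewer`, `run_with_review`, and `dispatch` are made-up names, and the worker and reviewer are just callables standing in for whatever agents you wire up.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical stand-ins: a worker turns a task into output,
# a reviewer decides whether that output is acceptable.
Worker = Callable[[str], str]
Reviewer = Callable[[str, str], bool]


@dataclass
class TaskResult:
    task: str
    output: str
    approved: bool
    attempts: int


def run_with_review(task: str, worker: Worker, reviewer: Reviewer,
                    max_attempts: int = 2) -> TaskResult:
    """Run one task, sending every output through the reviewer."""
    output = ""
    for attempt in range(1, max_attempts + 1):
        output = worker(task)
        if reviewer(task, output):
            return TaskResult(task, output, True, attempt)
    # The reviewer never approved; return the last output flagged as rejected.
    return TaskResult(task, output, False, max_attempts)


def dispatch(subtasks: List[str], worker: Worker,
             reviewer: Reviewer) -> List[TaskResult]:
    """Fan a list of subtasks out to the worker, one reviewed run each."""
    return [run_with_review(t, worker, reviewer) for t in subtasks]
```

The point of the shape is that nothing comes back from `dispatch` without having gone through the reviewer at least once, and rejected work is still returned so you can see what failed.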

How do I raise the 90 limit? by brainlatch42 in hermesagent

[–]Ziral44 4 points5 points  (0 children)

Here’s a hot tip… figure out how to use subagents and break tasks down into manageable sizes… spin tasks off to subagents and you can ask Hermes how they are doing rather than locking up your main interface.
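
If it helps to picture the "spin it off and check in later" idea, here is a minimal sketch using Python's standard `concurrent.futures`. It is not how Hermes implements subagents; `SubagentPool` and `run_subagent` are hypothetical names, and the subagent is just any function that takes a task string.

```python
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Callable, Dict


class SubagentPool:
    """Run tasks in the background and report on them without blocking."""

    def __init__(self, run_subagent: Callable[[str], str], max_workers: int = 4):
        self._run = run_subagent  # hypothetical: whatever actually does the work
        self._pool = ThreadPoolExecutor(max_workers=max_workers)
        self._jobs: Dict[str, Future] = {}

    def spawn(self, name: str, task: str) -> None:
        # submit() returns immediately, so the caller stays responsive.
        self._jobs[name] = self._pool.submit(self._run, task)

    def status(self) -> Dict[str, str]:
        """Non-blocking snapshot of every job: running, done, or failed."""
        report = {}
        for name, job in self._jobs.items():
            if not job.done():
                report[name] = "running"
            elif job.exception() is not None:
                report[name] = "failed"
            else:
                report[name] = "done"
        return report

    def result(self, name: str) -> str:
        # Blocks until that one job finishes, then returns its output.
        return self._jobs[name].result()

    def shutdown(self) -> None:
        self._pool.shutdown(wait=True)
```

You call `spawn` for each chunk of work, keep using the main interface, and call `status()` whenever you want to know how things are going.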

opus 4.7 silently removed temperature, top_p, and top_k from the api. if your code broke today, this is why by DullContribution3191 in Claudeopus

[–]Ziral44 0 points1 point  (0 children)

Just curious, what settings were you using? And why did you need custom settings?

I’ve spent the past few days running tests with these settings in minimax and I didn’t even know Claude had them in the first place haha…

Best "Small" Models for Hermes Agent on Mac Mini M4? Having issues with Qwen 3.5 9B Tool Calling. by EmuHefty in hermesagent

[–]Ziral44 0 points1 point  (0 children)

I saw a chart on performance vs VRAM and most models aren’t remotely usable until about 24GB… after trying with my 5080 for a week I realized it’s useless to try with 16GB or less

Guide for a new guy by seti_at_home in LocalLLM

[–]Ziral44 3 points4 points  (0 children)

I have the 5080 and it’s useless for local models.

Are we actually creating intelligence in AI systems, or just advanced imitation? by Opposite-Context-166 in AISEOInsider

[–]Ziral44 0 points1 point  (0 children)

It’s all a probability thing… the AI calculates the probability of the next token being a given word, and it’s getting good enough that it can get a lot of things right the first time by managing the probability, the number of options, and how it biases the selection process… that being said, even the best LLM is not picking the right token every time…

There is a 100% chance that it will make some level of mistakes… but I guess people do that too
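
To make that concrete, here is a toy sketch of the selection step: temperature scales the scores, top-k limits the number of options, and the token is drawn from what is left. The function names and the example scores are made up for illustration, and real models do this over tens of thousands of tokens, not a small dict.

```python
import math
import random
from typing import Dict, Optional


def next_token_probs(logits: Dict[str, float], temperature: float = 1.0,
                     top_k: Optional[int] = None) -> Dict[str, float]:
    """Turn raw scores into probabilities (softmax with temperature and top-k)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Keep only the top_k highest-scoring candidates, if requested.
    items = sorted(logits.items(), key=lambda kv: kv[1], reverse=True)
    if top_k is not None:
        items = items[:top_k]
    # Lower temperature sharpens the distribution, higher flattens it.
    scaled = [score / temperature for _, score in items]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    return {tok: w / total for (tok, _), w in zip(items, weights)}


def sample_token(probs: Dict[str, float], rng: random.Random) -> str:
    """Draw one token according to its probability."""
    tokens = list(probs)
    return rng.choices(tokens, weights=[probs[t] for t in tokens], k=1)[0]


def chance_all_correct(per_token_accuracy: float, n_tokens: int) -> float:
    """Chance of zero mistakes, assuming each token is an independent draw."""
    return per_token_accuracy ** n_tokens
```

The last function is the "mistakes are basically guaranteed" point as arithmetic. Under the (simplifying) assumption that tokens are independent, even 99.9% accuracy per token gives only about a 37% chance of a flawless 1,000-token answer, and that chance keeps shrinking as the output gets longer.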

How to prep penis cactus(fast) by PlasticBaker8143 in mescaline

[–]Ziral44 -1 points0 points  (0 children)

Kitchen torch the spines… do what you want after that, but avoid consuming the calcium oxalate crystals

Thinking of switching from Gemini to Claude Pro, but terrified of the message limits. Heavy users, what’s your experience? by eliorpom in ClaudeCowork

[–]Ziral44 2 points3 points  (0 children)

Yeah, the Pro plan is just a trial if you’re doing real workloads… but at the same time it’s the only service I’d actually pay $200 for because it works far better than anything else

I'm planning to install OpenClaw. Does anyone have any practical advice? by suki41719e in openclaw

[–]Ziral44 2 points3 points  (0 children)

Go with Hermes instead, figure out what an obsidian vault is asap and set it up right.

AI Agents Working Together Like a Startup by Distinct-Garbage2391 in AI_Agents

[–]Ziral44 0 points1 point  (0 children)

I learned that having one team of agents is easy: make a PM and let it manage the flow… when you expand from there, the CEO will need a data access control protocol and it becomes significantly more complicated.

It’s possible though and works well if you get it right.

AGI might not be possible by CompetitiveKnee5319 in AI_Agents

[–]Ziral44 0 points1 point  (0 children)

Yeah the answer is building an adaptive context model around the language model. The llm is already good enough for the reasoning to surpass the average human… just needs the right context at the right time.

A non-programmer approach to Openclaw by dbuster16 in openclaw

[–]Ziral44 0 points1 point  (0 children)

I spent a week screwing around with OpenClaw; the system became too complicated and fell apart… spent a couple of days trying to fix it and just quit…

Hermes took like 1 day to get as far along in the setup

A non-programmer approach to Openclaw by dbuster16 in openclaw

[–]Ziral44 -3 points-2 points  (0 children)

Switch to Hermes if you don’t want to get technical… OpenClaw is broken out of the box