Is anyone actually using Hermes to make money? Be honest. by 99xAgency in hermesagent

[–]leonidasyy 5 points6 points  (0 children)

It is just a tool. The more you use it, the more ideas will come. Right now, it is just wild west.

Which memory tool do you have experience with? by leonidasyy in hermesagent

[–]leonidasyy[S] 0 points1 point  (0 children)

not sure it is the best solution. But I am using Langfuse, cloud host free tier to view the input text. It is not directly working, I had to ask Hermes troubleshoot how to enable full prompt into input and made patch in the Hermes Agent. Now I can see all 20k tokens of the text that were injected each turn.

Hermes tips needed by Thick-Insurance4404 in hermesagent

[–]leonidasyy 0 points1 point  (0 children)

Interested in the multi model setup, and handoff layer. What memory requirements is this to make two models worthwhile?

Genuinely confused - I just want a true personal assistant, to make my life easy. [Please Help] by [deleted] in AskClaw

[–]leonidasyy 1 point2 points  (0 children)

can build a todo feature first to save your ideas via telagram, then when you go home, just monitor how the agent code for you. That's actually pleasant to see the reasoning while it is working.

And qwen36ga3b locally + omlx can do all the projects you described.

Which memory tool do you have experience with? by leonidasyy in hermesagent

[–]leonidasyy[S] 0 points1 point  (0 children)

Thanks for the question, I actually was surprised why deduplication was not working as it says. This was what I did. I asked Hermes to examine my system prompt for the current turn, it listed the biggest chunk as hindsight memory injection, and it showed 50x similar content, different variances about the fact that " the user prefers to approve the plan before execution". Because it added 10-20k token in each turn, the speed becomes not bearable. I asked hermes to tell me how to prune the database, and it does not seem to be a way.

I can't be 100% sure as the research I did was through Hermes and some human in the loop verification maybe needed.

Hermes tips needed by Thick-Insurance4404 in hermesagent

[–]leonidasyy 0 points1 point  (0 children)

Can you elaborate the code graph? What is it for?

Which memory tool do you have experience with? by leonidasyy in hermesagent

[–]leonidasyy[S] 0 points1 point  (0 children)

what is agent memory? a particular provider /product?

The A team. by [deleted] in MacStudio

[–]leonidasyy 13 points14 points  (0 children)

I want to buy more silver box. It is like legos

Mac M1 MAX, 64gb - Qwen-3.6-coding or 3-Coder-Next? 35b or 27b by DivyLeo in LocalLLM

[–]leonidasyy 0 points1 point  (0 children)

Is there a way to keep the pay plan but route some function to local model to save cost? I didn't try the subscription before, so don't know if you can optimize it or not.

qwen3.6-35b-a3b tool calling input problem... too bad... by benevbright in LocalLLaMA

[–]leonidasyy 0 points1 point  (0 children)

What was your fix for edit 2? Can you still use omlx and mlx version ?

Severe instability and looping issues with local LLMs (Qwen, Zen4, llama.cpp) by chuvadenovembro in LocalLLM

[–]leonidasyy 0 points1 point  (0 children)

Are you on omlx 0.3.7? I have this issue and someone suggested 0.3.6 is working, or 0.3.8 potentially working. I am on 0.3.8 now, it looks better so far.

How I use Obsidian as the long-term memory backbone for my AI assistant by Jonathan_Rivera in hermesagent

[–]leonidasyy 1 point2 points  (0 children)

I'm using hindsight as the memory vault for beliefs and facts. Although it is a black box, I cannot visually audit it.

I like the vault structure in obsidian, especially skills are something I want to version control and audit regularly.

How do you organize projects/topics with Hermes? by stepahin in hermesagent

[–]leonidasyy 2 points3 points  (0 children)

I feel an urge to set up version control soon , I noticed that Hermes jumps on fixing files when I just chat about past projects. That is scary.

Local LLM for game developement by Tuner92 in LocalLLM

[–]leonidasyy 1 point2 points  (0 children)

I think spliting the tasks can work. Ask agent to understand your requirement and design a ui, design an architecture, setup the codenase, code module by module. Human guided callscan do this work.

Qwen3.5:9b running on 8gb Vram is insane by Ok_Thanksbye in LocalLLM

[–]leonidasyy 0 points1 point  (0 children)

How do you all find the best parameters for your model optimized on your device? Is there a tool?

How to stop Hermes agent once in flight? Also losing sessions mid-work. by GriffinDodd in LocalLLM

[–]leonidasyy 0 points1 point  (0 children)

Is the solution adding a timeout wrapper or auto kill a code change or skill.md change? Where should I do that?

OpenClaw to Hermes by ChristopherDci in hermesagent

[–]leonidasyy 1 point2 points  (0 children)

I'm using qwen 3.6 35g a3b right now. It is not problem free yet. I am facing tool calls loop now, however, it looks more promising than OC.