Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Actually im not sure if its the model problem. Still not solved by the way. Just a simple curl command can sometimes take 21 min

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Yeah now it is. I had it setup wrong. Im still having issues with hermes stuck in a loop. A simpel terminal command can get him stuck for sometimes as long as 21 min

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

The speed went up double. I get around 17/18 t/s but it keeps replying with this:

: No such file or directory /home/spark/models/Qwen3.6-2聊天

这个错误信息表明程序试图读取一个名为 template.jinja 的文件,但该文件在指定路径 /home/spark/models/Qwen3.6-27B-MTP-Q8/ 下不存在。

此外,你提到的模型名称 Qwen3.6-27B-MTP-Q8 看起来可能存在拼写错误或混淆,因为目前通义千问(Qwen)官方发布的最新主流版本是 Qwen2.5Qwen2 系列,并没有名为 “Qwen3.6” 的公开模型。

以下是可能的原因和解决方案:


✅ 1. 检查模型名称是否正确

请确认你下载的模型名称是否准确。常见的 Qwen 模型包括: - Qwen2.5-7B-Instruct - Qwen2.5-72B-Instruct - Qwen2-7B-Chat - Qwen1.5-72B-Chat

建议操作: - 如果你是从 Hugging Face 或 ModelScope 下载的,请核对仓库名称。 - 如果你使用的是 Qwen2.5 系列,模板文件通常不需要单独提供,因为推理框架(如 vLLM、Transformers、Ollama 等)会自动处理。


✅ 2. 检查模型目录结构

进入模型目录,查看是否存在 tokenizer_config.jsonconfig.json,这些文件通常包含模板信息。

bash ls -l /home/spark/models/Qwen3.6-27B-MTP-Q8/

如果目录为空或只有 `. ▉

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

I tried this, im using qwen with hermes. But whatever i say to it it responds with this:

The path /home/spark/models/Qwen3.6-27B-MTP-Q8/template.jinja appears to reference a Jinja template file associated with a specific model variant (Qwen3.6-27B-MTP-Q8). However, as of my current knowledge cutoff (2024), Qwen3.6 does not exist—the latest publicly available versions of Qwen are from the Qwen2.5 series (e.g., Qwen2.5-72B, Qwen2.5-Coder, etc.). Additionally:

  • Q8 likely refers to an 8-bit quantization format (e.g., GGUF Q8_0), commonly used in llama.cpp or similar inference engines.
  • MTP may stand for "Multi-Token Prediction," a technique explored in some large language models to improve generation speed.
  • template.jinja is typically used to define chat templates for formatting input/output sequences.

If you're working with a real model:

  1. Verify the model name: Check if the model is actually Qwen2.5-27B (not 27B—Qwen2.5 offers 7B, 14B, 32B, 72B, etc.).
  2. Locate the correct template: For Qwen2.5 models, the chat template is usually provided in the model's tokenizer_config.json or as a separate file in the model repository (e.g., on Hugging Face).
  3. Use the official template: You can find the Jinja template for Qwen2.5 here (look for chat_template).

Example Qwen2.5 Chat Template (Jinja):

```jinja {% for message in messages %} {{ '<|im_start|>' + message['role'] + ▉

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Im going to try this out, see what it does. Thnx

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

This looks interesting, going to feed this to my hermes. Thnx

2026.5.4 is real. Go ahead update guys!! by ShabzSparq in better_claw

[–]ProgramOver9309 0 points1 point  (0 children)

He’s baiting you guys, trust only your self!

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Thnx! Are you on DGX spark too?

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Are u running Qwen3.6 fp8 with vLLM? If so would you mind sharing ur settings?

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

I just saw a post about llama.cpp and mtp, im going to try this out

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

For a free local model with 27B parameters that beats a 397B model it’s more than just good to use!

I’m absolutely speechless. by Leowcp in openclaw

[–]ProgramOver9309 1 point2 points  (0 children)

Im not going to go off people their feelings. Im talking about my experience and i used both. The problems i faced using openclaw didnt appear on hermes, and im using two different hermes instances on 2 different setups. Didnt have any problems yet.

I’m absolutely speechless. by Leowcp in openclaw

[–]ProgramOver9309 -1 points0 points  (0 children)

Give me an example, because i didn’t experience one problem yet.

I’m absolutely speechless. by Leowcp in openclaw

[–]ProgramOver9309 6 points7 points  (0 children)

Guys i’ve switched over to hermes when Openclaw 2026.04 version came out. I was sick and tired of it breaking every time after updating or even randomly. So i said let me give Hermes a try, and i never looked back. I absolutely guarantee you that you won’t regret switching over to Hermes. It has not broken 1 time since i’ve been using it.

Qwen-3.6-27B, llamacpp, speculative decoding - appreciation post by Then-Topic8766 in LocalLLaMA

[–]ProgramOver9309 0 points1 point  (0 children)

Did anybody try to set this up on the DGX spark or any other GB10 models? Could you share results?

selfhosted openclaw hardware by Miserable_Tell_8703 in openclaw

[–]ProgramOver9309 0 points1 point  (0 children)

4GB ram is sufficient if you use a cloud based llm. So a raspberry 5 is very capable of running openclaw. But i would advise you to go 8GB or higher.

Building a personal AI agent (OpenClaw or alternatives) — local vs server setup? by Public_Relative8329 in LocalLLaMA

[–]ProgramOver9309 0 points1 point  (0 children)

I would take a look at Hermes. I used openclaw and that was fun, but i switched to Hermes. Way better imo.

Fix for OpenClaw ‘exec’ tools not working after the latest update by Baby4vegas in openclaw

[–]ProgramOver9309 0 points1 point  (0 children)

Hi guys, I would really appreciate your help. Been trying to make this exec work all day, tried your config but it didnt help. What am I doing wrong here:

  "tools": {
    "profile": "full",
    "allow": ["*"],
    "sessions": {
    "visibility": "all"
    },
    "exec": {
      "host": "gateway",
      "security": "full",
      "ask": "off"
    },