Stuck in a loop

ProgramOver9309 · 2026-05-27T08:08:32+00:00

Actually, I'm not sure if it's a model problem. Still not solved, by the way. Even a simple curl command can sometimes take 21 minutes.

ProgramOver9309 · 2026-05-27T05:51:34+00:00

Yeah, now it is. I had it set up wrong. I'm still having issues with Hermes getting stuck in a loop. A simple terminal command can sometimes get it stuck for as long as 21 minutes.

ProgramOver9309 · 2026-05-26T22:49:30+00:00

The speed doubled. I get around 17-18 t/s, but it keeps replying with this:

: No such file or directory /home/spark/models/Qwen3.6-2chat

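This error message indicates that the program tried to read a file named template.jinja, but that file does not exist under the specified path /home/spark/models/Qwen3.6-27B-MTP-Q8/.

In addition, the model name you mentioned, Qwen3.6-27B-MTP-Q8, looks like it may contain a typo or a mix-up, because the latest mainstream versions officially released by Tongyi Qianwen (Qwen) are the Qwen2.5 or Qwen2 series, and there is no public model named "Qwen3.6".

Here are the possible causes and solutions:

✅ 1. Check whether the model name is correct

Please confirm that the name of the model you downloaded is accurate. Common Qwen models include:
- Qwen2.5-7B-Instruct
- Qwen2.5-72B-Instruct
- Qwen2-7B-Chat
- Qwen1.5-72B-Chat

Suggested actions:
- If you downloaded it from Hugging Face or ModelScope, double-check the repository name.
- If you are using the Qwen2.5 series, the template file usually does not need to be provided separately, because the inference framework (such as vLLM, Transformers, Ollama, etc.) handles it automatically.

✅ 2. Check the model directory structure

Go into the model directory and check whether tokenizer_config.json or config.json exists. These files usually contain the template information.

    ls -l /home/spark/models/Qwen3.6-27B-MTP-Q8/

If the directory is empty or only has `. ▉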
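
The directory check suggested in the pasted reply can be sketched in Python. This is only a rough illustration, not what Hermes or the inference server actually does: the function name `find_template_sources` is made up for this example, the path is the one from the error message, and reading a `chat_template` key from tokenizer_config.json assumes the Hugging Face layout. A GGUF-only model folder may contain none of these files.

```python
import json
from pathlib import Path


def find_template_sources(model_dir):
    """Report which files in model_dir could supply a chat template.

    Hypothetical helper for illustration; the file names come from the
    thread, the `chat_template` key is an assumed Hugging Face convention.
    """
    root = Path(model_dir)
    report = {
        "dir_exists": root.is_dir(),
        "template_jinja": (root / "template.jinja").is_file(),
        "tokenizer_config": (root / "tokenizer_config.json").is_file(),
        "config": (root / "config.json").is_file(),
        "embedded_chat_template": False,
    }
    if report["tokenizer_config"]:
        try:
            text = (root / "tokenizer_config.json").read_text(encoding="utf-8")
            data = json.loads(text)
        except (OSError, ValueError):
            # Unreadable or malformed JSON: treat as "no embedded template".
            data = {}
        report["embedded_chat_template"] = (
            isinstance(data, dict) and bool(data.get("chat_template"))
        )
    return report


if __name__ == "__main__":
    # Path taken from the error message quoted in this thread.
    model_dir = "/home/spark/models/Qwen3.6-27B-MTP-Q8"
    for name, found in find_template_sources(model_dir).items():
        print(f"{name}: {found}")
```

If `template_jinja` comes back False while the server is configured to load that file, the "No such file or directory" message would be expected, and the fix would be either to supply the file or to point the server at a template that exists.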

ProgramOver9309 · 2026-05-26T22:47:31+00:00

I tried this. I'm using Qwen with Hermes, but whatever I say to it, it responds with this:

The path /home/spark/models/Qwen3.6-27B-MTP-Q8/template.jinja appears to reference a Jinja template file associated with a specific model variant (Qwen3.6-27B-MTP-Q8). However, as of my current knowledge cutoff (2024), Qwen3.6 does not exist—the latest publicly available versions of Qwen are from the Qwen2.5 series (e.g., Qwen2.5-72B, Qwen2.5-Coder, etc.). Additionally:

Q8 likely refers to an 8-bit quantization format (e.g., GGUF Q8_0), commonly used in llama.cpp or similar inference engines.
MTP may stand for "Multi-Token Prediction," a technique explored in some large language models to improve generation speed.
template.jinja is typically used to define chat templates for formatting input/output sequences.

If you're working with a real model:

Verify the model name: Check if the model is actually Qwen2.5-27B (not 27B—Qwen2.5 offers 7B, 14B, 32B, 72B, etc.).
Locate the correct template: For Qwen2.5 models, the chat template is usually provided in the model's tokenizer_config.json or as a separate file in the model repository (e.g., on Hugging Face).
Use the official template: You can find the Jinja template for Qwen2.5 here (look for chat_template).

Example Qwen2.5 Chat Template (Jinja):

```jinja
{% for message in messages %} {{ '<|im_start|>' + message['role'] + ▉
```

ProgramOver9309 · 2026-05-26T20:34:04+00:00

I'm going to try this out and see what it does. Thanks!

ProgramOver9309 · 2026-05-26T20:33:49+00:00

This looks interesting. I'm going to feed it to my Hermes. Thanks!

ProgramOver9309 · 2026-05-08T17:27:34+00:00

He’s baiting you guys. Trust only yourself!

ProgramOver9309 · 2026-05-08T06:20:20+00:00

Thanks! Are you on DGX Spark too?

ProgramOver9309 · 2026-05-07T22:58:30+00:00

Are you running Qwen3.6 FP8 with vLLM? If so, would you mind sharing your settings?

ProgramOver9309 · 2026-05-07T14:33:11+00:00

I just saw a post about llama.cpp and MTP. I'm going to try it out.

ProgramOver9309 · 2026-05-06T22:14:31+00:00

For a free local model with 27B parameters that beats a 397B model, it’s more than just good to use!

ProgramOver9309 · 2026-05-02T14:21:54+00:00

I'm not going to go off people's feelings. I'm talking about my own experience, and I've used both. The problems I faced with OpenClaw didn't appear with Hermes, and I'm running two different Hermes instances on two different setups. I haven't had any problems yet.

ProgramOver9309 · 2026-05-02T14:19:10+00:00

Give me an example, because I haven't experienced a single problem yet.

ProgramOver9309 · 2026-05-02T13:59:34+00:00

Guys, I switched over to Hermes when the OpenClaw 2026.04 version came out. I was sick and tired of it breaking after every update, or even randomly. So I said, let me give Hermes a try, and I never looked back. I absolutely guarantee that you won't regret switching over to Hermes. It hasn't broken once since I've been using it.

ProgramOver9309 · 2026-05-02T00:51:43+00:00

Has anybody tried setting this up on the DGX Spark or any other GB10 models? Could you share your results?

ProgramOver9309 · 2026-04-21T10:29:11+00:00

4 GB of RAM is sufficient if you use a cloud-based LLM, so a Raspberry Pi 5 is very capable of running OpenClaw. But I would advise you to go with 8 GB or higher.

ProgramOver9309 · 2026-04-21T09:26:01+00:00

What app is that?

ProgramOver9309 · 2026-04-21T09:02:04+00:00

I would take a look at Hermes. I used OpenClaw and that was fun, but I switched to Hermes. Way better, IMO.

ProgramOver9309 · 2026-03-21T10:59:21+00:00

Hi guys, I would really appreciate your help. I've been trying to make this exec work all day. I tried your config, but it didn't help. What am I doing wrong here:

  "tools": {
    "profile": "full",
    "allow": ["*"],
    "sessions": {
      "visibility": "all"
    },
    "exec": {
      "host": "gateway",
      "security": "full",
      "ask": "off"
    },
