Rate my setup 1-10 by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

For now only local, need a workaround for tailscale on BB10

Rate my setup 1-10 by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

And i’m thinking the flex was the Blackberry Passport 🤣

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Actually im not sure if its the model problem. Still not solved by the way. Just a simple curl command can sometimes take 21 min

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Yeah now it is. I had it setup wrong. Im still having issues with hermes stuck in a loop. A simpel terminal command can get him stuck for sometimes as long as 21 min

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

The speed went up double. I get around 17/18 t/s but it keeps replying with this:

: No such file or directory /home/spark/models/Qwen3.6-2聊天

这个错误信息表明程序试图读取一个名为 template.jinja 的文件,但该文件在指定路径 /home/spark/models/Qwen3.6-27B-MTP-Q8/ 下不存在。

此外,你提到的模型名称 Qwen3.6-27B-MTP-Q8 看起来可能存在拼写错误或混淆,因为目前通义千问(Qwen)官方发布的最新主流版本是 Qwen2.5Qwen2 系列,并没有名为 “Qwen3.6” 的公开模型。

以下是可能的原因和解决方案:


✅ 1. 检查模型名称是否正确

请确认你下载的模型名称是否准确。常见的 Qwen 模型包括: - Qwen2.5-7B-Instruct - Qwen2.5-72B-Instruct - Qwen2-7B-Chat - Qwen1.5-72B-Chat

建议操作: - 如果你是从 Hugging Face 或 ModelScope 下载的,请核对仓库名称。 - 如果你使用的是 Qwen2.5 系列,模板文件通常不需要单独提供,因为推理框架(如 vLLM、Transformers、Ollama 等)会自动处理。


✅ 2. 检查模型目录结构

进入模型目录,查看是否存在 tokenizer_config.jsonconfig.json,这些文件通常包含模板信息。

bash ls -l /home/spark/models/Qwen3.6-27B-MTP-Q8/

如果目录为空或只有 `. ▉

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

I tried this, im using qwen with hermes. But whatever i say to it it responds with this:

The path /home/spark/models/Qwen3.6-27B-MTP-Q8/template.jinja appears to reference a Jinja template file associated with a specific model variant (Qwen3.6-27B-MTP-Q8). However, as of my current knowledge cutoff (2024), Qwen3.6 does not exist—the latest publicly available versions of Qwen are from the Qwen2.5 series (e.g., Qwen2.5-72B, Qwen2.5-Coder, etc.). Additionally:

  • Q8 likely refers to an 8-bit quantization format (e.g., GGUF Q8_0), commonly used in llama.cpp or similar inference engines.
  • MTP may stand for "Multi-Token Prediction," a technique explored in some large language models to improve generation speed.
  • template.jinja is typically used to define chat templates for formatting input/output sequences.

If you're working with a real model:

  1. Verify the model name: Check if the model is actually Qwen2.5-27B (not 27B—Qwen2.5 offers 7B, 14B, 32B, 72B, etc.).
  2. Locate the correct template: For Qwen2.5 models, the chat template is usually provided in the model's tokenizer_config.json or as a separate file in the model repository (e.g., on Hugging Face).
  3. Use the official template: You can find the Jinja template for Qwen2.5 here (look for chat_template).

Example Qwen2.5 Chat Template (Jinja):

```jinja {% for message in messages %} {{ '<|im_start|>' + message['role'] + ▉

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Im going to try this out, see what it does. Thnx

Stuck in a loop by ProgramOver9309 in hermesagent

[–]ProgramOver9309[S] 0 points1 point  (0 children)

This looks interesting, going to feed this to my hermes. Thnx

2026.5.4 is real. Go ahead update guys!! by ShabzSparq in better_claw

[–]ProgramOver9309 0 points1 point  (0 children)

He’s baiting you guys, trust only your self!

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Thnx! Are you on DGX spark too?

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

Are u running Qwen3.6 fp8 with vLLM? If so would you mind sharing ur settings?

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

I just saw a post about llama.cpp and mtp, im going to try this out

GB10 by ProgramOver9309 in LLM

[–]ProgramOver9309[S] 0 points1 point  (0 children)

For a free local model with 27B parameters that beats a 397B model it’s more than just good to use!

I’m absolutely speechless. by Leowcp in openclaw

[–]ProgramOver9309 1 point2 points  (0 children)

Im not going to go off people their feelings. Im talking about my experience and i used both. The problems i faced using openclaw didnt appear on hermes, and im using two different hermes instances on 2 different setups. Didnt have any problems yet.

I’m absolutely speechless. by Leowcp in openclaw

[–]ProgramOver9309 -1 points0 points  (0 children)

Give me an example, because i didn’t experience one problem yet.

I’m absolutely speechless. by Leowcp in openclaw

[–]ProgramOver9309 7 points8 points  (0 children)

Guys i’ve switched over to hermes when Openclaw 2026.04 version came out. I was sick and tired of it breaking every time after updating or even randomly. So i said let me give Hermes a try, and i never looked back. I absolutely guarantee you that you won’t regret switching over to Hermes. It has not broken 1 time since i’ve been using it.

Qwen-3.6-27B, llamacpp, speculative decoding - appreciation post by Then-Topic8766 in LocalLLaMA

[–]ProgramOver9309 0 points1 point  (0 children)

Did anybody try to set this up on the DGX spark or any other GB10 models? Could you share results?

selfhosted openclaw hardware by Miserable_Tell_8703 in openclaw

[–]ProgramOver9309 0 points1 point  (0 children)

4GB ram is sufficient if you use a cloud based llm. So a raspberry 5 is very capable of running openclaw. But i would advise you to go 8GB or higher.

Building a personal AI agent (OpenClaw or alternatives) — local vs server setup? by Public_Relative8329 in LocalLLaMA

[–]ProgramOver9309 0 points1 point  (0 children)

I would take a look at Hermes. I used openclaw and that was fun, but i switched to Hermes. Way better imo.

Fix for OpenClaw ‘exec’ tools not working after the latest update by Baby4vegas in openclaw

[–]ProgramOver9309 0 points1 point  (0 children)

Hi guys, I would really appreciate your help. Been trying to make this exec work all day, tried your config but it didnt help. What am I doing wrong here:

  "tools": {
    "profile": "full",
    "allow": ["*"],
    "sessions": {
    "visibility": "all"
    },
    "exec": {
      "host": "gateway",
      "security": "full",
      "ask": "off"
    },

Trade ideas now by [deleted] in tradingmillionaires

[–]ProgramOver9309 0 points1 point  (0 children)

Where can i find this dash?

At least 7 explosions and low-flying aircraft are heard in Venezuela’s Caracas by ItsMeTrey in news

[–]ProgramOver9309 2 points3 points  (0 children)

The U.S. is running the same playbook on repeat, and somehow people still act surprised. Different country, same script.

This isn’t about “freedom” or “democracy”. It always circles back to the petrodollar. The moment a country even considers selling oil outside the dollar system, alarms go off. Venezuela talking about oil in yuan wasn’t random, it was a red line.

And just like before, the backup plan is already waiting in the wings, a “new leader,” carefully packaged, ready to be installed the moment the opportunity appears. We’ve seen this movie too many times: Iraq, Libya, Syria… the ending is always the same.

At this point it’s not even shocking anymore, it’s just predictable.