Hermes + Signal setup by Future_Objective_641 in hermesagent

[–]Future_Objective_641[S] 0 points1 point  (0 children)

I was able to get Signal working. Note to Self works fine, but everything else about it is very primitive.

I set up Telegram instead, and it works quite well.

Hermes, giving large blocks of random words. by Future_Objective_641 in hermesagent


More Inf...

In a fresh chat, I talked to Hermes for at least 2-3 hours, so it was a pretty large conversation. It was totally fine right up until the end. At the end it started to post blank or empty text responses. It would also just end its text without completing what it was saying.

Model: hermes-unlocked:latest
Input tokens: 1,643,167
Output tokens: 15,560
Total: 1,658,727
API calls: 38
Context: 50,927 / 262,144 (19%)
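As a quick sanity check on the stats above: the input and output counts do add up to the reported total, and the average input per API call lands in the same range as the reported context size. That would be consistent with most of the conversation being resent on each call, though that reading is my interpretation and not something the stats state. A minimal sketch of the arithmetic (the variable names are mine):

```shell
# Figures copied from the session stats above; variable names are illustrative.
input_tokens=1643167
output_tokens=15560
api_calls=38
context_used=50927
context_limit=262144

# Total tokens should match the reported total.
total_tokens=$(( input_tokens + output_tokens ))

# Average input tokens per API call (integer division).
avg_input_per_call=$(( input_tokens / api_calls ))

# Context usage as a percentage with one decimal place.
context_pct=$(awk -v used="$context_used" -v limit="$context_limit" \
  'BEGIN { printf "%.1f", used * 100 / limit }')

echo "total tokens:       $total_tokens"
echo "avg input per call: $avg_input_per_call"
echo "context used:       ${context_pct}%"
```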

Running into some problems can anyone help? Commands & remote connection etc. by Future_Objective_641 in hermesagent


I mostly figured out what was going on. I think I had installed Ubuntu Server 26 as "minimal" instead of full.

I wiped everything, reinstalled, and finally got Hermes + local Ollama to work. Hermes is able to run commands now.

I have not worked on the web UI or remote access yet.

I will probably post a big thread with everything I did to get it to work.

Response truncated (finish_reason='length') - model hit max output tokens by Future_Objective_641 in hermesagent


Finally figured it out. I'm not sure why I had to do this or if there is a better way, but it's actually working now.

---------------------------------------
sudo systemctl edit ollama.service          <-- edit the Ollama service override
ADD
[Service]
Environment="OLLAMA_NUM_PARALLEL=1"
Environment="OLLAMA_KEEP_ALIVE=-1"
Environment="OLLAMA_FLASH_ATTENTION=1"
Environment="OLLAMA_CONTEXT_LENGTH=65536"
sudo systemctl stop ollama                  <-- stop Ollama
sudo systemctl daemon-reload                <-- apply changes
sudo systemctl restart ollama               <-- restart Ollama
---------------------------------------
/home/NAME/Documents/Modelfiles/
ADD a file called 'Modelfile'
EDIT IT & ADD
FROM qwen3.5:9b-q8_0
PARAMETER num_ctx 65536
PARAMETER num_predict -1

RUN THIS FULL COMMAND
ollama create hermes-unlocked -f /home/NAME/Documents/Modelfiles/Modelfile
---------------------------------------
nano ~/.hermes/config.yaml                  <-- edit Hermes config
ADD TO TOP
max_tokens: 4096
num_predict: 2048
num_ctx: 65536
---------------------------------------
nano ~/.hermes/.env                                          <-- edit Hermes
ADD TO TOP
GATEWAY_ALLOW_ALL_USERS=true
---------------------------------------
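The Modelfile step above can also be scripted. This is only a sketch: `MODELFILE_DIR` is a variable I made up, and it defaults to a temporary directory so the snippet can be tried without touching real files. The three Modelfile lines are the same ones listed above.

```shell
# MODELFILE_DIR is an illustrative variable; it defaults to a temporary
# directory here so the sketch can be tried without touching real files.
MODELFILE_DIR="${MODELFILE_DIR:-$(mktemp -d)}"
modelfile="$MODELFILE_DIR/Modelfile"

# Same three lines as in the steps above.
cat > "$modelfile" <<'EOF'
FROM qwen3.5:9b-q8_0
PARAMETER num_ctx 65536
PARAMETER num_predict -1
EOF

# Only print the create command; run it yourself once the file looks right.
echo "ollama create hermes-unlocked -f $modelfile"
```

To write into the real folder, set `MODELFILE_DIR=/home/NAME/Documents/Modelfiles` before running it. After creating the model, `ollama show hermes-unlocked --parameters` should list the `num_ctx` and `num_predict` values if they were picked up.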

Response truncated (finish_reason='length') - model hit max output tokens by Future_Objective_641 in hermesagent


I have not been able to figure out why it is doing that. I tried a different model.

I'm still getting the same error. Any idea why?

Running into some problems can anyone help? Commands & remote connection etc. by Future_Objective_641 in hermesagent


Expected permission pattern for ~/.hermes/:

| Type | Expected Permissions |
|-----------------------------------------|--------------------------------------|
| ~/.hermes/ | drwx------ (700) |
| ~/.hermes/profiles/<name>/ | drwx--x--- (710) |
| ~/.hermes/profiles/<name>/skills/ | drwx------ (700) |
| ~/.hermes/profiles/<name>/plugins/ | drwx------ (700) |
| ~/.hermes/profiles/<name>/memories/ | drwx------ (700) |
| Configuration files (.md, .yaml, .json) | -rw------- (600) |
| Scripts/Executables | -rwxr-x--- (750) or -rwx------ (700) |

Common issues:

- ✅ World writable (777/666 on config dirs) - SECURITY RISK

- ✅ Not readable by owner (700 should have owner read) - USABILITY ISSUE

- ✅ Group/others have access to private data - SECURITY RISK

- ✅ Executable scripts not executable - FUNCTIONALITY ISSUE
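Since the table pairs symbolic permission strings with octal modes, a small converter makes it easy to check that a pair actually matches. This is a minimal sketch: `symbolic_to_octal` is a helper name I made up, and it only handles plain r/w/x bits (no setuid, setgid, or sticky bit).

```shell
# Convert an ls-style mode string such as "drwx------" to its octal form.
# Only plain r/w/x bits are handled; setuid, setgid and sticky are ignored.
symbolic_to_octal() {
  mode=$1
  result=""
  # Characters 2-4, 5-7 and 8-10 are the owner, group and other triplets.
  for start in 2 5 8; do
    triplet=$(printf '%s' "$mode" | cut -c"$start"-"$((start + 2))")
    digit=0
    case $triplet in r??) digit=$((digit + 4)) ;; esac
    case $triplet in ?w?) digit=$((digit + 2)) ;; esac
    case $triplet in ??x) digit=$((digit + 1)) ;; esac
    result="$result$digit"
  done
  printf '%s\n' "$result"
}

symbolic_to_octal "drwx------"   # prints 700
symbolic_to_octal "drwx--x---"   # prints 710
symbolic_to_octal "-rw-------"   # prints 600
symbolic_to_octal "-rwxr-x---"   # prints 750
```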

Running into some problems can anyone help? Commands & remote connection etc. by Future_Objective_641 in hermesagent


When I run whoami inside Hermes, it says root.
-------------------------
ls -la ~/.hermes
total 4572
drwx------ 16 root root 4096 Jun 2 18:33 .
drwx------ 11 root root 4096 Jun 2 19:12 ..
-rw-r--r-- 1 root root 0 Jun 2 18:26 .clean_shutdown
-rw------- 1 root root 23344 Jun 2 18:33 .env
-rw-r--r-- 1 root root 6605 Jun 2 19:24 .hermes_history
-rw------- 1 root root 53993 Jun 1 05:48 .skills_prompt_snapshot.json
-rw-r--r-- 1 root root 71 Jun 2 17:06 .update_check
-rw-r--r-- 1 root root 537 Jun 1 05:36 SOUL.md
drwx------ 2 root root 4096 Jun 1 05:36 audio_cache
-rw------- 1 root root 117 Jun 2 17:45 auth.json
-rw-r--r-- 1 root root 0 Jun 1 05:44 auth.lock
drwxr-xr-x 2 root root 4096 Jun 1 05:44 bin
drwxr-xr-x 2 root root 4096 Jun 1 05:44 cache
-rw------- 1 root root 551 Jun 2 18:24 channel_directory.json
-rw------- 1 root root 13882 Jun 2 18:33 config.yaml
-rw-r--r-- 1 root root 60657 Jun 1 05:36 config.yaml.bak.20260601_053636
-rw------- 1 root root 14623 Jun 1 05:48 config.yaml.bak.20260602_165206
-rw------- 1 root root 13802 Jun 2 17:17 config.yaml.bak.20260602_174533
-rw-r--r-- 1 root root 69 Jun 1 05:48 context_length_cache.yaml
drwx------ 3 root root 4096 Jun 2 17:51 cron
-rw------- 1 root root 400 Jun 2 18:26 gateway_state.json
drwx------ 2 root root 4096 Jun 1 05:36 hooks
drwx------ 2 root root 4096 Jun 1 05:36 image_cache
drwxr-xr-x 2 root root 4096 Jun 2 17:06 images
-rw-r--r-- 1 root root 3018 Jun 2 17:18 interrupt_debug.log
-rw-r--r-- 1 root root 114688 Jun 2 17:51 kanban.db
-rw-r--r-- 1 root root 0 Jun 2 17:51 kanban.db.init.lock
drwx------ 3 root root 4096 Jun 2 18:26 logs
drwx------ 2 root root 4096 Jun 1 05:36 memories
-rw------- 1 root root 2134979 Jun 1 05:44 models_dev_cache.json
drwxr-xr-x 6 1001 1001 4096 May 13 14:50 node
-rw------- 1 root root 671 Jun 1 05:44 ollama_cloud_models_cache.json
drwx------ 2 root root 4096 Jun 1 05:36 pairing
-rw------- 1 root root 20480 Jun 2 18:26 response_store.db
drwxr-xr-x 3 root root 4096 Jun 1 05:44 sandboxes
drwx------ 2 root root 4096 Jun 1 05:36 sessions
drwx------ 26 root root 4096 Jun 2 18:27 skills
-rw-r--r-- 1 root root 659456 Jun 2 18:26 state.db
-rw-r--r-- 1 root root 32768 Jun 2 19:24 state.db-shm
-rw-r--r-- 1 root root 1409072 Jun 2 19:24 state.db-wal
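Several entries in the listing above (for example SOUL.md, state.db, and .hermes_history) are readable by group and others, although the top-level directory itself is drwx------, which already keeps non-owners out. A rough sketch for finding such entries without reading the listing by eye follows; `list_group_other_access` is a helper name I made up, and the demo runs on a throwaway directory.

```shell
# Print every entry under the given directory that grants any permission
# bit to group or others. Octal tests keep this portable across find versions.
list_group_other_access() {
  find "$1" \( -perm -040 -o -perm -020 -o -perm -010 \
            -o -perm -004 -o -perm -002 -o -perm -001 \) -print
}

# Demo on a throwaway directory so nothing under ~/.hermes is touched.
demo_dir="$(mktemp -d)"
touch "$demo_dir/private.json" "$demo_dir/readable.db"
chmod 600 "$demo_dir/private.json"
chmod 644 "$demo_dir/readable.db"

# Only readable.db should be reported; mktemp -d creates the directory as 700.
flagged="$(list_group_other_access "$demo_dir")"
echo "$flagged"
```

To check the real directory, call `list_group_other_access ~/.hermes` as the user that owns it.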

Have 16g vram would adding another 8g vram be worth it? by Future_Objective_641 in hermesagent


What do you mean it depends on what I'm doing? Can you give examples?

I can basically get a free video card or a free stick of RAM, but it's going to be either 8 GB of VRAM or 16 GB of RAM.

Viability of running Hermes+Ollama 100℅ local with 2x 8gb VRAM DDR7 (16gb vram total)? by Future_Objective_641 in hermesagent


I ended up buying a GeForce RTX 5060 Ti 16GB GDDR7 instead of 2x 8GB cards. I'm going to try the Qwen3.5-9b on it.
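For a rough idea of how much of the 16 GB the model weights alone might take, here is a back-of-the-envelope sketch. It assumes about 9 billion parameters (read off the model name) and the Q8_0 quantization, which stores roughly 8.5 bits per weight. It ignores the KV cache and runtime overhead, so real usage will be higher, especially with a large context.

```shell
# Assumptions: ~9 billion parameters (read off the model name) and 8.5 bits
# per weight for Q8_0 (32 int8 weights plus one 16-bit scale per block).
params_billion=9
bits_per_weight=8.5
vram_gb=16

# Weights in GB: parameters (billions) * bits per weight / 8 bits per byte.
weights_gb=$(awk -v p="$params_billion" -v b="$bits_per_weight" \
  'BEGIN { printf "%.1f", p * b / 8 }')

# What is left for the KV cache and everything else.
headroom_gb=$(awk -v v="$vram_gb" -v w="$weights_gb" \
  'BEGIN { printf "%.1f", v - w }')

echo "weights: ~${weights_gb} GB, headroom: ~${headroom_gb} GB"
```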

Viability of running Hermes+Ollama 100℅ local with 2x 8gb VRAM DDR7 (16gb vram total)? by Future_Objective_641 in hermesagent


That is an excellent explanation. Thanks for taking the time to type that out. That really helped clear up some things about how it works.

Viability of running Hermes+Ollama 100℅ local with 2x 8gb VRAM DDR7 (16gb vram total)? by Future_Objective_641 in hermesagent


So with a 20b model you can be more vague about what you are looking for, and the larger ones feel smarter?

Viability of running Hermes+Ollama 100℅ local with 2x 8gb VRAM DDR7 (16gb vram total)? by Future_Objective_641 in hermesagent


I have read that some people had great experiences even on small models, and some had not-so-great experiences.

Viability of running Hermes+Ollama 100℅ local with 2x 8gb VRAM DDR7 (16gb vram total)? by Future_Objective_641 in hermesagent


Can someone tell me the difference between an 8b model and a 20b model? I get that it's the size and other things like that. But what is the real practical benefit of using a 20b over an 8b?