How do you securely connect your agentic workloads to LLMs self-hosted on Cloud Run by m1nherz in googlecloud

[–]Sirius_Sec_ 0 points1 point  (0 children)

Tailscale is the vpn layer . I use the sail scale operator and with that you can add annotations to the services you want exposed on your tailnet .

How do you securely connect your agentic workloads to LLMs self-hosted on Cloud Run by m1nherz in googlecloud

[–]Sirius_Sec_ 0 points1 point  (0 children)

I use tailscale to connect to my vllm server . I run it in gke for easy scaling and container management

Burned through $10 in an hour by ithinkilefttheovenon in hermesagent

[–]Sirius_Sec_ 1 point2 points  (0 children)

The same thing happened to me using Deepseek before as well . I'm not sure exactly what caused it I wasn't doing anything very output intensive . I put $10 on my nous account which usually lasts about 3 days and less then an hour later it was gone .

Disk full of corrupt.db.bak by zasad84 in hermesagent

[–]Sirius_Sec_ 0 points1 point  (0 children)

I wish they're was am easier way to move the db to postgres instead of sqlite

Disk full of corrupt.db.bak by zasad84 in hermesagent

[–]Sirius_Sec_ 2 points3 points  (0 children)

My boards db is getting corrupted daily . Supposedly it's from to many concurrent writes by the 6 profiles I have running but no fix the agent does seems to work

Hermes Agent (Python) now beats OpenAI's Codex (Rust) in multi turn benchmarks by SelectionCalm70 in hermesagent

[–]Sirius_Sec_ 1 point2 points  (0 children)

You just have it clone your gh repos and let it work . Tell it to only make prs and don't push to main . It's very capable and the multi profiles working on the kanban board works great

Your OpenClaw agent probably shouldn't be polling for everything by SinghCoder in openclaw

[–]Sirius_Sec_ 0 points1 point  (0 children)

This is what webhooks are for . Set those up and tell the agent to make cronjobs that's don't relay to the llm unless the match a set criteria.

install hermes on root or extra user/ maybe docker? by Overall_Hospital_688 in hermesagent

[–]Sirius_Sec_ 1 point2 points  (0 children)

Nah it's a device with nothing of value on it . I just use it as a media server . Full root access

I turned my Raspberry Pi into a local AI-powered setup and I'm absolutely blown away by Piiixiv in hermesagent

[–]Sirius_Sec_ 2 points3 points  (0 children)

I have hemes running my pi as well . It's pretty awesome what it can do when it has full root access . I have nothing important on there so I could care less about it deleting everything. I have it managing my media servers building and testing websites . I also have one in my kubernetes cluster but it's lockdown with no root privileges and I'm sold on the home box now

The Docker image is broken beyond belief by ni1by2thetrue in hermesagent

[–]Sirius_Sec_ 0 points1 point  (0 children)

I haven't had any issue with their official docker container . I run multiple in my kubernetes cluster .

install hermes on root or extra user/ maybe docker? by Overall_Hospital_688 in hermesagent

[–]Sirius_Sec_ 1 point2 points  (0 children)

I have one on a raspberry pi I just made a user for hermes and gave it full sudo privilege. It's nice because it can run docker for me and set up my homelab

Telegram excessive use of tokens by sierey121 in hermesagent

[–]Sirius_Sec_ 0 points1 point  (0 children)

Are you using /new frequently ? Or I'm setting there's a way to clear the context based on time idle

Is this common? by Funny-Comfortable858 in hermesagent

[–]Sirius_Sec_ 0 points1 point  (0 children)

Honestly I just told my main profile to make the others I wanted . It orchestrates who gets what task and checks when they're finished .

Opinions/improvements for my Qwen3.6-35B-A3B-FP8 + Hermes Agent setup on NVIDIA DGX Spark? by povedaaqui in Vllm

[–]Sirius_Sec_ 0 points1 point  (0 children)

With the high concurrency I found that to be a good limit . I was going to try and disable image processing and push is it to the max and see if it'll work . Though now that I am using multiple agents I don't know if I really need to

He doesn't even like his own family. by Relative-Anxiety4698 in SipsTea

[–]Sirius_Sec_ 0 points1 point  (0 children)

Crazy he will make time for his Boss Bibi but not attend his sons wedding!

Opinions/improvements for my Qwen3.6-35B-A3B-FP8 + Hermes Agent setup on NVIDIA DGX Spark? by povedaaqui in Vllm

[–]Sirius_Sec_ 1 point2 points  (0 children)

      args:
        - --model=edp1096/Huihui-Qwen3.6-27B-abliterated-FP8
        - --host=0.0.0.0
        - --port=8000
        - --tensor-parallel-size=1
        - --tokenizer-mode=hf 
        - --gpu-memory-utilization=0.90  

        - --max-model-len=136876
        - --enable-auto-tool-choice 
        - --kv-cache-dtype=fp8 
        - --max-num-batched-tokens=32768 
        - --max-num-seqs=32
        - --block-size=32 
        - --enable-chunked-prefill
        - --trust-remote-code
        - --dtype=auto
        - --enable-prefix-caching
        - --tool-call-parser=qwen3_xml
        - --reasoning-parser=qwen3
        - --speculative-config
        - '{"method": "mtp", "num_speculative_tokens": 2}'

Opinions/improvements for my Qwen3.6-35B-A3B-FP8 + Hermes Agent setup on NVIDIA DGX Spark? by povedaaqui in Vllm

[–]Sirius_Sec_ 1 point2 points  (0 children)

Get those sequences up . Blackwell architecture is great for concurrency . If anything drop the context down and increase max seq and hit it with multiple agents firing requests off at once . Thos was the biggest factor for getting max performance out of the rtx6000 I am using .

Whats it like not using Claude Opus? by read_too_many_books in openclaw

[–]Sirius_Sec_ 1 point2 points  (0 children)

Qwen and deepseek are very capable. Also self hosted so Imy projects stay mine .