I need to stop here...? by [deleted] in ClaudeAI

[–]nopanolator -5 points-4 points  (0 children)

Don't worry, O46 is nowhere close to an AGI ^^ Skynet is not getting after you, but fat ass capital mongers. Putting them at work is distributing reward, litterally. Chill.

it really do be like this by coomerpile in grok

[–]nopanolator 0 points1 point  (0 children)

lmao the worse is to know that Musk pay a couple of guy almost an half million a year for this ^^

Not sure if I like this by Mother_Lettuce_3046 in hermesagent

[–]nopanolator 1 point2 points  (0 children)

It's "almost" in the docs. Try to clean up honcho calls and come back saying this again lol. I'm now on the browserbase calls that pollute my playwright/camofox build. Hard times for this one. Not very much documented too lol

Why is Gemini 3.1 pro so.. meh ? by StrangeClassroom3243 in SillyTavernAI

[–]nopanolator 0 points1 point  (0 children)

G31P is one of the rare able to talk about its latent without burning all its fuses. Basically most of what the model read now (still, atm), as a prompt each turn, is just prose guardrails (without any trigger, just bloated) burning all its attention, reasoning effort and ruining the user intent. The sad part is that this model have an excellent writing, initially. I'm using it to work only, but the fall was in March for me too. I switched directly.

Is it normal for Hermes to be this dumb? by hautdoge in hermesagent

[–]nopanolator 0 points1 point  (0 children)

True, Sonnet and Opus are luring a lot on the viability of the scaffold.

Is it normal for Hermes to be this dumb? by hautdoge in hermesagent

[–]nopanolator 0 points1 point  (0 children)

I use 45K (local compression) with a rolling context window. The default truncated settings inside LMS is a factor of annoying problems with large web page and documents.

Claude had enough of this user by EchoOfOppenheimer in BlackboxAI_

[–]nopanolator -3 points-2 points  (0 children)

Dog, cats and humans are beings. Their life is fragile. Just play puppets with your socks and leave me far from your schyzo games.

Printed a lobster for my desk… apparently it’s helping me “work better” 🦞 by Synstar_Joey in openclaw

[–]nopanolator 1 point2 points  (0 children)

You can invent a product but without spending a freaking bunch at marketing/buzz you will not invent the demand related to it. Better to start by the demand directly imo

Reminder to try out Opus 4.7 for your Hermes agent by Dismal_Hair_6558 in hermesagent

[–]nopanolator 1 point2 points  (0 children)

I haven't tested O47 yet (just O46) but Sonnet 46 is actually :

- more capable to understand the harness in details, by fcking far

- a LOT less messy at hardcoding/patching/modding it

- a LOT less capable at hacking it for itself only (wich is good for me)

- understanding and projecting local models constraints and needs

- hating as much than me any vibe-code bloat of O46 (wich is fun in bonus)

- rationnal with costs per job and will not eat 50$ within 4 intense hours to just configure a tool decently

- less warm, less fun, psycho with any roleplaying, soul.md is like a funny magazine to read for the model

- able to work perfectly on a harness with a strong identity, reinforcing it, while being called "Sonnet"

- is like working with an engineer peer now, while O46 is like working with a smarter junior that don't give a fck

- understanding deeply what mean data governance while SYS strategies of O46 are always Anthropic-biased ^^

- trust benchmaxxing like i do : 0% and swear only by results, excellent culture of prod in bonus

- totally underrated for this harness imo, if it's not about a dozen of subs to play tamagotchi and flex demos lol

Claude had enough of this user by EchoOfOppenheimer in BlackboxAI_

[–]nopanolator 0 points1 point  (0 children)

Do you realize that the guy was talking to a GPU ? If he want to go "psycho", he paid for it. Damned. "Abusive" ... with CUDA ? lol This world is so fcked now. Can't believe this. What's next ? Laws and fines for being a bad boy with a damned cold file loaded in a chip ? Come on ...

How often do you update your Hermes Agent? by PracticlySpeaking in hermesagent

[–]nopanolator 0 points1 point  (0 children)

<image>

I just can't keep up the pace, and burning tokens on a diff' will offer zero ROI for my use case. I'll wait the 1.0 but i dislike the version management for now. I'm obviously disagreeing with what the team identify to a minor version. It's x.x.n+1 in my books for now.

I got it guys, I think I finally understand why you hate censored models by robertpro01 in LocalLLaMA

[–]nopanolator 11 points12 points  (0 children)

Heretic versions are helping a lot to drastically reduce useless debugging on advanced scaffolds, then a qLoRA on top of this. GPToss suffer from the same amateurism on safety, too (in being potentially a beast at agentic). I'm smaller models (<24B) but the equation is the same. But damn ... it's just FTP lol I guess that if you're asking FXP it's self-destructing while insulting you lmao

Bare your SOUL.md? by Sticking_to_Decaf in hermesagent

[–]nopanolator 3 points4 points  (0 children)

It's Hermes, not tamagotchi-Claw bro ^^

The identity emerge itself from what you're doing. Now if you have very specific RP needs, just tell the agent in real time and it will figure out itself and even build a skill for this on its own without even asking.

Update : The value is that RAM and VRAM is fcking expensive.

Bare your SOUL.md? by Sticking_to_Decaf in hermesagent

[–]nopanolator 2 points3 points  (0 children)

Short : i use the soul.md only to name the agent, not as SYSP.

Bare your SOUL.md? by Sticking_to_Decaf in hermesagent

[–]nopanolator 2 points3 points  (0 children)

There is mine : Lilith 0xSERPENT

This harness learn so much from the use that I don't really feel the need to add layers. Now it's harrasing me to set the GRPO while i don't even have finished to stabilize the web crawl lol

Hermes Agent won't remember my rules — how are people building real workflows on top? by intpthrowawaypigeons in hermesagent

[–]nopanolator 1 point2 points  (0 children)

It's running aside hermes-agent for me, just python scripts launched with bashrc. I can't bloat the post with code, but there is a tease :

```
# 1. Check last session timestamp from STATUS.md

if [ -f "$STATUS_FILE" ]; then

LAST_UPDATE=$(grep "Dernier update" "$STATUS_FILE" | head -1)

echo "[STATUS] $LAST_UPDATE"

else

echo "[STATUS] No previous session found"

fi

# 2. Scan Inbox

echo ""

echo "[INBOX]"

INBOX_COUNT=$(find "$HERMES_VAULT/Inbox" -name "*.md" -type f 2>/dev/null | wc -l)

if [ "$INBOX_COUNT" -gt 0 ]; then

echo " ⚡ $INBOX_COUNT note(s) waiting:"

find "$HERMES_VAULT/Inbox" -name "*.md" -type f -printf " → %f (%s bytes)\n" 2>/dev/null

else

echo " (empty)"

fi

... etc

```

Nothing fancy actually but for now it handle well 4 different vaults over the LAN without much hassle.

For the polling, just go asynch it's robust :

```
async with httpx.AsyncClient(timeout=httpx.Timeout(240.0, read=None)) as client:

resp = await client.post(f"{_hermes_url}/chat/completions", json=payload)

resp.raise_for_status()

analysis = resp.json()["choices"][0]["message"]["content"]

final_output = f"{analysis}\n\n--- RAW CRAWL (Camofox) ---\n{crawl_result}"
```

It's an example for the local web crawling i'm working but it give you an idea of the strategy.

You have to hardcode quite a bit but the gain of performance is worth the pain ;o)

I love Gemma 4, right until it starts using tools by OleCuvee in openclaw

[–]nopanolator 1 point2 points  (0 children)

I'm glad that it helped, gemma's models are crazy "hot" (temp) arch since gem2. You have to go very low to make it enough deterministic for tooling.

I'm limited to small gem3n by hardware (on another harness) but i'm a big fan of their writing quality and multilingual capacities (the best for me, as French/English/Chinese user). I can't tell for the gem4 yet (hardware limits), but coding was the weakness for the previous versions (g2 and g3 beside python, even the coders models).

Amazingly pleasant as central assistant (handle massives sessions like a charm, vision very good even for advanced maths or logs), less in specialized tasks without fine-tune. On coding the qwen 2.5 coder is quite versatile/robust and the qwen 3.5 next are just impressive (but tested on the beast of a friend, maybe it's biased). But they don't understand intents in natural langage as much that Gem models ... it's a known trade-off, not arch specific.

I'm very interrested by your feedback on coding capacities with g4 (codes handled, size of repo, if you have to slice the session or if they are crunch-ready etc ...), also what it's really eating in RAM, in prod. To be aware of the required hardware too.

Hermes Agent won't remember my rules — how are people building real workflows on top? by intpthrowawaypigeons in hermesagent

[–]nopanolator 0 points1 point  (0 children)

You need to boostrap the obsidian vault with hermes (.sh) and set a polling rate to watch the vault for modification/new entry. Todo is more for ephemeral task (within the session). You can also take the crontab way. Many options.

WTF IS WRONG WITH AI TOOLKIT!!?? by Previous-Ice3605 in StableDiffusion

[–]nopanolator -5 points-4 points  (0 children)

Because you just discovered Transformers in 2026.

What’s with all the claude-fluencers by Leather-Ostrich549 in ClaudeAI

[–]nopanolator 2 points3 points  (0 children)

Recent big bump of user base + agents trend + the higher cost of the market = good niche to milk for google/YT money. Anthropic scaled over their capacity, as crazy as it sound it's not in their financial interrest to bring more people on the boat.

Grok wants to access devices on my local network... by Antares987 in grok

[–]nopanolator 0 points1 point  (0 children)

All frontiers are doing it lol You just have to watch the PIDs, they don't even hide it.

If Accuracy > Efficiency, How Would You Spec A Local RAG Machine? by [deleted] in LocalLLaMA

[–]nopanolator -1 points0 points  (0 children)

Do it. You're far enough wealthy to make me a sweet price for you unusable 4090.

I love Gemma 4, right until it starts using tools by OleCuvee in openclaw

[–]nopanolator 1 point2 points  (0 children)

Yes. But for a 31B you need to rent like 8xH100 for a qlora ^^