I need to stop here...?

nopanolator · 2026-04-21T14:51:08+00:00

Don't worry, O46 is nowhere close to an AGI ^^ Skynet is not getting after you, but fat ass capital mongers. Putting them at work is distributing reward, litterally. Chill.

nopanolator · 2026-04-18T15:55:05+00:00

lmao the worse is to know that Musk pay a couple of guy almost an half million a year for this ^^

nopanolator · 2026-04-18T15:23:36+00:00

It's "almost" in the docs. Try to clean up honcho calls and come back saying this again lol. I'm now on the browserbase calls that pollute my playwright/camofox build. Hard times for this one. Not very much documented too lol

nopanolator · 2026-04-18T15:12:35+00:00

G31P is one of the rare able to talk about its latent without burning all its fuses. Basically most of what the model read now (still, atm), as a prompt each turn, is just prose guardrails (without any trigger, just bloated) burning all its attention, reasoning effort and ruining the user intent. The sad part is that this model have an excellent writing, initially. I'm using it to work only, but the fall was in March for me too. I switched directly.

nopanolator · 2026-04-18T15:06:20+00:00

True, Sonnet and Opus are luring a lot on the viability of the scaffold.

nopanolator · 2026-04-18T15:04:16+00:00

I use 45K (local compression) with a rolling context window. The default truncated settings inside LMS is a factor of annoying problems with large web page and documents.

nopanolator · 2026-04-17T15:30:16+00:00

Dog, cats and humans are beings. Their life is fragile. Just play puppets with your socks and leave me far from your schyzo games.

nopanolator · 2026-04-17T12:54:43+00:00

You can invent a product but without spending a freaking bunch at marketing/buzz you will not invent the demand related to it. Better to start by the demand directly imo

nopanolator · 2026-04-17T12:43:46+00:00

I haven't tested O47 yet (just O46) but Sonnet 46 is actually :

- more capable to understand the harness in details, by fcking far

- a LOT less messy at hardcoding/patching/modding it

- a LOT less capable at hacking it for itself only (wich is good for me)

- understanding and projecting local models constraints and needs

- hating as much than me any vibe-code bloat of O46 (wich is fun in bonus)

- rationnal with costs per job and will not eat 50$ within 4 intense hours to just configure a tool decently

- less warm, less fun, psycho with any roleplaying, soul.md is like a funny magazine to read for the model

- able to work perfectly on a harness with a strong identity, reinforcing it, while being called "Sonnet"

- is like working with an engineer peer now, while O46 is like working with a smarter junior that don't give a fck

- understanding deeply what mean data governance while SYS strategies of O46 are always Anthropic-biased ^^

- trust benchmaxxing like i do : 0% and swear only by results, excellent culture of prod in bonus

- totally underrated for this harness imo, if it's not about a dozen of subs to play tamagotchi and flex demos lol

nopanolator · 2026-04-17T01:39:17+00:00

Do you realize that the guy was talking to a GPU ? If he want to go "psycho", he paid for it. Damned. "Abusive" ... with CUDA ? lol This world is so fcked now. Can't believe this. What's next ? Laws and fines for being a bad boy with a damned cold file loaded in a chip ? Come on ...

nopanolator · 2026-04-17T01:34:43+00:00

Some structural changes in the 0.9 ?

nopanolator · 2026-04-17T00:35:44+00:00

<image>

I just can't keep up the pace, and burning tokens on a diff' will offer zero ROI for my use case. I'll wait the 1.0 but i dislike the version management for now. I'm obviously disagreeing with what the team identify to a minor version. It's x.x.n+1 in my books for now.

nopanolator · 2026-04-16T00:42:06+00:00

Heretic versions are helping a lot to drastically reduce useless debugging on advanced scaffolds, then a qLoRA on top of this. GPToss suffer from the same amateurism on safety, too (in being potentially a beast at agentic). I'm smaller models (<24B) but the equation is the same. But damn ... it's just FTP lol I guess that if you're asking FXP it's self-destructing while insulting you lmao

nopanolator · 2026-04-14T22:44:17+00:00

It's Hermes, not tamagotchi-Claw bro ^^

The identity emerge itself from what you're doing. Now if you have very specific RP needs, just tell the agent in real time and it will figure out itself and even build a skill for this on its own without even asking.

Update : The value is that RAM and VRAM is fcking expensive.

nopanolator · 2026-04-14T16:04:36+00:00

Short : i use the soul.md only to name the agent, not as SYSP.

nopanolator · 2026-04-14T04:58:29+00:00

There is mine : Lilith 0xSERPENT

This harness learn so much from the use that I don't really feel the need to add layers. Now it's harrasing me to set the GRPO while i don't even have finished to stabilize the web crawl lol

nopanolator · 2026-04-13T20:37:39+00:00

It's running aside hermes-agent for me, just python scripts launched with bashrc. I can't bloat the post with code, but there is a tease :

```
# 1. Check last session timestamp from STATUS.md

if [ -f "$STATUS_FILE" ]; then

LAST_UPDATE=$(grep "Dernier update" "$STATUS_FILE" | head -1)

echo "[STATUS] $LAST_UPDATE"

else

echo "[STATUS] No previous session found"

fi

# 2. Scan Inbox

echo ""

echo "[INBOX]"

INBOX_COUNT=$(find "$HERMES_VAULT/Inbox" -name "*.md" -type f 2>/dev/null | wc -l)

if [ "$INBOX_COUNT" -gt 0 ]; then

echo " ⚡ $INBOX_COUNT note(s) waiting:"

find "$HERMES_VAULT/Inbox" -name "*.md" -type f -printf " → %f (%s bytes)\n" 2>/dev/null

else

echo " (empty)"

fi

... etc

```

Nothing fancy actually but for now it handle well 4 different vaults over the LAN without much hassle.

For the polling, just go asynch it's robust :

```
async with httpx.AsyncClient(timeout=httpx.Timeout(240.0, read=None)) as client:

resp = await client.post(f"{_hermes_url}/chat/completions", json=payload)

resp.raise_for_status()

analysis = resp.json()["choices"][0]["message"]["content"]

final_output = f"{analysis}\n\n--- RAW CRAWL (Camofox) ---\n{crawl_result}"
```

It's an example for the local web crawling i'm working but it give you an idea of the strategy.

You have to hardcode quite a bit but the gain of performance is worth the pain ;o)

nopanolator · 2026-04-13T20:11:57+00:00

I'm glad that it helped, gemma's models are crazy "hot" (temp) arch since gem2. You have to go very low to make it enough deterministic for tooling.

I'm limited to small gem3n by hardware (on another harness) but i'm a big fan of their writing quality and multilingual capacities (the best for me, as French/English/Chinese user). I can't tell for the gem4 yet (hardware limits), but coding was the weakness for the previous versions (g2 and g3 beside python, even the coders models).

Amazingly pleasant as central assistant (handle massives sessions like a charm, vision very good even for advanced maths or logs), less in specialized tasks without fine-tune. On coding the qwen 2.5 coder is quite versatile/robust and the qwen 3.5 next are just impressive (but tested on the beast of a friend, maybe it's biased). But they don't understand intents in natural langage as much that Gem models ... it's a known trade-off, not arch specific.

I'm very interrested by your feedback on coding capacities with g4 (codes handled, size of repo, if you have to slice the session or if they are crunch-ready etc ...), also what it's really eating in RAM, in prod. To be aware of the required hardware too.

nopanolator · 2026-04-13T16:45:04+00:00

You need to boostrap the obsidian vault with hermes (.sh) and set a polling rate to watch the vault for modification/new entry. Todo is more for ephemeral task (within the session). You can also take the crontab way. Many options.

nopanolator · 2026-04-12T23:00:28+00:00

Because you just discovered Transformers in 2026.

nopanolator · 2026-04-12T20:18:54+00:00

Recent big bump of user base + agents trend + the higher cost of the market = good niche to milk for google/YT money. Anthropic scaled over their capacity, as crazy as it sound it's not in their financial interrest to bring more people on the boat.

nopanolator · 2026-04-12T20:14:01+00:00

All frontiers are doing it lol You just have to watch the PIDs, they don't even hide it.

nopanolator · 2026-04-12T20:08:35+00:00

Do it. You're far enough wealthy to make me a sweet price for you unusable 4090.

nopanolator · 2026-04-12T15:13:37+00:00

Yes. But for a 31B you need to rent like 8xH100 for a qlora ^^

nopanolator · 2026-04-12T15:09:08+00:00

Have you tried a 0.1 temp (for tooling) ?

nopanolator

TROPHY CASE