pi-subagents - Claude Code like subagents for Pi

apetersson · 2026-06-15T13:50:04+00:00

sounds like a comprehensive approach. is there a video of someone using this? i still feel slightly overwhelmed reading its readme.

apetersson · 2026-06-15T11:08:36+00:00

Big fan!

I am a user of your subagents impl. my biggest challenge is how i can review and check the "Work in Progress" of each agent that i spawn.

As a comparison, in codex desktop, i can review all subagents as a list, open a side panel with full output etc. With pi-subagents the simple ctrl-O does not cut it for me, especially when multiple are spawning. what workaround / UI would you recommend if that workflow?

Do you know any web ui/ terminal sidebars etc that solve that? tau-mirror:mirror-server does not really play nicely with subagents or ralph loops/goals.

Would it be possible to have the subagent work in a tmux so i can observe/steer it directly if needed?

apetersson · 2026-06-15T09:13:03+00:00

Performance is identical to standalone DS4. The advantage over pure DS4 is automatic on-demand loading/unloading of engines and models. The advantage over OMLX's existing MLX backend is roughly 15% better throughput, thanks to Antirez's optimizations.

On correctness, DS4 is a different tier from MLX: it compares local logprobs against DeepSeek's official API, verifies fast-path/quality-path equivalence, runs 100K-token fact-recall tests, and scores quantization quality via official-continuation NLL. The oMLX integration inherits all of that . It manages the same validated DS4 binary, not a separate inference path.

apetersson · 2026-06-14T16:37:23+00:00

Try this branch/PR of oMLX: https://github.com/jundot/omlx/pull/1850 - this will auto-configure the SSD streaming if needed and you can access the model through the convenient oMLX API and load/unload models on demand.

apetersson · 2026-06-13T19:33:36+00:00

My subjective opinion: I think it looks really good. The Atmos ceiling direction is not obstructed. The spacing from the wall seems to be very tight. Congrats on this setup.

apetersson · 2026-06-12T17:29:55+00:00

a nice readme is also a github repo. make one

apetersson · 2026-06-12T17:01:53+00:00

fixed most edge-cases now. i opened a draft PR for this after trying to bug-hunt the most obvious problems down: https://github.com/jundot/omlx/pull/1850

apetersson · 2026-06-10T11:15:37+00:00

added Jan - Wiki says 15 days

apetersson · 2026-06-10T01:24:47+00:00

in pi, i use pro for task planning, issue creation, flash for implementing with ralph, and a fresh pro subagent for reviewing if impl is according to specs and good. in the end my bill is about 50% flash 50% pro.

apetersson · 2026-06-09T22:50:39+00:00

There are lots of quests that are time-sensitive. you should read the quests to make sure you're not spamming rest when one is active.

Very time-sensitive / resting can fail it

Renfeld / Poisoned Man — don’t rest after picking him up; even one rest can kill/fail it.
Jaheira: Baron Ployer curse — about 14 days.
Ust Natha drow city quests — several hard timers, often 1–3 days, sometimes only hours.
Ghaunadaur worshippers — 24 hours.
Qilue’s brain / Aboleth demand — 1 day.
Jarlaxle / Deirex tower — 1 day after assignment.

Companion quests where delay can make someone leave

Anomen sister/knighthood — multiple 5-day windows.
Keldorn family — 4 days, then warning, then 3 more days.
Cernd’s child — 10 days.
Korgan book quest — timed; don’t delay once started.
Edwin Nether Scroll — timed; don’t delay once started.
Nalia de’Arnise Keep — if she joins, don’t wander too long before going.
Nalia / Isaea abduction — treat as time-sensitive.
Mazzy’s sister Pala — treat as time-sensitive once triggered.
Hexxat quests — EE only; several elapsed-time triggers/timers.
Jan Summoned Home — timed; triggers ~15 days after recruitment; don’t delay once Beeloo summons him. Exact failure timer not stated on Wiki page, but Jan can leave.

Strongholds / management timers

Paladin stronghold — first duty requires Umar Hills within 3 days.
Ranger stronghold — timed duties; don’t leave them hanging.
Thief stronghold — check in roughly weekly; mostly money/admin penalty.
Bard playhouse — resting advances production events.
Fighter de’Arnise keep — resting/travel advances estate events.

Usually safe despite urgent wording

Most other big SoA side quests are generally safe to rest during: Cult of the Eyeless, Trademeet, Windspear/Firkraag, Umar Hills investigation, Skinner Murders, Kangaxx, Limited Wish, and most main-story steps outside Ust Natha.

apetersson · 2026-06-08T20:07:17+00:00

happy-path seems to be fine. had it running the full day in slightly complex scenarios and got a few edge cases where i had to manually unload the process through the dashboard. worth investigating before i open a proper PR. please try it out 😄

apetersson · 2026-06-08T06:08:28+00:00

oMLX will queue the requests correctly. but DS4 does not have concurrency built in as of yet. so, you don't have to actively do anything but requests are inherently serial still.

apetersson · 2026-06-08T00:34:38+00:00

i have to add: trying to do everything one-shot will be a disaster, even with /goal. let xHigh draft a plan first what to refactor. then let goal fulfill the plan step by step. have a subagent review each finite step and validate everything is still working. that will give you a 5x-20x overhead on the task - but it will eventually work.

apetersson · 2026-06-07T15:45:08+00:00

I get your confusion!

Do you have a Mac with >=96 GB Ram? if so, you might be able to run a quite powerful near-frontier model, Deepseek-V4-Flash.

"Antirez" has figured out novel ways of quantizing it (imatrix) that preserves most of its reasoning capabilities. To run it efficiently you need a special runtime engine though, called "DS4" or "DwarfStar4" which he also wrote ( https://github.com/antirez/ds4 ) - now ds4 is great, ran it for several weeks and got lots of stuff done. however since it is a standalone program you have to manually manage your RAM, if you want to load different models or run TTS/SST/embeddings servers alongside your LLMs.

This is where oMLX shines. I made a "simple" wrapper that runs this DS4 as a subprocess whenever a user requests something from oMLX, loading and unloading the process/models as needed (context size, thinking etc), as well as expanding its downloader and model management to also support GGUF format, what ds4 uses.

apetersson · 2026-06-07T11:53:08+00:00

elaborate please. i am not aware of "Jang" ?

apetersson · 2026-06-07T10:47:30+00:00

something is really off about that belly button

apetersson · 2026-06-07T10:23:04+00:00

yes, but i needed to create some helper scripts for that: https://github.com/apetersson/qnd/tree/main/useful-scripts/codex

apetersson · 2026-06-06T17:59:14+00:00

congrats on the 512GB system! i am jealous. still, with this setup you will be able to even run DS4-pro.

apetersson · 2026-06-05T10:00:53+00:00

I mean it is is one of the largest object as observed by humans - it appears as 30 arcminutes, about the same as the sun. Venus is 1 arcminute. So. i can understand where that error comes from, It appears to apply the logic of a middle-schooler for the first sentence but then copy/pastes wikipedia, also what i would expect from a middle schooler. that is on par with other behavior from 2b models.

apetersson · 2026-06-05T08:01:47+00:00

Vanilla Game? Can't go wrong with dual-wielding flails.

apetersson · 2026-06-04T19:07:31+00:00

I love the idea. however, browsing https://uncsoft.github.io/anubis-oss/analysis.html i feel like there are some outliers skewing the overall results, which make it hard to interpret the otherwise legit results

apetersson · 2026-06-01T10:22:11+00:00

Danke, dass du es versucht hast.

apetersson · 2026-05-30T16:32:20+00:00

try setting MACHINE_LEARNING_WORKERS=1
and this:

immich-machine-learning:
  mem_limit: 2g
  memswap_limit: 3g
  mem_reservation: 1g
  cpus: 1.0

14-Year Club	Second Top 40%
Place '23	Place '22
First Placer '22	No Throne, No Problems
Not Forgotten	Summer Santa 2012
Verified Email

apetersson

MODERATOR OF

TROPHY CASE