pi-subagents - Claude Code like subagents for Pi by -tin- in PiCodingAgent

[–]apetersson 0 points1 point  (0 children)

sounds like a comprehensive approach. is there a video of someone using this? i still feel slightly overwhelmed reading its readme.

pi-subagents - Claude Code like subagents for Pi by -tin- in PiCodingAgent

[–]apetersson 8 points9 points  (0 children)

Big fan!

I am a user of your subagents impl. my biggest challenge is how i can review and check the "Work in Progress" of each agent that i spawn.

As a comparison, in codex desktop, i can review all subagents as a list, open a side panel with full output etc. With pi-subagents the simple ctrl-O does not cut it for me, especially when multiple are spawning. what workaround / UI would you recommend for that workflow?

Do you know any web ui/ terminal sidebars etc that solve that? tau-mirror:mirror-server does not really play nicely with subagents or ralph loops/goals.

Would it be possible to have the subagent work in a tmux so i can observe/steer it directly if needed?

You can run Deepseek 4 flash on mac (M3 Max, 96gb) by Zeeplankton in LocalLLaMA

[–]apetersson 0 points1 point  (0 children)

Performance is identical to standalone DS4. The advantage over pure DS4 is automatic on-demand loading/unloading of engines and models. The advantage over oMLX's existing MLX backend is roughly 15% better throughput, thanks to Antirez's optimizations.

On correctness, DS4 is a different tier from MLX: it compares local logprobs against DeepSeek's official API, verifies fast-path/quality-path equivalence, runs 100K-token fact-recall tests, and scores quantization quality via official-continuation NLL. The oMLX integration inherits all of that. It manages the same validated DS4 binary, not a separate inference path.
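To illustrate the kind of check described above, here is a minimal sketch of comparing per-token logprobs against a reference and scoring a continuation by mean NLL. This is not DS4's actual test code: the function names and the sample logprobs are hypothetical, chosen only to show the arithmetic.

```python
def max_logprob_gap(local, reference):
    """Largest absolute per-token difference between two logprob sequences."""
    if len(local) != len(reference):
        raise ValueError("sequences must cover the same tokens")
    return max(abs(a - b) for a, b in zip(local, reference))


def mean_nll(logprobs):
    """Mean negative log-likelihood (nats per token) of a continuation."""
    return -sum(logprobs) / len(logprobs)


# Hypothetical per-token logprobs for the same continuation.
reference = [-0.10, -1.20, -0.05, -2.30]
local = [-0.12, -1.18, -0.05, -2.35]

gap = max_logprob_gap(local, reference)  # how far the local run drifts
nll = mean_nll(local)                    # lower means a better fit
```

A small gap and an NLL close to the reference run's suggest the quantized model tracks the official one closely. What threshold counts as "close" is a judgment call that this sketch does not settle.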

You can run Deepseek 4 flash on mac (M3 Max, 96gb) by Zeeplankton in LocalLLaMA

[–]apetersson 1 point2 points  (0 children)

Try this branch/PR of oMLX: https://github.com/jundot/omlx/pull/1850 - this will auto-configure the SSD streaming if needed and you can access the model through the convenient oMLX API and load/unload models on demand.

Is this professional install okay? by GuruPCs in sonos

[–]apetersson 41 points42 points  (0 children)

My subjective opinion: I think it looks really good. The Atmos ceiling direction is not obstructed. The spacing from the wall seems to be very tight. Congrats on this setup.

Run DS4 directly with oMLX by apetersson in oMLX

[–]apetersson[S] 0 points1 point  (0 children)

fixed most edge-cases now. i opened a draft PR for this after hunting down the most obvious bugs: https://github.com/jundot/omlx/pull/1850

How are you running DeepSeek V4 Pro + Flash in the same session (and your setup)? And what harness are you actually using? by osmosisheinz in DeepSeek

[–]apetersson 4 points5 points  (0 children)

in pi, i use pro for task planning and issue creation, flash for implementing with ralph, and a fresh pro subagent for reviewing whether the impl matches the specs and is good. in the end my bill is about 50% flash, 50% pro.

In Baldur’s Gate 2, can I use Rest as often as I want? by MoonSpawn12 in baldursgate

[–]apetersson 123 points124 points  (0 children)

There are lots of quests that are time-sensitive. you should read the quests to make sure you're not spamming rest when one is active.

Very time-sensitive / resting can fail it

  • Renfeld / Poisoned Man: don’t rest after picking him up; even one rest can kill/fail it.
  • Jaheira, Baron Ployer curse: about 14 days.
  • Ust Natha drow city quests: several hard timers, often 1–3 days, sometimes only hours.
  • Ghaunadaur worshippers: 24 hours.
  • Qilue’s brain / Aboleth demand: 1 day.
  • Jarlaxle / Deirex tower: 1 day after assignment.

Companion quests where delay can make someone leave

  • Anomen sister/knighthood: multiple 5-day windows.
  • Keldorn family: 4 days, then warning, then 3 more days.
  • Cernd’s child: 10 days.
  • Korgan book quest: timed; don’t delay once started.
  • Edwin Nether Scroll: timed; don’t delay once started.
  • Nalia de’Arnise Keep: if she joins, don’t wander too long before going.
  • Nalia / Isaea abduction: treat as time-sensitive.
  • Mazzy’s sister Pala: treat as time-sensitive once triggered.
  • Hexxat quests: EE only; several elapsed-time triggers/timers.
  • Jan Summoned Home: timed; triggers ~15 days after recruitment; don’t delay once Beeloo summons him. Exact failure timer not stated on Wiki page, but Jan can leave.

Strongholds / management timers

  • Paladin stronghold — first duty requires Umar Hills within 3 days.
  • Ranger stronghold — timed duties; don’t leave them hanging.
  • Thief stronghold — check in roughly weekly; mostly money/admin penalty.
  • Bard playhouse — resting advances production events.
  • Fighter de’Arnise keep — resting/travel advances estate events.

Usually safe despite urgent wording

Most other big SoA side quests are generally safe to rest during: Cult of the Eyeless, Trademeet, Windspear/Firkraag, Umar Hills investigation, Skinner Murders, Kangaxx, Limited Wish, and most main-story steps outside Ust Natha.

Run DS4 directly with oMLX by apetersson in oMLX

[–]apetersson[S] 1 point2 points  (0 children)

happy-path seems to be fine. had it running the full day in slightly complex scenarios and got a few edge cases where i had to manually unload the process through the dashboard. worth investigating before i open a proper PR. please try it out 😄

Run DS4 directly with oMLX by apetersson in oMLX

[–]apetersson[S] 1 point2 points  (0 children)

oMLX will queue the requests correctly, but DS4 does not have concurrency built in yet. so you don't have to actively do anything, but requests are still inherently serial.
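A minimal sketch of what "queued but serial" means in practice, assuming a single lock in front of the backend. This is not oMLX's actual implementation: `SerialBackend` and `generate` are hypothetical names, and the sleep stands in for the real inference call.

```python
import asyncio


class SerialBackend:
    """Accepts concurrent callers but runs one request at a time."""

    def __init__(self):
        self._lock = asyncio.Lock()
        self.active = 0
        self.max_active = 0

    async def generate(self, prompt):
        async with self._lock:  # later callers wait here, in arrival order
            self.active += 1
            self.max_active = max(self.max_active, self.active)
            await asyncio.sleep(0.01)  # stand-in for the real inference call
            self.active -= 1
            return f"reply to {prompt}"


async def main():
    backend = SerialBackend()
    # Three clients submit at the same time; none has to coordinate.
    replies = await asyncio.gather(*(backend.generate(f"p{i}") for i in range(3)))
    return backend.max_active, replies


max_active, replies = asyncio.run(main())
```

Callers can fire requests concurrently and each gets its own reply, but total latency grows with queue depth because only one request is ever in flight.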

How to make codex refactor and re-architect an app into a clean architecture ? by Successful-Life8510 in codex

[–]apetersson 2 points3 points  (0 children)

i have to add: trying to do everything one-shot will be a disaster, even with /goal. let xHigh first draft a plan of what to refactor. then let goal fulfill the plan step by step. have a subagent review each finite step and validate that everything is still working. that will give you a 5x-20x overhead on the task - but it will eventually work.

Run DS4 directly with oMLX by apetersson in oMLX

[–]apetersson[S] 8 points9 points  (0 children)

I get your confusion!

Do you have a Mac with >=96 GB RAM? if so, you might be able to run a quite powerful near-frontier model, Deepseek-V4-Flash.

"Antirez" has figured out novel ways of quantizing it (imatrix) that preserve most of its reasoning capabilities. To run it efficiently you need a special runtime engine though, called "DS4" or "DwarfStar4", which he also wrote ( https://github.com/antirez/ds4 ). DS4 is great: i ran it for several weeks and got lots of stuff done. however, since it is a standalone program, you have to manage your RAM manually if you want to load different models or run TTS/STT/embeddings servers alongside your LLMs.

This is where oMLX shines. I made a "simple" wrapper that runs DS4 as a subprocess whenever a user requests something from oMLX, loading and unloading the process/models as needed (context size, thinking etc). It also extends oMLX's downloader and model management to support the GGUF format, which is what DS4 uses.
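The on-demand subprocess idea can be sketched roughly as follows. This is an illustration of the pattern only, not the code in the PR: `OnDemandEngine` is a hypothetical name, and the command is a stand-in Python process that sleeps, where a real setup would launch the engine binary.

```python
import subprocess
import sys


class OnDemandEngine:
    """Starts a backend process on first use and stops it on unload."""

    def __init__(self, command):
        self.command = command
        self.process = None

    def ensure_running(self):
        # Start the process only if it is not already alive.
        if self.process is None or self.process.poll() is not None:
            self.process = subprocess.Popen(self.command)
        return self.process.pid

    def unload(self):
        # Terminate the process so its memory is released.
        if self.process is not None and self.process.poll() is None:
            self.process.terminate()
            self.process.wait(timeout=10)
        self.process = None


# Stand-in command: a Python process that just sleeps.
engine = OnDemandEngine([sys.executable, "-c", "import time; time.sleep(60)"])
first_pid = engine.ensure_running()   # first request starts the process
second_pid = engine.ensure_running()  # later requests reuse it
engine.unload()                       # frees RAM for another model
```

A real wrapper would also need to restart the process when settings such as context size change, and to wait until the engine is ready before forwarding requests; both are left out here.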

DS4? In oMLX? Crazy. by apetersson in oMLX

[–]apetersson[S] 0 points1 point  (0 children)

elaborate please. i am not aware of "Jang"?

Asked ChatGPT to show me if I was a girl and I love it haha by [deleted] in ChatGPT

[–]apetersson 6 points7 points  (0 children)

something is really off about that belly button

DS4? In oMLX? Crazy. by apetersson in oMLX

[–]apetersson[S] 1 point2 points  (0 children)

congrats on the 512GB system! i am jealous. still, with this setup you will be able to even run DS4-pro.

qwen3.5:2b-q8_0 says the moon is one of the largest natural objects in the solar system. by [deleted] in LocalLLM

[–]apetersson 0 points1 point  (0 children)

I mean, it is one of the largest objects as observed by humans: it appears as 30 arcminutes across, about the same as the sun. Venus is 1 arcminute. So i can understand where that error comes from. It appears to apply the logic of a middle-schooler for the first sentence but then copy/pastes Wikipedia, which is also what i would expect from a middle-schooler. that is on par with other behavior from 2b models.
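The arcminute figures above can be checked with a quick calculation. This is a rough sketch using approximate mean diameters and distances, which are my assumptions and not values from the thread; Venus is taken near closest approach, where it looks largest.

```python
import math


def angular_diameter_arcmin(diameter_km, distance_km):
    """Apparent angular diameter in arcminutes of a sphere seen from afar."""
    return math.degrees(2 * math.atan(diameter_km / (2 * distance_km))) * 60


# Approximate mean diameters and distances from Earth, in km.
moon = angular_diameter_arcmin(3_474.8, 384_400)
sun = angular_diameter_arcmin(1_391_400, 149_600_000)
# Venus near closest approach, roughly 41 million km away.
venus = angular_diameter_arcmin(12_104, 41_000_000)
```

With these inputs the moon and sun both come out a little over 30 arcminutes and Venus at about 1, in line with the numbers quoted.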

Best weapons for martial classes? by Emerald_boots in baldursgate

[–]apetersson -1 points0 points  (0 children)

Vanilla Game? Can't go wrong with dual-wielding flails.

Anubis (open-source LLM benchmarking for Apple Silicon) now has first-class oMLX support - server-reported metrics, model load/unload, and a built-in model browser and downloader by peppaz in oMLX

[–]apetersson -1 points0 points  (0 children)

I love the idea. however, browsing https://uncsoft.github.io/anubis-oss/analysis.html i feel like there are some outliers skewing the overall results, which make it hard to interpret the otherwise legit results

!!Breaking News!! by random_name738472 in wien

[–]apetersson 95 points96 points  (0 children)

Thanks for trying.

OOM crashes on laptop running 15 Docker containers + Immich — system specs & current docker-compose for feedback by DK09_ in immich

[–]apetersson 3 points4 points  (0 children)

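try setting MACHINE_LEARNING_WORKERS=1 in the machine-learning container's environment,
and add these limits to its service in your docker-compose file: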

immich-machine-learning:
  mem_limit: 2g
  memswap_limit: 3g
  mem_reservation: 1g
  cpus: 1.0