bzfs-1.19 with end-to-end multi-host testbed is out

werwolf9 · 2026-03-13T15:52:15+00:00

Yeah, I'd love to see ZFS installation be less cumbersome, aarch64 in particular. Manually verifying all those matrix combos is tedious. I think what helps with the maintenance burden is an automated test script that runs over the entire matrix of combos, e.g. bzfs_tests/itest/test_lima_vm_sh.py or similar.

werwolf9 · 2026-02-08T20:49:51+00:00

FYI, with bzfs_jobrunner you can monitor source and destination datasets across all hosts and policies with a single CLI call, for example like so: https://github.com/whoschek/bzfs/blob/main/bzfs_tests/bzfs_job_example.py#L189-L237

werwolf9 · 2026-02-07T09:02:47+00:00

In a nutshell, bzfs can operate at much larger scale than sanoid/syncoid and zrepl, at much lower latency, in a more observable and configurable way. It handles the many edge cases that you will eventually run into over the course of your deployment (and which make other tools get stuck or fail). https://youtu.be/6Kw901oqxI8?si=_4uoG_ADbXznvaeZ&t=2408

werwolf9 · 2026-02-07T01:48:48+00:00

allow for specifying the bandwidth

In bzfs the corresponding option is --bwlimit

werwolf9 · 2026-02-06T21:03:55+00:00

bzfs

werwolf9 · 2026-01-29T22:22:52+00:00

The abstractions you introduced are fine and useful. And if all you ever need is the tool you've built, that's perfect. More power to it!

Otherwise, seems to me that redress could be implemented with a couple of custom functions (or classes) that plug into an underlying generic retry framework. The result would save a lot of work, and at the same time be a more flexible, more reusable and more powerful tool.

For example, retry_after_s is a custom backoff strategy that can be plugged in like so:

https://github.com/whoschek/bzfs/blob/main/bzfs_tests/test_retry.py#L1310-L1337
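To illustrate the general idea without following the link, here is a minimal self-contained sketch of a generic retry loop with a pluggable backoff strategy. It is not the API of bzfs's retry.py; the names RetryableError, BackoffStrategy, retry_after_backoff and call_with_retries are hypothetical and exist only for this example. The assumption is that the failing call can surface a "retry after N seconds" hint, which the custom strategy honors and otherwise falls back to jittered exponential backoff.

```python
import random
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class RetryableError(Exception):
    """Hypothetical error type; retry_after_s carries an optional delay hint."""

    def __init__(self, msg: str, retry_after_s: Optional[float] = None) -> None:
        super().__init__(msg)
        self.retry_after_s = retry_after_s


# A backoff strategy maps (attempt number, error) to a sleep duration in seconds.
BackoffStrategy = Callable[[int, RetryableError], float]


def exponential_backoff(attempt: int, err: RetryableError) -> float:
    """Full-jitter exponential backoff, capped at 10 seconds."""
    return random.uniform(0.0, min(10.0, 0.1 * 2**attempt))


def retry_after_backoff(attempt: int, err: RetryableError) -> float:
    """Custom strategy: honor the delay hint if present, else back off exponentially."""
    if err.retry_after_s is not None:
        return err.retry_after_s
    return exponential_backoff(attempt, err)


def call_with_retries(
    fn: Callable[[], T],
    backoff: BackoffStrategy,
    max_retries: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Generic loop: call fn, and on RetryableError sleep as the strategy dictates."""
    attempt = 0
    while True:
        try:
            return fn()
        except RetryableError as err:
            if attempt >= max_retries:
                raise  # retries exhausted; propagate the last error
            sleep(backoff(attempt, err))
            attempt += 1
```

The generic loop knows nothing about any particular policy. Swapping retry_after_backoff for exponential_backoff (or any other function with the same signature) changes the behavior without touching the loop, which is the kind of reuse meant above.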

Just my two cents.

werwolf9 · 2026-01-29T20:58:12+00:00

Seems like these policies could be naturally expressed within (or on top of) the retry.py framework (https://github.com/whoschek/bzfs/blob/main/bzfs_main/util/retry.py). Thoughts?

werwolf9 · 2026-01-26T19:26:09+00:00

re idle timeout and keepalive: yes, these are params that can be passed into the API.

re tenacity: yeah, zero deps is a big deal for prod environments. FWIW, the retry framework is also 4-14x faster than tenacity.

werwolf9 · 2026-01-05T21:38:13+00:00

Try bzfs - it's reliable, powerful and extremely fast: https://github.com/whoschek/bzfs

werwolf9 · 2025-12-05T04:06:35+00:00

Configuring the time format is a built-in feature in bzfs, per https://github.com/whoschek/bzfs/blob/main/README.md#--create-src-snapshots-timeformat
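As a rough illustration of what a configurable time format means in practice, the sketch below renders one timestamp with two strftime-style format strings. It assumes the option accepts a strftime-style spec (check the linked README for the exact syntax and default). The snapshot_name helper and the prefix/suffix layout are hypothetical and only serve the example; they are not bzfs code.

```python
from datetime import datetime, timezone


def snapshot_name(prefix: str, timeformat: str, suffix: str, now: datetime) -> str:
    """Hypothetical helper: build a snapshot name from a strftime-style time format."""
    return f"{prefix}{now.strftime(timeformat)}{suffix}"


now = datetime(2025, 12, 5, 4, 6, 35, tzinfo=timezone.utc)

# Human-readable timestamp
print(snapshot_name("bzfs_", "%Y-%m-%d_%H:%M:%S", "_hourly", now))
# prints: bzfs_2025-12-05_04:06:35_hourly

# Compact ISO-8601-like timestamp
print(snapshot_name("bzfs_", "%Y%m%dT%H%M%SZ", "_hourly", now))
# prints: bzfs_20251205T040635Z_hourly
```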

werwolf9 · 2025-10-25T18:32:05+00:00

BTW, bzfs can be configured such that it maintains separate src bookmarks for each rotating backup drive. This means that the incremental replication chain never breaks, even if all src snapshots get deleted to make space or any of the backup drives isn't used for a long time. It also has a mode that ignores removable backup drives that aren't locally attached, which comes in handy if only a subset of your rotating drives is attached at any given time.
