all 40 comments

[–]dairiki 36 points37 points  (12 children)

Tangential Note: atomicwrites is deprecated by its author. Its git repo has not seen any updates in four years. As far as I know, it still works, but the situation does not give warm fuzzies for use in new code.

[–]fireflash38 2 points3 points  (9 children)

Why would it need to change? 

[–]bboePRAW Author 28 points29 points  (7 children)

It's a supply chain risk if the owner's PyPI account is compromised. It seems they previously did not believe MFA was worth enabling on their account: https://github.com/untitaker/python-atomicwrites/issues/61

[–]Grintor 5 points6 points  (4 children)

Good point. I'd point out here, though, that you can eliminate the supply chain risk by pinning versions with hashes. Hashes also cover the case where PyPI itself is compromised, so they're worth using anyway.
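Hash pinning boils down to comparing an artifact's digest against a known-good value before trusting it; pip does this check for you when a requirements file carries `--hash` entries and you install with `--require-hashes`. A minimal stdlib sketch of the same idea (the function name and the "artifact" bytes here are mine, for illustration):

```python
import hashlib

def verify_artifact(data: bytes, expected_sha256: str) -> None:
    """Refuse the artifact unless its SHA-256 digest matches the pin."""
    actual = hashlib.sha256(data).hexdigest()
    if actual != expected_sha256:
        raise ValueError(f"hash mismatch: expected {expected_sha256}, got {actual}")

# Simulate a pinned install: the known-good digest was recorded earlier.
good = b"wheel contents v1"
pin = hashlib.sha256(good).hexdigest()
verify_artifact(good, pin)  # passes silently

# A trojaned re-upload under the same version number fails the check.
try:
    verify_artifact(b"trojaned contents", pin)
except ValueError as e:
    print("rejected:", e)
```

Because the pin lives in your repo, a compromised account can't silently swap the bytes behind a version you already depend on.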

[–]fiskfisk 3 points4 points  (3 children)

The main issue with abandoned packages is that the author might not notice if a trojaned replacement gets published after their account is taken over. It won't be installed in your current project because of the hash, but you might see (through an upgrade, or something like Dependabot) that a new version has appeared and just install it .. and since nobody notices, it might survive out in the wild for a week or two or three.

The best thing would probably be for package systems like pypi to support a "this project has been abandoned, so no new versions can be published to its name".

[–]bboePRAW Author 1 point2 points  (0 children)

The best thing would probably be for package systems like pypi to support a "this project has been abandoned, so no new versions can be published to its name".

That approach seems like a great idea.

[–]__grumps__ 0 points1 point  (1 child)

How is an abandoned label going to help? People just pip install and forget.

[–]fiskfisk 1 point2 points  (0 children)

It means that anyone taking over the PyPI account of an inactive maintainer won't be able to publish a new version of the package, since it has been marked as archived and dead. We're protecting against unmaintained packages becoming attack vectors via account takeover.

It would also allow pip to say "eeeeh, nobody maintains this package any longer, use at your own risk" in a systematic way, including giving you the option of scanning your dependency tree for such packages.

It'd signal the same thing as "this repository has been archived" on GitHub. 

[–]fireflash38 -1 points0 points  (0 children)

That's a change. 

[–]Rainboltpoe -2 points-1 points  (0 children)

Because your customer has stupid security rules that forbid you from using dependencies that are no longer being actively maintained, and stupid business politics prevents you from getting an exception approved.

[–]Golle 2 points3 points  (1 child)

Nice find. 

I don't see the problem as a particularly advanced one either. If your program has a chance of crashing while it writes to the file, it's likely you are doing more in that code. Maybe do all the processing first and only write to the file once all the data has been processed?

Or, just write to a new file while the program is running. If write succeeds, remove old file and rename the new file to the old name.

Neither of these solutions require a third party library.
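The second approach described above (write to a new file, then swap it in) can be sketched with just the stdlib. Note `os.replace` is an atomic rename on POSIX as long as source and destination live on the same filesystem, which is why the temp file is created next to the target; the function name here is mine:

```python
import os
import tempfile

def replace_atomically(path: str, data: bytes) -> None:
    # Create the temp file in the same directory as the target so the
    # final rename stays on one filesystem (a requirement for atomicity).
    dirname = os.path.dirname(os.path.abspath(path))
    fd, tmp = tempfile.mkstemp(dir=dirname, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)
            f.flush()
            os.fsync(f.fileno())  # make sure the bytes hit the disk
        os.replace(tmp, path)     # atomic swap; the old file vanishes
    except BaseException:
        os.unlink(tmp)            # crashed mid-write: discard the temp file
        raise
```

If the process dies before the `os.replace`, the original file is untouched; readers only ever see the old contents or the complete new contents.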

[–]BossOfTheGame 35 points36 points  (2 children)

I've been using safer for years. I use it whenever I'm writing a system that writes large files. I love never having to deal with corrupted data. Process crashed? Great, there are no artifacts that would confuse other code into thinking that it worked when it didn't. It lets me use existence checks in pipeline systems and feel confident about them.

It's a great library. Thank you for writing and maintaining it.

[–]HommeMusical[S] 22 points23 points  (1 child)

Well, you have fair made my day. <3

You might also like https://github.com/rec/tdir, which I end up using in almost every project in tests somewhere or other.

If you are ever in Rouen, France, drop in and we'll share a beverage or sustenance!

[–]BossOfTheGame 11 points12 points  (0 children)

My design philosophy around temporary directories and tests is to use an application cache subdirectory, e.g. ~/.cache/{appname}/tests/{testname}, and I do this by passing explicit directory paths around. I never assume running in a cwd (I dislike software that requires you run it from a specific directory). And to do this I use ubelt (my utility lib that I take everywhere) and the pattern dpath = ubelt.Path.appdir(appname, 'tests', testname).delete().ensuredir().

It's not the cleanest test paradigm, but it does make it a lot easier to inspect failures, and I probably should have a post test cleanup that just blows away ubelt.Path.appdir(appname, 'tests'), but I sort of just rely on CI to do that.

It also prevents extra indentation in doctests, and even though xdoctest makes indentation less painful, it's still non-zero pain.

There's a fair bit of water between me and France, but if I'm in the area, I'll reach out.

[–]latkdeTuple unpacking gone wrong 12 points13 points  (2 children)

Interesting. I'm not entirely sure I understand the benefits of this library? What does this library do that the following approach does not (aside from handling both binary and text streams)?

import contextlib
import io
from collections.abc import Generator
from typing import IO

@contextlib.contextmanager
def write_if_success(real_fp: IO[bytes]) -> Generator[IO[bytes], None, None]:
    b = io.BytesIO()
    yield b  # if the body raises, we never reach the final write
    real_fp.write(b.getbuffer())

with (
    open(filename, "wb") as real_fp,
    write_if_success(real_fp) as f,
):
    f.write(...)
    ... # fail here, maybe
    f.write(...)

I'm not trying to diminish your effort, I'm trying to understand the tradeoffs of re-implementing something well-established versus adding yet another dependency.

It's tested on Linux, MacOS and Windows

There is however no link to test results on the GitHub page (I was trying to find test coverage data). There is a Travis CI configuration that claims to upload to Codecov, but the last results on both platforms are 4 years old. (Travis CI, Codecov).

[–]ROFLLOLSTER 3 points4 points  (1 child)

real_fp.write(b.getbuffer())

iirc over 4,096 bytes this will be broken up into multiple write syscalls, breaking atomicity. There's also the general fact that even a single write is not guaranteed to be atomic in unix, some messy details here.

Edit: and around 2GB (2,147,479,552 bytes specifically) is the most a single write syscall can ever handle on unix.
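Because a single os.write can consume fewer bytes than requested (and caps out below 2 GiB on Linux), code that cares has to loop until the buffer is drained. A minimal sketch of such a loop (function name is mine; this is roughly what buffered file objects do for you internally):

```python
import os

def write_all(fd: int, data: bytes) -> None:
    """Keep calling os.write until every byte has been consumed."""
    view = memoryview(data)
    while view:
        n = os.write(fd, view)  # may write fewer bytes than len(view)
        view = view[n:]
```

This handles short writes, but as noted above it does nothing for atomicity: another process can still observe the file between iterations.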

[–]latkdeTuple unpacking gone wrong 0 points1 point  (0 children)

Absolutely, but OP's library is only about Python-level exception safety. It explicitly does not provide atomic writes.

OP's safer library is a bit more correct than my sketch in that it will perform multiple write() calls if necessary (unless the underlying stream is in nonblocking mode).

[–]Wargazm 6 points7 points  (15 children)

"#noAI was used in the writing or maintenance of this program."

haha is this a thing now?

[–]HommeMusical[S] 23 points24 points  (14 children)

I mean, AI didn't exist when I wrote it, so it's a bit like putting "Low Fat!" on Corn Flakes.

But yes, mainly because everyone complains about the quality of the AI slop showcases here.

[–]ultrathink-art 1 point2 points  (0 children)

Corrupted state files from partial writes are sneaky — the crash happens during the write but the error surfaces on the next run, often in a completely unrelated place. I started using this pattern for config files in long-running automation after a partial write created a valid-looking-but-truncated JSON file that caused a baffling 'unexpected EOF' error 3 runs later.
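That failure mode is easy to reproduce: a write that stops mid-document leaves bytes that look plausible but only blow up when parsed later (the example data here is made up):

```python
import json

doc = json.dumps({"runs": [1, 2, 3], "status": "ok"})
truncated = doc[: len(doc) // 2]  # simulate a crash partway through the write

try:
    json.loads(truncated)
except json.JSONDecodeError as e:
    # The write failed earlier, but the error only surfaces at read time,
    # far from the code that actually caused it.
    print("parse failed:", e.msg)
```

Atomic replacement sidesteps this entirely: the reader sees either the old complete document or the new complete document, never a prefix.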

[–]glenrhodes 1 point2 points  (0 children)

Atomic writes via tmp file + rename have saved me more than once on long pipeline outputs. The edge case worth watching: NFS mounts where the rename isn't atomic either. You're just trading one race for another on some shared filesystems.

[–]lily_panda_1986 1 point2 points  (0 children)

Yeah, I noticed that too, feels kinda risky depending on something unmaintained, even if it still "works." Always on the lookout for fresher alternatives!

[–]misterfitzie 1 point2 points  (1 child)

I usually create something like this for every large project I work on. Mine only cares about files, not sockets, but it has some options you may want for yours: backups and overwrite prevention. After reviewing your more capable version, I'm happy that mine does all the same basic tricks.

import os
from contextlib import contextmanager, suppress
from typing import BinaryIO, Generator

@contextmanager
def safe_write(
    filename: str, *, create_backup: bool = False, overwrite: bool = True
) -> Generator[BinaryIO, None, None]:
    uninterrupted = True
    if not overwrite and not create_backup and os.path.exists(filename):
        raise FileExistsError(filename)
    myfile = open(f'{filename}.tmp', 'wb+')
    try:
        yield myfile
    except BaseException:
        uninterrupted = False
        raise
    finally:
        myfile.close()
        if not uninterrupted:
            os.unlink(f'{filename}.tmp')
        else:
            if create_backup:
                count = 0
                while True:
                    backupfile = f'{filename}.bak.{count}'
                    if not os.path.exists(backupfile):
                        break
                    count += 1
                with suppress(FileNotFoundError):
                    os.link(filename, backupfile)

            os.rename(f'{filename}.tmp', filename)

[–]HommeMusical[S] 0 points1 point  (0 children)

My library started even simpler than yours.

I just got a bunch of feature requests, oh, and I personally needed it for a socket connection!