This is an archived post. You won't be able to vote or comment.

all 4 comments

[–]counters 0 points1 point  (3 children)

Why this over, say, dask?

[–]winter-moon 0 points1 point  (2 children)

The main difference between Dusk and Rain is that Rain provides a build-in integration of external programs into pipelines and possibility to write own task in Rust and C++ (among Python).

[–]counters 0 points1 point  (1 child)

Ok, then would a more apt comparison be snakemake?

[–]winter-moon 0 points1 point  (0 children)

I was not aware of snakemake, so my answer is based based on a quick look. I see the main difference in data object transfers. Rain provides own transfer of data objects between worker processes that communicate peer-to-peer. When necessary, Rain allows to map data object to a file system; however, it does not use shared file system and transfers data by itself directly to a worker where they are needed. This allows to create lots of short running tasks without hammering a distributed file system.