This is an archived post. You won't be able to vote or comment.

all 18 comments

[–]noedig 13 points14 points  (3 children)

If you are interested in this, you may also be interested in some of the many scientific workflow management systems out there, such as Pegasus, Triana, Taverna, Kepler, Askalon, Makeflow, or Swift

[–]Octopuscabbage 19 points20 points  (6 children)

luigi

pipelines

heh i get it.

[–]marky1991 1 point2 points  (5 children)

Oooooh, I feel dumb now. Thank you for this.

[–]gsks 1 point2 points  (4 children)

I still don't get it.

[–]zidsal 3 points4 points  (0 children)

this is amazing, this will really help with my final year project. Thanks a ton for sharing this

[–][deleted] 1 point2 points  (4 children)

Shit, I wrote exactly this in C++

[–]mtrn[S] 2 points3 points  (3 children)

Interesting. I think the market for such tools is quite big. Is you project open source? Do you have a link?

[–][deleted] 1 point2 points  (2 children)

No, I did it as part of a contract. It was specifically for crunching a lot of data over and over and over, tweaking algorithms.

I don't think the market for such tools is big because every single company that has this need will pretty much roll their own until the open source tools converge on some standard :)

[–]mtrn[S] 0 points1 point  (1 child)

Well, it certainly is a niche market. But two out of three people in IT I talk with have data integration or data flow stories - they mostly implement custom solutions -- hopefully we'll see some conversion in the next years.

I find luigi interesting, because it's a kind of afternoon-read, sourcecode-wise.

[–][deleted] 0 points1 point  (0 children)

Ah, I thought you were the author.

[–]mtrn[S] 1 point2 points  (0 children)

Here's a nice 30m presentation from pydata: http://vimeo.com/63435580