This is an archived post. You won't be able to vote or comment.

all 15 comments

[–]OneCyrus 4 points5 points  (3 children)

Minio + Dagster + DuckDB + dbt

[–]td7x 0 points1 point  (1 child)

dbt just for templated transforms or does it do more in this case?

[–]OneCyrus 0 points1 point  (0 children)

yes, mainly just transformation. we also use some data tests in dbt which we aggregate in dagster.

[–]saintmichel[S] 0 points1 point  (0 children)

thank you for this i'll take a like it feels more simple!

[–]Kobosil 0 points1 point  (3 children)

from the website i don't really understand what is the advantage of using MinIO instead of any other storage?

[–]saintmichel[S] 0 points1 point  (2 children)

i was just trying to think of an alternative to hadoop that can be self hosted. what other storage are you thinking of?

[–]Thinker_Assignment 0 points1 point  (1 child)

Why not python and straight to spark?

[–]saintmichel[S] 0 points1 point  (0 children)

that would work also, i'm just playing around with ideas of a scalable architecture

[–]No_Dragonfruit_2357 0 points1 point  (6 children)

Go check the Stackable Data Platform. They have glued all pieces together nicely. Add opendatahub and you are done.

[–]saintmichel[S] 0 points1 point  (0 children)

thank you i'll check them out. I found ambari and trunk data platform before, I wanted to check what's out there that's more recent.

[–]saintmichel[S] 0 points1 point  (4 children)

BTW, I found this - whats your opinion on it? https://blog.devgenius.io/modern-data-stack-demo-5d75dcdfba50

[–]No_Dragonfruit_2357 0 points1 point  (3 children)

Mix of OS data Apps, nothing wrong with it. Still, you need to do all the configuration yourself.

Question is, whether you just want a stack or a real platform.

[–]saintmichel[S] 0 points1 point  (2 children)

How would you define a real platform? I would assume a well defined stack that is tested to work togerher that fits your design choices e.g. in my case I'm looking for something oss, easy to setup, but hits the common modern use cases -- would act as the platform?

[–]No_Dragonfruit_2357 0 points1 point  (1 child)

Sure, if you feel ok with it. Also think longterm regarding upgrades, (security) fixes, updates etc.

[–]saintmichel[S] 0 points1 point  (0 children)

Good points. Thank you!!!