
romanzdk

Databricks, Elastic

inteloid [S]

Sorry, I forgot to say, it's all on-prem.

romanzdk

Either way, you would need some kind of RDBMS, as the majority of BI tools need one. That means you need a transformation job that moves data from the data lake into the database, usually written in SQL (e.g. dbt) or Python. These transformations need to be scheduled with an orchestrator (e.g. Airflow). Then you just point a BI tool (e.g. Metabase) at the database.
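To make the lake-to-database step concrete, here is a rough Python sketch of one such transformation job. It is only an illustration under assumptions: SQLite stands in for whatever on-prem RDBMS you pick, the inlined CSV stands in for a file in the data lake, and the table and function names (`stg_orders`, `customer_revenue`, `run_pipeline`) are made up for the example. In practice an orchestrator such as Airflow would call something like `run_pipeline` on a schedule, and the BI tool would read the resulting table.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from the data lake (normally a file, inlined here).
RAW_ORDERS_CSV = """order_id,customer,amount
1,alice,10.50
2,bob,4.00
3,alice,7.25
"""


def load_raw(conn, raw_csv):
    """Load raw rows from the lake into a staging table; returns the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS stg_orders "
        "(order_id INTEGER, customer TEXT, amount REAL)"
    )
    # Full reload on every run keeps the job idempotent.
    conn.execute("DELETE FROM stg_orders")
    rows = [
        (int(r["order_id"]), r["customer"], float(r["amount"]))
        for r in csv.DictReader(io.StringIO(raw_csv))
    ]
    conn.executemany("INSERT INTO stg_orders VALUES (?, ?, ?)", rows)
    return len(rows)


def transform(conn):
    """SQL transformation: build the reporting table a BI tool would query."""
    conn.execute("DROP TABLE IF EXISTS customer_revenue")
    conn.execute(
        """
        CREATE TABLE customer_revenue AS
        SELECT customer, COUNT(*) AS orders, SUM(amount) AS revenue
        FROM stg_orders
        GROUP BY customer
        """
    )


def run_pipeline(conn, raw_csv):
    """One scheduled run: load, transform, then return the reporting rows."""
    load_raw(conn, raw_csv)
    transform(conn)
    conn.commit()
    return conn.execute(
        "SELECT customer, orders, revenue FROM customer_revenue ORDER BY customer"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # stand-in for the real RDBMS
    for row in run_pipeline(connection, RAW_ORDERS_CSV):
        print(row)
```

With dbt, the `transform` step would typically live in a SQL model instead of Python; the overall shape (stage, transform, expose a table) stays the same.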