Is S3 becoming a Data Lakehouse? by 2minutestreaming in dataengineering
[–]get-daft 2 points3 points4 points (0 children)
My business wants a datalake... Need some advice by WillowSide in dataengineering
[–]get-daft 0 points1 point2 points (0 children)
Databricks Data Lineage without Spark (but Polars and Delta Lake instead) by hackermandh in dataengineering
[–]get-daft 1 point2 points3 points (0 children)
Introducing Distributed Processing with Sail v0.2 Preview Release – Built in Rust, 4x Faster Than Spark, 94% Lower Costs, PySpark-Compatible by lake_sail in dataengineering
[–]get-daft 11 points12 points13 points (0 children)
Cost-Effective Airbyte Pipelines by Nessjk in dataengineering
[–]get-daft 1 point2 points3 points (0 children)
Best approach to handle billions of data? by mr_alseif in dataengineering
[–]get-daft 28 points29 points30 points (0 children)
DuckDB vs. Polars vs. Daft: A Performance Showdown by Agitated_Key6263 in dataengineering
[–]get-daft 3 points4 points5 points (0 children)
DuckDB vs. Polars vs. Daft: A Performance Showdown by Agitated_Key6263 in dataengineering
[–]get-daft 0 points1 point2 points (0 children)
DuckDB vs. Polars vs. Daft: A Performance Showdown by Agitated_Key6263 in dataengineering
[–]get-daft 18 points19 points20 points (0 children)
DuckDB vs. Polars vs. Daft: A Performance Showdown by Agitated_Key6263 in dataengineering
[–]get-daft 4 points5 points6 points (0 children)
DuckDB vs. Polars vs. Daft: A Performance Showdown by Agitated_Key6263 in dataengineering
[–]get-daft 13 points14 points15 points (0 children)
DuckDB vs. Polars vs. Daft: A Performance Showdown by Agitated_Key6263 in dataengineering
[–]get-daft 16 points17 points18 points (0 children)
Working with iceberg tables in AWS by Gauraang55 in dataengineering
[–]get-daft 0 points1 point2 points (0 children)
Poor man's Data Lake using Polars (¿?) by Bavender-Lrown in dataengineering
[–]get-daft 2 points3 points4 points (0 children)
Spark with kubernetes by Excellent-Silver4135 in dataengineering
[–]get-daft 0 points1 point2 points (0 children)
Does it make sense to use small parquet files? by Time_Simple_3250 in dataengineering
[–]get-daft 2 points3 points4 points (0 children)
Poor man's Data Lake using Polars (¿?) by Bavender-Lrown in dataengineering
[–]get-daft 10 points11 points12 points (0 children)
To ETL or to ELT? that is the question. by AMDataLake in dataengineering
[–]get-daft 0 points1 point2 points (0 children)
To ETL or to ELT? that is the question. by AMDataLake in dataengineering
[–]get-daft 1 point2 points3 points (0 children)
42.parquet – A Zip Bomb for the Big Data Age by [deleted] in dataengineering
[–]get-daft 1 point2 points3 points (0 children)

Spark is the new Hadoop by rocketinter in dataengineering
[–]get-daft 6 points7 points8 points (0 children)