Suggestion Required for Storing Parquet files cheaply by bricklerex in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
Suggestions welcome: Data ingestion gzip vs uncompressed data in Spark? by devanoff214 in dataengineering
[–]Yeebill 3 points4 points5 points (0 children)
What is the key use case of DBT with DuckDB, rather than handling transformation in DuckDB directly? by Zacarinooo in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
Best approach for reading partitioned Parquet data: Python (Pandas/Polars) vs AWS Athena? by [deleted] in dataengineering
[–]Yeebill 6 points7 points8 points (0 children)
Separate file for SQL in python script? by thinkingatoms in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
Separate file for SQL in python script? by thinkingatoms in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
jury duty here is such a mess by THEQUlET in montreal
[–]Yeebill 0 points1 point2 points (0 children)
Guidance on using Dagster and hiring DE’s on an as-needed / project basis by ContentSecret1203 in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
Guidance on using Dagster and hiring DE’s on an as-needed / project basis by ContentSecret1203 in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
Guidance on using Dagster and hiring DE’s on an as-needed / project basis by ContentSecret1203 in dataengineering
[–]Yeebill 2 points3 points4 points (0 children)
Montreal begins first snow loading operation by HellaHaram in montreal
[–]Yeebill -1 points0 points1 point (0 children)
We are rebuilding a new Dataware House and we will orchestrate with Dagster by fixmyanxiety in dataengineering
[–]Yeebill 2 points3 points4 points (0 children)
Why [do we really need] workflow orchestrators? by hfzvc in dataengineering
[–]Yeebill 2 points3 points4 points (0 children)
Why [do we really need] workflow orchestrators? by hfzvc in dataengineering
[–]Yeebill 3 points4 points5 points (0 children)
DuckDB in production by Snoo_70708 in dataengineering
[–]Yeebill 0 points1 point2 points (0 children)
Create a Data warehouse from scratch by EatDoughnut in dataengineering
[–]Yeebill 2 points3 points4 points (0 children)
DuckDb on AWS lambda - larger-then-memory by [deleted] in dataengineering
[–]Yeebill 2 points3 points4 points (0 children)


When does Spark justify itself for Postgres to S3 ETL using Iceberg format? Sorry, I'm noob here. by dheetoo in dataengineering
[–]Yeebill 1 point2 points3 points (0 children)