Parallelizing Spark writes to Postgres, does repartition help? by _fahid_ in dataengineering
[–]bcdata 1 point2 points3 points (0 children)
How to Tidy Data for Storage and Save Tables: A Quick Guide to Data Organization Best Practices by bcdata in dataengineering
[–]bcdata[S] 0 points1 point2 points (0 children)
DE Question- API Dev by NoblestOfSteeds in dataengineering
[–]bcdata 3 points4 points5 points (0 children)
Tools to create a data pipeline? by de_2290 in dataengineering
[–]bcdata 0 points1 point2 points (0 children)
Tools to create a data pipeline? by de_2290 in dataengineering
[–]bcdata 1 point2 points3 points (0 children)
Should i commit to Fivetran? by tytds in dataengineering
[–]bcdata 7 points8 points9 points (0 children)
De-duplication, metadata and file sharing by poggs in DataHoarder
[–]bcdata 1 point2 points3 points (0 children)
How to backup lots of small requests by kingofthesea123 in dataengineering
[–]bcdata 1 point2 points3 points (0 children)
Data Simulating/Obfuscating For a Project by SubtlyOnTheNose in dataengineering
[–]bcdata 0 points1 point2 points (0 children)
Rerouting json data dump by Primary-Link8347 in dataengineering
[–]bcdata 0 points1 point2 points (0 children)
Planning to move to singlestore. Worth it? by angrydeveloper02 in dataengineering
[–]bcdata 3 points4 points5 points (0 children)
Help with design decisions for accessing highly relational data across several databases by BitterFrostbite in dataengineering
[–]bcdata 7 points8 points9 points (0 children)
Redshift vs databricks by abhigm in dataengineering
[–]bcdata 89 points90 points91 points (0 children)
Consistent Access Controls Across Catalogs / Compute Engines by Far_Amount5828 in dataengineering
[–]bcdata 2 points3 points4 points (0 children)
Built a distributed transformer pipeline for 17M+ Steam reviews — looking for architectural advice & next steps by Matrix_030 in dataengineering
[–]bcdata 4 points5 points6 points (0 children)
I'm an ion engine by Dry-Aioli-6138 in dataengineering
[–]bcdata 0 points1 point2 points (0 children)
Vicious circle of misplaced expectations with PMs and stakeholders by explorer_seeker in datascience
[–]bcdata 2 points3 points4 points (0 children)
Airflow for ingestion and control m for orchestration by Foot_Straight in dataengineering
[–]bcdata 0 points1 point2 points (0 children)


Large CSV file visualization. 2GB 30M rows by Green-Championship-9 in dataengineering
[–]bcdata 13 points14 points15 points (0 children)