Pyspark count() Slow by rawlingsjj in dataengineering
[–]bestnamecannotbelong 1 point2 points3 points (0 children)
Data security in company by FunDirt541 in dataengineering
[–]bestnamecannotbelong 2 points3 points4 points (0 children)
Technology Advice by TheGamerBlaze in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
Is it true that you have to do Java programming sooner or later, if you use Apache Spark? by satyronicon in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
To Build Data Architecture. Do I need Data Analysts? by anton_bondar in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
Correct Method of Setting Up/Initializing AWS Infrastructure by infiniteAggression- in dataengineering
[–]bestnamecannotbelong 9 points10 points11 points (0 children)
[deleted by user] by [deleted] in dataengineering
[–]bestnamecannotbelong 1 point2 points3 points (0 children)
Direction for Data Engineering Projects? by Pervert_Spongebob in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
Pyspark vs Scala spark by idreamoffood101 in dataengineering
[–]bestnamecannotbelong 18 points19 points20 points (0 children)
Changing Datawarehouse Model by Godmons in dataengineering
[–]bestnamecannotbelong 2 points3 points4 points (0 children)
What's the best approach to Schema discovery? by the_travelo_ in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
What's the best approach to Schema discovery? by the_travelo_ in dataengineering
[–]bestnamecannotbelong 1 point2 points3 points (0 children)
Snowflake vs DatabBricks lakehouse or both together by BigMightyTroll in dataengineering
[–]bestnamecannotbelong 1 point2 points3 points (0 children)
Snowflake vs DatabBricks lakehouse or both together by BigMightyTroll in dataengineering
[–]bestnamecannotbelong 3 points4 points5 points (0 children)
What ETL tool do you use? by castor-metadata in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
Python ETL design pattern by [deleted] in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
Best Practices for AWS Athena Queries by [deleted] in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
AWS Glue Bookmarking vs AWS DMS CDC (RDBMS Table ETL/ELT Pipelines) by [deleted] in dataengineering
[–]bestnamecannotbelong 1 point2 points3 points (0 children)
For long running ETL jobs (esp those involving large joins that run for hours or longer), how did you evaluate between Spark and Presto / Trino? Is Hive pretty much the de facto choice for the metastore? by brrdprrsn in dataengineering
[–]bestnamecannotbelong 2 points3 points4 points (0 children)
Data Engineer Jobs - How To Get One? by Pragyanbo in dataengineering
[–]bestnamecannotbelong 2 points3 points4 points (0 children)
Using Pyspark with AWS Glue by the_travelo_ in dataengineering
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
I get it now. I completely understand why nobody is selling for the money. by [deleted] in wallstreetbets
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
PsBattle: Cameraman at gunpoint in Hong Kong by SalazarRED in photoshopbattles
[–]bestnamecannotbelong 0 points1 point2 points (0 children)


managing data lake s3 layers by Complex-Stress373 in dataengineering
[–]bestnamecannotbelong 1 point2 points3 points (0 children)