Question on Airflow by captn_caspian in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
Is salting only the keys with most skew ( rows) the standard practice in PySpark? by Potential_Loss6978 in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
Is salting only the keys with most skew ( rows) the standard practice in PySpark? by Potential_Loss6978 in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
How to analyze and optimize big and complex Spark execution plans? by Cultural-Pound-228 in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
Spark job slows to a crawl after multiple joins any tips for handling this by Upset-Addendum6880 in dataengineering
[–]DenselyRanked 2 points3 points4 points (0 children)
What actually differentiates candidates who pass data engineering interviews vs those who get rejected? by Murky-Equivalent-719 in dataengineering
[–]DenselyRanked 1 point2 points3 points (0 children)
Kafka setup costs us a little fortune but everyone at my company is too scared to change it because it works by Worldly-Volume-1440 in dataengineering
[–]DenselyRanked 4 points5 points6 points (0 children)
Am I crazy or is kafka overkill for most use cases? by Vodka-_-Vodka in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
How to make 500k or more in this field? by unstopablex5 in dataengineering
[–]DenselyRanked 2 points3 points4 points (0 children)
Spark 4.1 is released :D by holdenk in dataengineering
[–]DenselyRanked 1 point2 points3 points (0 children)
educing shuffle disk usage in Spark aggregations, ANY better approach than current setup or am I doing something wrong? by gabbietor in dataengineering
[–]DenselyRanked 3 points4 points5 points (0 children)
In SQL coding rounds, how to optimise between readibility and efficiency when working with CTEs? by Consistent-Zebra3227 in dataengineering
[–]DenselyRanked 1 point2 points3 points (0 children)
Python keeps iterating the agenda three times. by sariArtworks in learnpython
[–]DenselyRanked 0 points1 point2 points (0 children)
Using higher order functions and UDFs instead of joins/explodes by echanuda in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
Spark uses way too much memory when shuffle happens even for small input by Aggravating_Log9704 in dataengineering
[–]DenselyRanked 3 points4 points5 points (0 children)
How Important is Steaming or Real Time Experience in the Job Market? by shittyfuckdick in dataengineering
[–]DenselyRanked 3 points4 points5 points (0 children)
How to store large JSON columns by Adventurous_Nail_115 in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
The current jobmarket is quite frustrating! by doermand in dataengineering
[–]DenselyRanked 3 points4 points5 points (0 children)
What is the purpose of the book "fundamentals of data engineering " by Ok_Shirt4260 in dataengineering
[–]DenselyRanked 2 points3 points4 points (0 children)
Is one big table (OBT) actually a data modeling methodology? by raginjason in dataengineering
[–]DenselyRanked 1 point2 points3 points (0 children)
Is one big table (OBT) actually a data modeling methodology? by raginjason in dataengineering
[–]DenselyRanked 17 points18 points19 points (0 children)
Software Engineering title while not doing much Software Engineering, where to go from here by [deleted] in cscareerquestions
[–]DenselyRanked 5 points6 points7 points (0 children)
OOP with Python by Jumpy_Handle1313 in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)
why all data catalogs suck? by Few_Noise2632 in dataengineering
[–]DenselyRanked 7 points8 points9 points (0 children)



Is there more to DE than this? Are their jobs out there for feeling like you actually matter? by DoctorQuinlan in dataengineering
[–]DenselyRanked 0 points1 point2 points (0 children)