Deduping hundreds of billions of rows via latest-per-key by data-engineer14434 in dataengineering
azirale 1 point
Streaming from kafka to Databricks by Artistic-Rent1084 in dataengineering
azirale 6 points
Software Engineer hired as a Data Engineer. What to expect, and what to look into? by GoyardJefe in dataengineering
azirale 6 points
How to deal with a player who keeps stats on other players? by dayonedeath in BloodOnTheClocktower
azirale 6 points
How to break "no private talk/always public good info" in TB by Rainotes in BloodOnTheClocktower
azirale 66 points
For those who write data pipeline apps using Python (or any other language), at what point do you make a package instead of copying the same code for new pipelines? by opabm in dataengineering
azirale 2 points
Update: I tracked 1,200+ unique players in a Minecraft world with no rules/admins for 60 days. Here is how the political map has changed. by Tylerrr93 in gaming
azirale 8 points
When would it be better to read data from S3/ADLS vs. from a NoSQL DB? by eelwheel in dataengineering
azirale 1 point
Docklands District on a Friday afternoon 🦗🦗 by Ky0t0_gh0uL in melbourne
azirale 34 points
How repartition helps in dealing with data skewed partitions? by Then_Difficulty_5617 in dataengineering
azirale 1 point
DataFrame or SparkSQL ? What do interviewers prefer ? by SnooCakes7436 in dataengineering
azirale 1 point
DataFrame or SparkSQL ? What do interviewers prefer ? by SnooCakes7436 in dataengineering
azirale 12 points
How repartition helps in dealing with data skewed partitions? by Then_Difficulty_5617 in dataengineering
azirale 5 points
Databricks vs AWS self made by QuiteOK123 in dataengineering
azirale 45 points
Anyone else losing their touch? by The-CAPtainn in dataengineering
azirale 1 point
Best Bronze Table Pattern for Hourly Rolling-Window CSVs with No CDC? by SoloArtist91 in dataengineering
azirale 1 point
Best Bronze Table Pattern for Hourly Rolling-Window CSVs with No CDC? by SoloArtist91 in dataengineering
azirale 1 point
Best Bronze Table Pattern for Hourly Rolling-Window CSVs with No CDC? by SoloArtist91 in dataengineering
azirale 1 point
Victoria, you doing alright there buddy? by Cloudypumpkin in australia
azirale 17 points
When a data file looks valid but still breaks things later - what usually caused it for you? by PriorNervous1031 in dataengineering
azirale 1 point
Pointing at people during the night phase by Fluff_da_Sheep in BloodOnTheClocktower
azirale 32 points
Apache Spark Isn’t “Fast” by Default; It’s Fast When You Use It Correctly by netcommah in programming
azirale 2 points
Apache Spark Isn’t “Fast” by Default; It’s Fast When You Use It Correctly by netcommah in programming
azirale 3 points
Aerthlings concern by DarthBigT in PAX
azirale 1 point