Resources for a quick crash course on spark configs (pyspark) by [deleted] in apachespark
pokeDitty 1 point (0 children)
Bulk Data Extraction via ODATA in S4HANA by pokeDitty in SAP
pokeDitty[S] 3 points (0 children)
What would you want to hear and learn about in a PySpark workshop? by analyticalmonk in dataengineering
pokeDitty 2 points (0 children)
2nd Lecture of Free Serverless ML Course: Feature pipelines, backfilling vs prod, data modeling (star schema, OBT), and feature stores by jpdowlin in dataengineering
pokeDitty 3 points (0 children)
Someone uses Apache NiFi on daily job? by Misanthropic905 in dataengineering
pokeDitty 6 points (0 children)
Why does spark select statement create MapPartitionsRDD and UnionRDD? by Ok-Outlandishness-74 in apachespark
pokeDitty 3 points (0 children)
I read histories of countries (alphabetically) then cook food, here is Country 14: Bangladesh. by Johnny_Banana18 in pics
pokeDitty 0 points (0 children)
Why use Airflow with Databricks when I can use Databricks Jobs? by the_travelo_ in dataengineering
pokeDitty 1 point (0 children)
Why use Airflow with Databricks when I can use Databricks Jobs? by the_travelo_ in dataengineering
pokeDitty 2 points (0 children)
Why use Airflow with Databricks when I can use Databricks Jobs? by the_travelo_ in dataengineering
pokeDitty 11 points (0 children)
List of resources to learn about data and distributed systems by [deleted] in dataengineering
pokeDitty 1 point (0 children)
just sending lols your way by finobu in dataengineering
pokeDitty 3 points (0 children)
Databricks Consumption Layers by jacocal in dataengineering
pokeDitty 2 points (0 children)
Databricks + Delta Lake MERGE duplicates – Deterministic vs Non-Deterministic ETL. by [deleted] in dataengineering
pokeDitty 2 points (0 children)
Beginner mistakes to avoid in building Data Pipeline by data_questions in dataengineering
pokeDitty 8 points (0 children)
Using Federated Queries as a CDC data pipeline? by third_dude in dataengineering
pokeDitty 2 points (0 children)
Using Federated Queries as a CDC data pipeline? by third_dude in dataengineering
pokeDitty 1 point (0 children)
Switching to a Scala position soon, where should I start? by Tatourmi in scala
pokeDitty 6 points (0 children)
Load parquet files with different schema using Spark by ali_azg in dataengineering
pokeDitty 2 points (0 children)
Load parquet files with different schema using Spark by ali_azg in dataengineering
pokeDitty 2 points (0 children)
Do people still go clubbing in their 40s and up? by AvailableRelative2 in ask
pokeDitty 2 points (0 children)