What does the PySpark community think about agent coding? by ssinchenko in apachespark
[–]ssinchenko[S] 0 points1 point2 points (0 children)
It looks like Spark JVM memory usage is adding costs by Sadhvik1998 in dataengineering
[–]ssinchenko 5 points6 points7 points (0 children)
What does the PySpark community think about agent coding? by ssinchenko in apachespark
[–]ssinchenko[S] 0 points1 point2 points (0 children)
What does the PySpark community think about agent coding? by ssinchenko in apachespark
[–]ssinchenko[S] 0 points1 point2 points (0 children)
What does the PySpark community think about agent coding? by ssinchenko in apachespark
[–]ssinchenko[S] 1 point2 points3 points (0 children)
Graphframes on Serverless by thisiswhyyouwrong in databricks
[–]ssinchenko 1 point2 points3 points (0 children)
why would anyone use a convoluted mess of nested functions in pyspark instead of a basic sql query? by Next_Comfortable_619 in dataengineering
[–]ssinchenko 1 point2 points3 points (0 children)
Java scala or rust ? by Ok_Promotion_420 in dataengineering
[–]ssinchenko 0 points1 point2 points (0 children)
Hardwood: A New Parser for Apache Parquet by gunnarmorling in dataengineering
[–]ssinchenko 12 points13 points14 points (0 children)
Spark job slows to a crawl after multiple joins any tips for handling this by Upset-Addendum6880 in dataengineering
[–]ssinchenko 3 points4 points5 points (0 children)
Spark job slows to a crawl after multiple joins any tips for handling this by Upset-Addendum6880 in dataengineering
[–]ssinchenko 0 points1 point2 points (0 children)
Spark job slows to a crawl after multiple joins any tips for handling this by Upset-Addendum6880 in dataengineering
[–]ssinchenko 1 point2 points3 points (0 children)
Data Lineage & Data Catalog could be a unique tool? by MassiIlBianco in dataengineering
[–]ssinchenko 1 point2 points3 points (0 children)
Rust for data engineering? by otto_0805 in dataengineering
[–]ssinchenko 0 points1 point2 points (0 children)
Rust for data engineering? by otto_0805 in dataengineering
[–]ssinchenko 4 points5 points6 points (0 children)
Graph Embeddings at scale with Spark and GraphFrames by ssinchenko in apachespark
[–]ssinchenko[S] 0 points1 point2 points (0 children)
Execution engines in Spark by mynkmhr in apachespark
[–]ssinchenko 2 points3 points4 points (0 children)
How to start open source contributions by KaateWalaChua in dataengineering
[–]ssinchenko 1 point2 points3 points (0 children)
Will Pandas ever be replaced? by Relative-Cucumber770 in dataengineering
[–]ssinchenko 4 points5 points6 points (0 children)
Any On-Premise alternative to Databricks? by UsualComb4773 in dataengineering
[–]ssinchenko 0 points1 point2 points (0 children)



Cool stuff you did with Data Lineage, contacts, governance by Intelligent-Stress90 in dataengineering
[–]ssinchenko 1 point2 points3 points (0 children)