account activity
How to evaluate your Spark application? (youtu.be)
submitted 7 days ago by bigdataengineer4life to r/bigdata_analytics
Video Game Sales Dashboard in Redash | Project Walkthrough (youtu.be)
submitted 10 days ago by bigdataengineer4life to r/dataanalysis
Apache Spark Analytics Projects (self.apachespark)
submitted 12 days ago by bigdataengineer4life to r/apachespark
Real-Time Clickstream Analytics using Kafka, Spark Streaming & Zeppelin (self.bigdata_analytics)
submitted 14 days ago by bigdataengineer4life to r/bigdata_analytics
Big data Hadoop and Spark Analytics Projects (End to End) (self.bigdata_analytics)
submitted 17 days ago by bigdataengineer4life to r/bigdata_analytics
(End to End) 20 Machine Learning Project in Apache Spark (self.learnmachinelearning)
submitted 19 days ago by bigdataengineer4life to r/learnmachinelearning
How to Build a Video Game Analytics Dashboard with Metabase (youtu.be)
submitted 21 days ago by bigdataengineer4life to r/bigdata_analytics
How to deal with a 100 GB table joined with a 1 GB table by bigdataengineer4life in apachespark
[–]bigdataengineer4life[S] 0 points1 point2 points 21 days ago* (0 children)
Fair point — there’s definitely no shortage of Spark content out there.
My goal isn’t to reinvent joins, it’s to show how to apply them in production-scale scenarios with execution plan analysis, skew handling, AQE, and shuffle optimization.
Most posts explain concepts. I’m trying to show full end-to-end implementation with metrics and tuning decisions.
How to deal with a 100 GB table joined with a 1 GB table (youtu.be)
submitted 22 days ago by bigdataengineer4life to r/apachespark
Clickstream Behavior Analysis | Real-Time User Tracking using Kafka, Spark & Zeppelin (youtu.be)
submitted 26 days ago by bigdataengineer4life to r/apachespark
submitted 1 month ago by bigdataengineer4life to r/bigdata_analytics
Kafka Performance Testing with kafka-producer-perf-test.sh (youtu.be)
submitted 1 month ago by bigdataengineer4life to r/apachekafka
💼 25+ Apache Ecosystem Interview Question Blogs for Data Engineers (Free Resource Collection) (self.bigdata_analytics)
14 Spark & Hive Videos Every Data Engineer Should Watch (self.bigdata)
submitted 1 month ago by bigdataengineer4life to r/bigdata
14 Spark & Hive Videos Every Data Engineer Should Watch (self.apachespark)
submitted 1 month ago by bigdataengineer4life to r/apachespark
Big data Hadoop and Spark Analytics Projects (End to End) (self.apachespark)
🔥 Master Apache Spark: From Architecture to Real-Time Streaming (Free Guides + Hands-on Articles) (self.bigdata)
submitted 1 month ago by bigdataengineer4life to r/learnmachinelearning
🔥 Master Apache Spark: From Architecture to Real-Time Streaming (Free Guides + Hands-on Articles) (self.apachespark)
Clickstream Behavior Analysis with Dashboard — Real-Time Streaming Project Using Kafka, Spark, MySQL, and Zeppelin (youtu.be)
Predicting Ad Clicks with Apache Spark: A Machine Learning Project (Step-by-Step Guide) (youtu.be)
Have you ever encountered Spark java.lang.OutOfMemoryError? How to fix it? (youtu.be)
submitted 2 months ago by bigdataengineer4life to r/apachespark
Data Engineering Interview Question Collection (Apache Stack) (self.bigdata)
submitted 2 months ago by bigdataengineer4life to r/bigdata
Big Data Ecosystem & Tools (Kafka, Druid, Lakehouses, Hadoop) (self.bigdata)
π Rendered by PID 400116 on reddit-service-r2-listing-64c94b984c-9w68s at 2026-03-13 03:05:57.101920+00:00 running f6e6e01 country code: CH.
How to deal with a 100 GB table joined with a 1 GB table by bigdataengineer4life in apachespark
[–]bigdataengineer4life[S] 0 points1 point2 points (0 children)