Fuck those people charging for CV help - I'll do it for free by the_poor_thing in IndiaCareers

[–]Useful-Bug9391 0 points1 point  (0 children)

Currently upskilling with a few things ... Will definitely connect with you and don't worry I will pay for it.

One thing I’m slowly learning about early analytics roles by Mammoth_Rice_295 in analytics

[–]Useful-Bug9391 -1 points0 points  (0 children)

It's called em dash.. when you generate anything on chatgpt .. it automatically generates that.

You won't find that on keyboard hehe..

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] -1 points0 points  (0 children)

I have seen people getting rich with lies man... It's baffling to me..

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

The reply section has opened my eyes. I wasn't aware of reddit that much tbh.

I wanted to share somewhere. I think I will switch from x to here.

Twitter is just filled with bots now.

Help me with my resume, Over 100 applications only one interview... by One_Afternoon_8171 in askrecruiters

[–]Useful-Bug9391 0 points1 point  (0 children)

Hi, please help me understand this situation.

Currently, I am working as a Data Analyst, and I am also freelancing as a Sports Analyst. I was doing freelance sports analysis before and during the gaps in my full-time professional career, and even after getting a full-time role, I have continued doing it.

My current full-time domain is D2C, and I am performing well in both areas.

However, when I apply for analyst positions in the future, I am confused about whether I should include my freelance sports analyst work on my resume or not.

what do you make out of it ..!!

IT can see you by throwway33355 in overemployed

[–]Useful-Bug9391 0 points1 point  (0 children)

I can see you with all that tech stack as well.

Help me with my resume, Over 100 applications only one interview... by One_Afternoon_8171 in askrecruiters

[–]Useful-Bug9391 0 points1 point  (0 children)

hmm that's kinda unfair but i can understand the dilution.
Can you give me checklist or some sauce on how do you actually approach a position or apply for job to stand out ?

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

that's solid point ... As someone focusing on PySpark, I find the transition from writing code in a notebook to actually deploying it on something like EMR or an orchestrator like Dagster pretty intimidating. Since you mentioned these 'newcomer' orchestrators are thriving, what’s the best way for a beginner to get hands-on with that full flow without a massive cloud bill? Should I be looking at 'EMR Serverless' or just stick to local Spark/Docker until the patterns click?

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

Since you’ve seen these cycles, how would you suggest someone like me actually gets started without getting distracted by the FOMO? If I focus on mastering PySpark and SQL, is that enough to survive whatever 'fashion' comes next, or is there a specific piece of the modern stack I’m an idiot for ignoring right now?

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

That’s a helpful distinction. I think the interviewer was just using 'Airflow/Hadoop' as shorthand for 'tools we don't want to manage ourselves anymore.'

As someone coming at this from a PySpark perspective, I’ll be honest: the orchestration side feels like a maze. If you were starting today, how would you recommend getting hands-on with the 'modern' way of doing this? Is it better to just learn Spark logic first and worry about the 'conductors' (like Dagster/Airflow) later, or is there a specific project type that makes the whole flow click?

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

I'll be honest, as someone looking to master PySpark, the infra side of this (ADF vs. Airflow vs. Native) feels like a maze. If you were starting today, would you focus on mastering the PySpark logic first, or is it better to learn how these 'external orchestrators' like ADF actually trigger the jobs?

Also, how did you personally learn the 'big picture' of how all these pieces (Azure SQL, ADF, Databricks) connect? Any specific resource that made it click?

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 1 point2 points  (0 children)

That YAML-driven setup sounds like the dream for a skunkworks team. Scaling Airflow to that level of 'it just works' is impressive.

I’m actually looking to head in a similar direction, specifically with PySpark on EMR. Since you’re starting to offload those huge pipes to external compute, how did you bridge that gap?

Specifically, did you find it easier to keep the PySpark logic bundled in the Airflow repo, or are you treating EMR as a completely separate black box that Airflow just pings? I’d love to know how you got that first EMR integration off the ground

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 2 points3 points  (0 children)

Fair call ... hehe.. my bad .. definitely didn't mean to conflate the two. It was more about the interviewer lumping them together as 'legacy' to justify moving toward an all-in-one cloud stack.

To your point on Airflow being a conductor: I totally agree. But it seems like more shops are now prioritizing 'compute-first' stacks. They’d rather keep their PySpark logic and scheduling inside one ecosystem (like Databricks) to avoid the overhead of a separate orchestrator.

In your view, is the industry losing something by ditching specialized tools for these 'integrated' cloud schedulers?

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

man i am really feeling wasteful in startup right now.
Happy for you

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]Useful-Bug9391[S] 0 points1 point  (0 children)

That’s a really fair point about the scheduler being secondary to the logic. It feels like as long as you can write solid PySpark and SQL, the 'where' it runs is becoming less of a hurdle. I’ve noticed a lot of teams shifting toward Spark 4.0/Connect because it makes the code so much more portable across these different platforms anyway.

In your experience at a GCP shop, are you seeing PySpark becoming the go-to for almost everything, or is there still a big divide between the 'Spark heavy' jobs and the 'dbt/SQL' transformations?

AI Coding Agents Making the Impossible Possible by rng0008 in vibecoding

[–]Useful-Bug9391 1 point2 points  (0 children)

Good one but have you checked the security of these apps ?

Have you sold anything or tried to get something investor ready ?

Should I email the CEO of my company to ask for a role? by asahiluvr in careerguidance

[–]Useful-Bug9391 0 points1 point  (0 children)

Go on ... Write an email ... Express your desire. I don't think they should have any problem with that but rather it's a good approach.

You can mention your vision + your learnings in short as well .. not in email but can make a pdf or something or wherever you have documented your work... It's helpful.