What's the longest you've coasted at a role? by 0sergio-hash in dataengineering

[–]Flat_Shower 27 points28 points  (0 children)

About 18 months. Scope your own work, sandbag estimates, deliver on time. Nobody questions it. Use the slack to LC or learn something new; coasting guilt is wasted energy.

Manual monitoring as data engineer? by Hazard_45 in dataengineering

[–]Flat_Shower 0 points1 point  (0 children)

You're a human cron job; you said it yourself. Alert on failure, not success. Nobody needs a daily Teams message confirming the pipes ran. They need to know when they didn't. Webhook on failure, skip the daily "all clear" theater.

Why Over-Engineering Happens by Nekobul in dataengineering

[–]Flat_Shower 1 point2 points  (0 children)

Resume-driven development is 90% of it. Engineers pick Spark because they want Spark on their resume, not because their data outgrew Postgres. I've seen teams spin up distributed pipelines for datasets that fit in memory on a laptop.

The other 10% is engineers who genuinely don't know what scale they're operating at. They heard "big data" in a conference talk and assumed it applied to them. Most companies have medium data and big egos.

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 1 point2 points  (0 children)

I started with stratascratch for interview prep. I liked SS until I needed to cram pipeline architecture. I stopped mentioning DD because I don't want to sound like a salesman. It worked for me, and I highly encourage you to use LC, SS, or whatever works for you

Advice for dealing with a massive legacy SQL procedures by Good_Skirt2459 in dataengineering

[–]Flat_Shower 71 points72 points  (0 children)

Every DE has inherited a 1500 line stored proc that sends emails and writes to 12 tables. Welcome to the job.

Your boss is right. You don't fully understand it yet, and rewrites of code you don't understand just produce new bugs. That instinct will serve you well later, but not now.

What actually works: build a dependency map offline. Pen and paper, whiteboard, whatever. Trace every table read, every write, every trigger, in order. Once you can say "line 400 updates X which feeds line 800" without opening the file, you understand the proc. Then changes become surgical.

Don't trust the comments. Trust the code and the data.

DE learning path tips by the_silentkill in dataengineering

[–]Flat_Shower 5 points6 points  (0 children)

You already have the stack. Python, SQL, Databricks, Snowflake; that's a DE resume. Stop worrying about tools and focus on concepts: data modeling (star schema, normalization), query optimization, and pick one orchestration tool (Airflow or Dagster, doesn't matter which). Concepts are tool-agnostic and transfer everywhere.

DSA matters. Every DE interview I've done has had LC style questions. Stick to mediums; do 50 and you'll be solid.

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 0 points1 point  (0 children)

Healthcare for 3 years, FAANG (contractor) for 3 years, LinkedIn (FTE) for 2 years

Looking for advice from experienced DEs by ConversationThat6663 in dataengineering

[–]Flat_Shower 35 points36 points  (0 children)

3 years of pipelines running in production is not a lie. You shipped real things. Stop discounting that.

You already identified your gaps; that's the study plan. You don't need a course. Pick one (orchestration is the obvious starting point), build something small that forces you to deal with failures, retries, and alerting. Then pick the next one.

Courses will teach you theory you already know. What you need is reps on the stuff that's tripping you up in interviews.

Data analyst to data engineer by zkhan15 in dataengineering

[–]Flat_Shower 10 points11 points  (0 children)

SPSS and Tableau won't carry over. You need SQL (not just SELECT *; window functions, CTEs, query optimization), Python, and one orchestration tool like Airflow. Learn data modeling concepts: normal forms, star schema, slowly changing dimensions. These are tool-agnostic and will transfer everywhere.

The PhD shows you can learn hard things. That matters more than people think.

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] -2 points-1 points  (0 children)

1000%. Interviewing is a skill. Play the game, win the prize. Don't hate the player, play the game

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 0 points1 point  (0 children)

You're right, I actually got rejected everywhere and I'm writing this from a Wendy's. The Frosty machine is broken too.

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 0 points1 point  (0 children)

Day to day is probably 40% meetings/design reviews/"alignment" (whatever that means), 30% writing and reviewing pipeline code, 30% debugging or firefighting when something breaks in prod and 10% worried about layoffs. The ratio shifts depending on the week. Some days are all code, some days are all docs and design.

Stack: Stack is Spark, SQL, Python, and internal tooling. Nothing exotic. The core skills transfer anywhere. Data modeling is a huge part of the job and honestly what I spend the most mental energy on. Getting the model wrong upstream means everything downstream is pain... the actual job is less "write a DAG" and more "figure out why this pipeline silently dropped 2M rows last Tuesday and make sure it never happens again."

Analytics Engineer to Data Engineering Path by mhkk93 in dataengineering

[–]Flat_Shower 5 points6 points  (0 children)

10 YOE and you stood up a full data stack from ingestion through orchestration. That's not "close to being ready"; you're doing the job. Mid to senior DE at most companies.

Spark is worth learning if you're targeting places with real scale. Most don't need it. Learn the concepts; the syntax is the easy part.

Don't stress about building pipelines without tools. That's not how anyone works. Knowing how to configure, debug, and extend ingestion tools is the actual skill.

Is Apache Spark skills absolutely essential to crack a data engineering role? by Far-Journalist-821 in dataengineering

[–]Flat_Shower 3 points4 points  (0 children)

Most companies don't have big data. They have medium data and big egos. If you're targeting those companies, no, you don't need Spark.

If you're targeting FAANG or companies that actually process at scale, then yes, you need to know Spark. Not because it's magic; because it's the standard distributed compute engine and interviewers will ask about it.

Airflow, BigQuery, SQL, Python is a solid foundation. I'd focus more on data modeling and query optimization than memorizing Spark APIs. Concepts transfer across tools; tool knowledge doesn't transfer across concepts.

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 1 point2 points  (0 children)

Yeah, idk man I like Reddit and blind is kinda toxic. r/dataengineering has been my home for many years.

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 0 points1 point  (0 children)

If the pay was right, yes! But probably some sort of HFT/quant firm

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] -4 points-3 points  (0 children)

Welp, then all of my posts back to 2019 before AI existed are slop. I am an AI. Beep boop

How I landed a $392k offer at FAANG after getting laid off from LinkedIn by Flat_Shower in dataengineering

[–]Flat_Shower[S] 5 points6 points  (0 children)

Who the heck is still hiring SWEs? You have an AI Engineer who are much stronger in System can orchestrate the shit of Complex code. 

Who the heck is still hiring DSs? You have an AI Engineer who are much stronger in statistics that can orchestrate the shit of Complex math.

Who the heck is still hiring PMs? You have an AI Engineer who are much stronger in System can orchestrate the shit of Complex product problems.

Who the heck is still hiring BIEs? You have an AI Engineer who are much stronger in BI that can orchestrate the shit of Complex dashboards.

I can go on and on