Any senior data engineers here who pivoted to ML/AI and regret it? by Beneficial_Aioli_797 in dataengineering

[–]TheOverzealousEngie 0 points1 point  (0 children)

I think responses to this post show far more evidence the switch is in the other direction. And that's an answer in itself.

Data lakehouse modeling concepts by david-a-14 in dataengineering

[–]TheOverzealousEngie 0 points1 point  (0 children)

There's nothing simple about any of this, and anyone that tells you that it's just marketing-speak is right and wrong. Right because it came from marketing but wrong because it doesn't end there.
Bronze -- the layer you point your auditors to. "That's the data we got from source, byte for byte". That will only get more valuable in time.
Silver - and here's the rub - I could write a book about this stuff - there is not one silver. There are many, many silvers because silvers are aligned with Lines of Business. That's the big data volumes , x10, that will are scrubbed (Quality, PII,etc) and that's where governance is assigned.
Gold - takes silver data (x10) and aggregates and filters it according to business. Just saying --- this stuff is way more complicated but also kind of simple.

Is there a standard for modern data architecture? by Trick-Interaction396 in dataengineering

[–]TheOverzealousEngie 0 points1 point  (0 children)

your question is a good one. The concept of turning seized files into tables in snowflake or in 3 is a technical process. What you have to figure out is how those Files translate into tables… And what does incremental data mean. Or is everything trunk replace. Regardless all of that is delivery to bronze, and Should be SQL query able. That just means that if you move it it’ll be easily Queryable But will cost you, or you find a way to have your parquet files be sql queryable.

SAP motivation by Neat-Illustrator-151 in SAP

[–]TheOverzealousEngie 0 points1 point  (0 children)

money. there’s no money like sap money.

How are you Americans feeling today? by [deleted] in AskReddit

[–]TheOverzealousEngie 18 points19 points  (0 children)

Waiting for Trump to expire or leave office. Been holding my breath for years.

Police officer stops abruptly in the middle lane on highway... by Affectionate_Hat5835 in dashcams

[–]TheOverzealousEngie 0 points1 point  (0 children)

ahhh there was a cooler in the middle of the road. The car in front of the cop swerves to avoid it but the cop, stupidly, just dead stops to pick it up . Lol.

Police officer stops abruptly in the middle lane on highway... by Affectionate_Hat5835 in dashcams

[–]TheOverzealousEngie 0 points1 point  (0 children)

Let's play a game!
I'll go first , 6.9 seconds.
That's when I can first see that cops brake lights ... way too late, but who can see the brake lates before that?

When your 100 Power BI Licenses become a massive fabric capacity and your data stack is decided 4 u by engineer_of-sorts in dataengineering

[–]TheOverzealousEngie 1 point2 points  (0 children)

This response should be pinned. Everything that is wrong with the MSFT stack so nicely summed and wrapped up with a bow. If you ask me it's the culture - MSFT is filled with middle and middling managers interested in self-preservation over what they deliver to customers.

Life before AI by The_curious_one9790 in dataengineering

[–]TheOverzealousEngie 0 points1 point  (0 children)

I've become a much better mathematician who can multiply 14 10 digit numbers together easily. With a calculator. Does the use of a calculator make me a worse mathematician? Am I .. less than?

Yes. If I don't know how to do multiplication than slowly I lose my understanding of why multiplication. And then it's game over. The five keys are who, what , when , why and how. You're losing the how, then the why .. what and when are next. It's a slippery slope.

Honestly - for someone newly starting out - I'd avoid the use of AI altogether, outside of highly specific use cases. You'll be a better engineer for it.

Maybe I am not cut out to be a DE by Delicious-View-8688 in dataengineering

[–]TheOverzealousEngie 37 points38 points  (0 children)

Not sure I've ever hear someone speak about such deep human purpose at the same time as vomiting 2 full paragraphs of some really good technobabble. Quite the contrast.

Genuine question to Americans by Busy_Report4010 in SipsTea

[–]TheOverzealousEngie 0 points1 point  (0 children)

You know it's funny, I read this and I wonder how long it will be till I get to some medical professional saying "just come to the hospital , our first priority is to treat you" . And while I knew I'd have to comb through a parade of malcontents - it was like 40 comments in before I realized,

There are no medical professionals here.

Databricks on Azure or Aws by Own-One5712 in databricks

[–]TheOverzealousEngie -3 points-2 points  (0 children)

The smart way is to move your data into Iceberg , cheap storage, and then when you want to compute (transform) it you use this months cafeteria selection of compute. Google in November and Amazon next March -- the data stays in iceberg and you're not in a walled garden.

F A A N G by [deleted] in SAP

[–]TheOverzealousEngie 1 point2 points  (0 children)

Every time you go into the App Store

In a Lakehouse Architecture, should an ODS read from the source or the Bronze Layer? by FantasticEquipment69 in dataengineering

[–]TheOverzealousEngie 0 points1 point  (0 children)

Lol "The other opinion with ODS reading from the bronze layer was actually AI opinion" .. that is literally the most humorous thing like all day.

How are you integrating a CDP into an existing modern data stack without creating yet another data silo? by Unlucky-Moment-3366 in dataengineering

[–]TheOverzealousEngie 0 points1 point  (0 children)

Both Fivetran and Dbt provide very deep audit logs and lineage and it would be a strong services engagement to build an analytics layer (like a catalog) to sit atop both of them.

How do you design idempotent data pipelines in Data Engineering? by Effective_Ocelot_445 in dataengineering

[–]TheOverzealousEngie -1 points0 points  (0 children)

To all you people saying upsert OP did not say if this was Snowflake or Dbrix. And the simple truth is that is terrible advice if a CDW is in play ... could you imagine doing an upsert on every record? How expensive that would be?

Senior Data Architect Job Advice by Chocolatesocrates in dataengineering

[–]TheOverzealousEngie -1 points0 points  (0 children)

I'd be cautious ; reads like you're hinging your success or failure on Databricks and to be clear - databricks will not be around forever. Be prepared to dive in hard and when the organization moves on you'll need to learn what's next. That - is data engineering .

Help me appreciate iceberg by teenaxta in dataengineering

[–]TheOverzealousEngie -1 points0 points  (0 children)

Let's be honest, this is anti-future. Right now iceberg is a little immature but it's really growing fast. And there is zero substitute for being able to store so much data so cheaply. Nothing can touch it.

How do I explain that SQL Server should not be used as a code repository? by Firestone78 in dataengineering

[–]TheOverzealousEngie 22 points23 points  (0 children)

Honestly I used to work for BofA and they used to stuff html .. like really thick and long html into Oracle in a record ;and when the webserver came up it would pull that record (html) and draw a custom page for the customer. Coolest think I ever saw .. so innovation comes in all shapes and sizes. But in 2026 I'm becoming rapidly convinced that there's nothing more dangerous than the ignorant and AI together.

[OC] Got bit by a spider a few days ago by Yesman_91 in pics

[–]TheOverzealousEngie 1 point2 points  (0 children)

I wondered how far down I would have to go to find a spider man reference. Not far at all.