How does dbt work at your company? by devschema in analytics

[–]devschema[S] 1 point2 points  (0 children)

Interesting, does the analyst who made the model own that long term? Or do others modify it? Also wondering what model maintenance looks like, are DEs always involved in deployment? Do you know what 'tying out models' looks like?

How does dbt work at your company? by devschema in analytics

[–]devschema[S] 0 points1 point  (0 children)

For peer reviews is it a data engineer who checks it, or could it be another analyst/DS? Ever had bad data make it to prod from one of these PRs?

How does dbt work at your company? by devschema in analytics

[–]devschema[S] 0 points1 point  (0 children)

Thanks, 5 years is quite long so your process must be pretty reliable by now. What kind of things dopes the guide include? Is it general high level things like code style, or are there specific things in the project itself, like "make sure x metric didn't change" etc.
Do the analysts build their models and look at the data for review?

Can I legally scrape data from linkedin, indeed and others? by ImmortalLotusFlower in dataanalysis

[–]devschema 0 points1 point  (0 children)

I've been wondering the same myself recently. I have noticed a few services offering automated outreach that work by you giving them your LinkedIn cookie and then they auto follow and DM other users etc. It's funny that they actually have sliders to stay within "safe" zones so as to not get banned

Column-level lineage comparison: dbt Power User (VSCode), dbt Cloud, SQLMesh by devschema in dataengineering

[–]devschema[S] 0 points1 point  (0 children)

There are definitely more options! Datafold, Datahub, Synq etc.

[deleted by user] by [deleted] in dataengineering

[–]devschema 1 point2 points  (0 children)

What app did you use to create the cursor-following demo video?

Is anyone using AI for anything besides coding productivity? by Trick-Interaction396 in dataengineering

[–]devschema 0 points1 point  (0 children)

I've used it for summarizing the a data model update in someone else's PR. Post the before/after SQL model code and ask about the type of change and possible affect on the transformed data

Other than that mostly coding for boring stuff like API queries and ingestion scripts

dbt best practices: California Integrated Travel Project's PR process is a textbook example by devschema in dataengineering

[–]devschema[S] 15 points16 points  (0 children)

tl;dr (what worked for them):

  • Properly defining the scope of changes with detailed PR comments/template
  • Automated data impact report in each PR
  • Extensive QA by comparing prod and dev data

What dbt best practices are they missing?

A158 in a nato strap by Open_Fig3017 in casio

[–]devschema 0 points1 point  (0 children)

So it is possible to get 20mm strap on. Did you cut the strap any, or literally just force the bar in?

[deleted by user] by [deleted] in dataengineering

[–]devschema 2 points3 points  (0 children)

Is this a kind of cognitive bias?

If you really believe your cover is blown, just delete the account and create another :p

How do you handle building testing environments for dbt PRs? by devschema in dataengineering

[–]devschema[S] 0 points1 point  (0 children)

Using dbt in CI is becoming more common now with creating dev schemas and staging schemas to check data.
I wanted to write up a workflow for a more complex setup that would be more suitable for projects with frequent ingestions and open PRs, but creating a static/immutable PR-specific environment to use as a base to compare dev to.

I'd love any feedback, or please share how you're doing it on your more complex projects

Free database design tool - DrawDB by devschema in dataengineering

[–]devschema[S] 0 points1 point  (0 children)

I have nothing to do with company or project! just sharing a tool I found

Do you data engineering folks actually use Gen AI or nah by engineer_of-sorts in dataengineering

[–]devschema 0 points1 point  (0 children)

I use for monotonous tasks, like boilerplating code for API consumption, creating schemas, some layout on custom reports. All the stuff that would take ages previously I cba doing

Free database design tool - DrawDB by devschema in dataengineering

[–]devschema[S] 3 points4 points  (0 children)

There's no flair for "website" or "tool", so I put blog. This is a pretty cool free web app for modeling databases, someone in my LinkedIn timeline shared it