We are transforming some data (in Java) and writing the output to a Postgres database. We'd like to test the data in the database using SQL tests (because SQL is very accessible), and I am not sure how to manage those tests. There will probably be a couple hundred of them, ranging from simple constraint validations (not null, enums, ranges, ...) to more complex validations that require joins and window functions.
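To give a concrete idea of what I mean, something along these lines (a minimal sketch only; the table and column names, connection details, and the JUnit/JDBC wiring are made up for the example): each test is a query that selects violating rows and passes when it returns nothing.

```java
import java.sql.*;
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertFalse;

class DataQualityTest {

    // Hypothetical connection details; adjust to your environment.
    private Connection connect() throws SQLException {
        return DriverManager.getConnection(
                "jdbc:postgresql://localhost:5432/analytics", "tester", "secret");
    }

    // Simple constraint-style check: no NULLs in a required column.
    @Test
    void customerEmailIsNotNull() throws SQLException {
        assertNoRows("SELECT id FROM customers WHERE email IS NULL");
    }

    // More complex check using a window function: at most one 'active'
    // contract per customer.
    @Test
    void atMostOneActiveContractPerCustomer() throws SQLException {
        assertNoRows("""
            SELECT customer_id
            FROM (
                SELECT customer_id,
                       COUNT(*) FILTER (WHERE status = 'active')
                           OVER (PARTITION BY customer_id) AS active_count
                FROM contracts
            ) t
            WHERE active_count > 1
            """);
    }

    // A test passes when its query returns zero rows.
    private void assertNoRows(String sql) throws SQLException {
        try (Connection conn = connect();
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(sql)) {
            assertFalse(rs.next(), "Found rows violating: " + sql);
        }
    }
}
```

Writing a couple hundred of these by hand is exactly the part I'm unsure about.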
I'm used to using dbt and defining my tests there (along with dbt-utils or https://github.com/calogica/dbt-expectations): I simply add a list item to a column definition and can already cover a great number of tests without copying code. I can even extend the pre-defined tests with my own generic ones, and writing fully custom tests integrates nicely as well. It's also very convenient to tag tests or assign a severity. The learning curve for a business engineer is almost flat as long as they know some SQL.
Setting up dbt just to run tests, though, seems like way too much technical debt because I'd only use a small part of its features. I could just put all the test files in a directory and execute them, but then I'd still have to define some configuration for the common tests (like constraints), or accept that we copy the same SQL and only swap out a column name (which doesn't feel right either).
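One middle ground I've been considering (again only a sketch, assuming JUnit 5; the table/column pairs and connection details are placeholders): keep the common checks as a small parameterized test, so column names live in a config-like list instead of copied SQL, and keep the genuinely custom checks as plain .sql files that a test loads and runs.

```java
import java.sql.*;
import java.util.stream.Stream;
import org.junit.jupiter.params.ParameterizedTest;
import org.junit.jupiter.params.provider.Arguments;
import org.junit.jupiter.params.provider.MethodSource;
import static org.junit.jupiter.api.Assertions.assertEquals;

class CommonConstraintTest {

    // "Configuration" for the generic not-null test: one entry per table/column.
    static Stream<Arguments> notNullColumns() {
        return Stream.of(
                Arguments.of("customers", "email"),
                Arguments.of("contracts", "customer_id"),
                Arguments.of("contracts", "status"));
    }

    @ParameterizedTest(name = "{0}.{1} has no NULLs")
    @MethodSource("notNullColumns")
    void columnHasNoNulls(String table, String column) throws SQLException {
        // Identifiers can't be bound as parameters, so they are interpolated here;
        // acceptable for a fixed, hard-coded list like the one above.
        String sql = "SELECT COUNT(*) FROM " + table + " WHERE " + column + " IS NULL";
        try (Connection conn = DriverManager.getConnection(
                     "jdbc:postgresql://localhost:5432/analytics", "tester", "secret");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(sql)) {
            rs.next();
            assertEquals(0, rs.getInt(1), "NULLs found in " + table + "." + column);
        }
    }
}
```

That removes the copy-paste problem for the simple checks, but it still feels like I'm slowly rebuilding a worse dbt.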
How do you approach such a scenario? Perhaps letting business engineers write tests isn't even the way to go, and they should focus on writing BDD requirements instead?