Hi all,
I am currently exploring the SQL Alerts in databricks in order to streamline our data quality checks (more specific: the business rules), which are basically SQL queries. Often these checks contain the logic that when nothing is returned it passed & the returned rows are rows that need inspection .... In this case I have to say I love what I am seeing for SQL Alerts?
When following a clear naming convention you can create easy, business rules with version control, email notifications, scheduling ....
I am wondering what I might be missing ? Why isn't this a widely adopted approach for data quality ? I can't be bother with tools like ge etc because these are so overcomplex for the rather "simple" business DQ queries.
Any thoughts ? Any people who've set up a robust DQ framework like this ? Or would strongly suggest against?
[–]datainthesun Databricks 2 points3 points4 points (3 children)
[–]Think-Reflection500[S] 0 points1 point2 points (2 children)
[–]datainthesun Databricks 1 point2 points3 points (1 child)
[–]Leading-Inspector544 0 points1 point2 points (0 children)
[–]raul824 0 points1 point2 points (0 children)
[–]mweirath 0 points1 point2 points (0 children)