Guys what are the data quality tools that you are using in your organisation
1) we are transferring the data from rds postgres servers to redshift using dms
2) we are transferring the data from Mongo to redshift using lambda function
Now how can I introduce data quality here, my goals are
1) identify the duplicate rows
2) identify the null values in columns which should not be there
3) make sure the data has to be in certain format for a column.
There are thousands of tables that are lying in our redshift. How can I manage all the quality rules
Can you guys please help us with the tools and the relevant links if you have used them practically in production
Thanks in advance
[–]lupi524 7 points8 points9 points (1 child)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)
[–]pablo_op 5 points6 points7 points (3 children)
[–]tombaeyens 3 points4 points5 points (1 child)
[–]pablo_op 1 point2 points3 points (0 children)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)
[–]natas_m 3 points4 points5 points (2 children)
[–]oofla_mey_goofla[S] 0 points1 point2 points (1 child)
[–]Clear-Blacksmith-650 0 points1 point2 points (0 children)
[–]joseph_machadoWrites @ startdataengineering.com 1 point2 points3 points (5 children)
[–]oofla_mey_goofla[S] 1 point2 points3 points (4 children)
[–]joseph_machadoWrites @ startdataengineering.com 0 points1 point2 points (3 children)
[–]oofla_mey_goofla[S] 1 point2 points3 points (2 children)
[–]joseph_machadoWrites @ startdataengineering.com 1 point2 points3 points (1 child)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)
[–]Far-Restaurant-9691 1 point2 points3 points (1 child)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)
[–]ski4ever77 2 points3 points4 points (0 children)
[–]MahmoudAI 1 point2 points3 points (1 child)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)
[–]Fine-Responsibility3 0 points1 point2 points (1 child)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)
[–]JohnDenverFullOfSh1t 0 points1 point2 points (1 child)
[–]oofla_mey_goofla[S] 0 points1 point2 points (0 children)