deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

GitHub

3k stars
81 watching
533 forks
Language: Scala
last commit: 9 days ago
Linked from 2 awesome lists

dataqualityscalasparkunit-testing

Backlinks from these awesome lists: