deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

GitHub

3k stars
81 watching
536 forks
Language: Scala
last commit: 4 days ago
Linked from 3 awesome lists

dataqualityscalasparkunit-testing

Backlinks from these awesome lists: