python-deequ

Data profiler

A Python API for defining unit tests for data quality in large datasets

Python API for Deequ

GitHub

730 stars
17 watching
135 forks
Language: Jupyter Notebook
last commit: about 1 month ago

Related projects:

Repository Description Stars
realpython/django-slow-tests Identifies and reports on the slowest tests in a Django application 182
wq/django-rest-pandas An API that serves pandas DataFrames via Django REST Framework for data visualization and offline analysis. 1,254
nlgranger/seqtools A Python library to manipulate and transform indexable data 48
doloopwhile/pyjq A Python binding for a JSON processor that allows transforming and filtering structured data 196
django-query-profiler/django-query-profiler Helps identify slow Django applications by analyzing and visualizing database queries 139
dodger487/dplython A Python implementation of data manipulation functions inspired by the R package Dplyr. 764
whamlyn/auralib A collection of utility classes and functions for geoscience data analysis and manipulation. 33
awslabs/aws-security-automation Automated incident response and security remediation tools for AWS services 620
jdiasn/lidarwind A Python package that retrieves wind profiles from Doppler lidar observations. 15
django-haystack/pysolr Provides a lightweight Python interface to interact with Apache Solr search engine 667
mohamedadaly/labr A dataset of Arabic book reviews for natural language processing tasks 44
futurulus/wiinaq A dictionary web application with automatically generated tables and enhanced search capabilities. 2
maluuba/newsqa Compiles and provides structured access to Maluuba's NewsQA dataset for natural language question answering research. 253
wikier/djubby A Linked Data frontend for SPARQL endpoints in Django, providing an interface to access and manipulate data using standard web protocols. 18
apmonitor/data_science An online course using Python to analyze data and develop predictive models for heat transfer applications 80