great_expectations

Data quality testing framework

Provides tools and techniques to ensure data quality by defining expected outcomes for data processing pipelines.

Always know what to expect from your data.

GitHub

10k stars
85 watching
2k forks
Language: Python
last commit: about 1 month ago
Linked from 3 awesome lists

cleandatadata-engineeringdata-profilersdata-profilingdata-qualitydata-sciencedata-unit-testsdatacleanerdatacleaningdataqualitydataunittestedaexploratory-analysisexploratory-data-analysisexploratorydataanalysismlopspipelinepipeline-debtpipeline-testingpipeline-tests

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
google-research/t5x A modular framework for training and deploying sequence models at scale 2,706
jaxgaussianprocesses/gpjax Provides a low-level interface to Gaussian process models in JAX for flexible extension and customisation 467
gluon-api/gluon-api A simple and flexible deep learning API for building neural networks 2,300
gradio-app/gradio Enables rapid creation and deployment of web applications for machine learning models and functions using Python 34,557
gee-community/geetools A collection of tools and extensions to the Google Earth Engine Python API for geospatial processing 531
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,997
greptimeteam/demo-scene Supports demos and talks on time-series databases and data processing pipelines with tools like GreptimeDB, InfluxDB, and Prometheus. 30
stability-ai/stability-sdk An SDK for interacting with Stability AI's APIs to generate images and other artifacts through latent diffusion inference. 2,427
pysimplegui/pysimplegui A Python GUI library that simplifies the development of desktop applications with a simple and intuitive interface. 13,480
ktr0731/evans A gRPC client library with two modes: REPL and CLI, providing automatic service inspection and task automation 4,304
wswup/gridwxcomp Compares weather station data with gridded climate datasets hosted on Google Earth Engine 17
pypi/warehouse The software behind the Python Package Index. 3,617
pygithub/pygithub A Python library to access the GitHub REST API 7,078
amygdala/code-snippets A repository containing small examples and code snippets for Google Cloud Platform services using Python. 156
geus-glaciology-and-climate/pypromice An open-source Python package for processing and handling automated weather station data from Greenland. 14