great_expectations

Data quality testing framework

Provides tools and techniques to ensure data quality by defining expected outcomes for data processing pipelines.

Always know what to expect from your data.

GitHub

10k stars
85 watching
2k forks
Language: Python
last commit: about 17 hours ago
Linked from 3 awesome lists

cleandatadata-engineeringdata-profilersdata-profilingdata-qualitydata-sciencedata-unit-testsdatacleanerdatacleaningdataqualitydataunittestedaexploratory-analysisexploratory-data-analysisexploratorydataanalysismlopspipelinepipeline-debtpipeline-testingpipeline-tests

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
google-research/t5x A modular framework for training and deploying sequence models at scale 2,699
jaxgaussianprocesses/gpjax Provides a low-level interface to Gaussian process models in JAX for flexible extension and customisation 462
gluon-api/gluon-api A simple and flexible deep learning API for building neural networks 2,300
gradio-app/gradio Enables rapid creation and deployment of web applications for machine learning models and functions using Python 34,244
gee-community/geetools Tools for processing geospatial data using the Google Earth Engine Python API 529
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,968
greptimeteam/demo-scene Supports demos and talks on time-series databases and data processing pipelines with tools like GreptimeDB, InfluxDB, and Prometheus. 29
stability-ai/stability-sdk An SDK for interacting with Stability AI's APIs to generate images and other artifacts through latent diffusion inference. 2,427
pysimplegui/pysimplegui A Python GUI library that simplifies the development of desktop applications with a simple and intuitive interface. 13,461
ktr0731/evans A gRPC client library with two modes: REPL and CLI, providing automatic service inspection and task automation 4,297
wswup/gridwxcomp Compares weather station data with gridded climate datasets hosted on Google Earth Engine 17
pypi/warehouse A software system that powers the package registry for Python packages 3,606
pygithub/pygithub A Python library to access the GitHub REST API 7,054
amygdala/code-snippets A repository containing small examples and code snippets for Google Cloud Platform services using Python. 156
geus-glaciology-and-climate/pypromice An open-source Python package for processing and handling automated weather station data from Greenland. 14