SDV

Data generator

A library for generating synthetic tabular data based on real-world patterns

Synthetic data generation for tabular data

GitHub

2k stars
45 watching
317 forks
Language: Python
Last commit: 8 days ago
Linked from 1 awesome list

data-generation, deep-learning, gan, gans, generative-adversarial-network, generative-ai, generative-model, generativeai, machine-learning, multi-table, relational-datasets, sdv, synthetic-data, synthetic-data-generation, time-series

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| dmey/synthia | Software for generating synthetic multivariate data with statistical properties preserved | 57 |
| ydataai/ydata-synthetic | An educational package providing generative models for synthetic data generation. | 1,441 |
| trekhleb/homemade-machine-learning | Practices implementing popular machine learning algorithms from scratch to gain a deeper understanding of their mathematics | 23,121 |
| iterative/dvc | Helps develop reproducible machine learning projects by versioning data and models, tracking experiments, and comparing results. | 13,943 |
| ollieboyne/blendersynth | A Python library for generating synthetic 3D datasets using Blender with custom features for rich per-pixel information and multiview rendering. | 64 |
| mwaskom/seaborn | A high-level interface for statistical data visualization | 12,575 |
| shap/shap | Provides an algorithm to explain the output of machine learning models using game theory and Shapley values. | 22,917 |
| gretelai/gretel-synthetics | A toolkit for generating synthetic data while preserving differential privacy | 597 |
| manujosephv/pytorch_tabular | A deep learning framework specifically designed for tabular data, providing a standardized approach to modeling and deploying complex machine learning models. | 1,391 |
| stability-ai/stability-sdk | An SDK for interacting with Stability AI's APIs to generate images and other artifacts through latent diffusion inference. | 2,425 |
| sarababakn/mfcl-neurips23 | A framework for mitigating catastrophic forgetting in federated learning for vision tasks using data synthesis from past distributions. | 15 |
| nrel/sup3r | Creates synthetic high-resolution spatiotemporal data for renewable energy resources using generative adversarial networks. | 87 |
| jvalegre/robert | Automated machine learning protocols for cheminformatics using Python | 38 |
| iceye-ltd/icecube | A Python library designed to organize SAR images and annotations for supervised machine learning applications. | 82 |
| holoviz/panel | A powerful data exploration and web app framework that lets you build complex applications entirely in Python using popular visualization tools | 4,802 |