ydata-synthetic

Synthetic data generator

An educational package providing generative models for synthetic data generation.

Synthetic data generators for tabular and time-series data

GitHub

1k stars
32 watching
235 forks
Language: Jupyter Notebook
last commit: 15 days ago
Linked from 2 awesome lists

datagenerationdatageneratordeep-learninggangan-architecturesgansgenerative-adversarial-networkmachine-learningpython3pytorchsynthetic-datatensorflow2time-seriestimeseriestraining-data

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
gretelai/gretel-synthetics A toolkit for generating synthetic data while preserving differential privacy 597
dmey/synthia Software for generating synthetic multivariate data with statistical properties preserved 57
shuttle-hq/synth A tool for generating realistic data from a declarative configuration language 1,387
synthetichealth/synthea Generates synthetic patient data and associated health records in various formats for testing and simulation purposes. 2,183
kaeluka/mock-data-gen A library to generate random data from IO-TS types for testing purposes. 7
fnozarian/carla-kitti Generates synthetic data from the CARLA simulator for KITTI 2D/3D Object Detection tasks. 39
nomemory/mockneat A powerful data-generation and mocking library for creating realistic data in various formats 529
hyuunnn/hyara A plugin for multiple reverse engineering tools to generate YARA rules 223
nvidia/dataset_synthesizer Generates synthetic images and associated data for training deep learning models 573
fakedata-haskell/fakedata Generates realistic fake data for various purposes such as testing and simulation 149
pyg-team/pytorch-frame A deep learning framework for handling heterogeneous tabular data with diverse column types 543
ollieboyne/blendersynth A Python library for generating synthetic 3D datasets using Blender with custom features for rich per-pixel information and multiview rendering. 64
pioz/faker A tool for generating realistic, customizable fake data and structs for software development. 92
argilla-io/distilabel A framework for generating synthetic data and AI feedback to accelerate AI development 1,650
sdv-dev/sdv A library for generating synthetic tabular data based on real-world patterns 2,380