ydata-synthetic

Synthetic data generator

An educational package providing generative models for synthetic data generation.

Synthetic data generators for tabular and time-series data


1k stars
32 watching
238 forks
Language: Jupyter Notebook
Last commit: about 2 months ago
Linked from 2 awesome lists

Topics: datageneration, datagenerator, deep-learning, gan, gan-architectures, gans, generative-adversarial-network, machine-learning, python3, pytorch, synthetic-data, tensorflow2, time-series, timeseries, training-data

Related projects:

Repository | Description | Stars
gretelai/gretel-synthetics | A toolkit for generating synthetic data while preserving differential privacy | 602
dmey/synthia | Software for generating synthetic multivariate data with statistical properties preserved | 57
shuttle-hq/synth | A tool for generating realistic data from a declarative configuration language | 1,392
synthetichealth/synthea | Generates synthetic patient data and associated health records in various formats for testing and simulation purposes | 2,213
kaeluka/mock-data-gen | A library to generate random data from IO-TS types for testing purposes | 7
fnozarian/carla-kitti | Generates synthetic data from the CARLA simulator for KITTI 2D/3D Object Detection tasks | 40
nomemory/mockneat | A powerful data-generation and mocking library for creating realistic data in various formats | 531
hyuunnn/hyara | A plugin for multiple reverse engineering tools to generate YARA rules | 224
nvidia/dataset_synthesizer | Generates synthetic images and associated data for training deep learning models | 574
fakedata-haskell/fakedata | Generates realistic fake data for various purposes such as testing and simulation | 149
pyg-team/pytorch-frame | A deep learning framework for handling heterogeneous tabular data with diverse column types | 582
ollieboyne/blendersynth | A Python library for generating synthetic 3D datasets using Blender with custom features for rich per-pixel information and multiview rendering | 64
pioz/faker | A tool for generating realistic, customizable fake data and structs for software development | 95
argilla-io/distilabel | A framework for generating synthetic data and AI feedback to accelerate AI development | 1,750
sdv-dev/sdv | A library for generating synthetic tabular data based on real-world patterns | 2,416