nonechucks

Data cleaner

Library that provides dynamic data cleaning and filtering capabilities for PyTorch datasets and samplers

Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

GitHub

378 stars
3 watching
27 forks
Language: Python
last commit: about 2 years ago
data-cleaningdata-pipelinedata-preprocessingdata-processingmachine-learningpreprocessingpytorchtorch

Related projects:

Repository Description Stars
pytorch/data A PyTorch project providing data loading utilities and scalable dataloading solutions 1,132
nearai/torchfold A PyTorch module for dynamic batching and optimized computation on deep neural networks 221
zhanghang1989/pytorch-encoding A Python framework for building deep learning models with optimized encoding layers and batch normalization. 2,043
leehomyc/mixup_pytorch An implementation of a novel data augmentation technique to improve deep learning model performance on image classification tasks. 123
bloodaxe/pytorch-toolbelt A comprehensive Python library with PyTorch extensions for rapid prototyping and machine learning model development. 1,522
pyg-team/pytorch-frame A deep learning framework for handling heterogeneous tabular data with diverse column types 552
pistony/torch-toolbox A collection of reusable utility functions and classes to simplify PyTorch development 417
ptrblck/pytorch_misc A collection of code snippets and examples exploring various PyTorch concepts and techniques 547
mrdrozdov-github/pytorch-extras A collection of utility functions to extend PyTorch functionality 50
tristandeleu/pytorch-meta Provides tools and datasets for meta-learning and few-shot learning in deep learning 1,993
jacobgil/pytorch-pruning This project provides a PyTorch implementation of pruning techniques to reduce the computational resources required for neural network inference. 875
viking-sudo-rm/stacknn This project provides differentiable stacks and queues in PyTorch to facilitate the integration of data structures into models. 142
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,227
nhagar/substack_api A lightweight Python wrapper around Substack's API to facilitate data collection and access. 21
cemoody/topicsne An implementation of t-SNE in PyTorch for MNIST dataset analysis 475