dirty_cat
Categorical cleaner
A Python library that helps machine learning on imperfect categorical data
Machine learning on dirty tabular data (legacy clone of skrub)
16 stars
0 watching
4 forks
Language: Python
last commit: over 1 year ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
alfred82santa/dirty-models | A Python library that provides a way to easily create and manage data models without modifying the original data. | 10 |
davified/clean-code-ml | Adapting clean code principles to machine learning and data science in Python | 713 |
neuraxio/kata-clean-machine-learning-from-dirty-code | Converting dirty machine learning code into clean, modular, and reusable components using the Pipe and Filter Design Pattern for Machine Learning. | 18 |
msamogh/nonechucks | Library that provides dynamic data cleaning and filtering capabilities for PyTorch datasets and samplers | 377 |
cgnorthcutt/rankpruning | An algorithm and package for handling noisy labels in binary classification problems | 82 |
tidalcycles/clean-samples | Provides pre-cleaned and documented audio samples for musical experimentation | 44 |
kastnerkyle/kaggle-dogs-vs-cats | A Python implementation of a machine learning solution for classifying images as dogs or cats from the Kaggle competition. | 66 |
dizballanze/django-eraserhead | Tool to optimize database usage in Django by identifying and suggesting the removal of unused fields. | 196 |
kthyeon/fine_official | An implementation of a method for training machine learning models using noisy labels | 38 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
pytorch/data | A PyTorch project providing data loading utilities and scalable dataloading solutions | 1,133 |
databasecleaner/database_cleaner-mongoid | A tool for cleaning up data in MongoDB databases. | 9 |
ayush1997/visualize_ml | A Python package for data analysis and visualization in machine learning | 200 |
hcguersoy/cleanreg | Removes unnecessary image manifests from a Docker Registry | 56 |
scour-project/scour | An SVG optimizer/cleaner tool that reduces the size of vector graphics by removing unnecessary data. | 780 |