dvc

Data Management Tool

A tool for managing data and models in machine learning projects to ensure reproducibility and collaboration.

🦉 Data Versioning and ML Experiments

GitHub

14k stars
135 watching
1k forks
Language: Python
last commit: 6 days ago
Linked from 6 awesome lists

aidata-sciencedata-version-controldeveloper-toolsmachine-learningreproducibilityunstructured-data

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ddbourgin/numpy-ml A collection of machine learning algorithms implemented in NumPy for rapid experimentation and prototyping. 15,466
pachyderm/pachyderm Automates data transformations with versioning and lineage tracking for scalable data pipelines 6,179
iterative/dvclive A tool for tracking machine learning metrics and parameters in an integrated workflow with Git and DVC. 167
sdv-dev/sdv A library for generating synthetic tabular data based on real-world patterns 2,380
iterative/cml Automates machine learning workflows and generates reports on every pull request. 4,038
thtrieu/darkflow Tools and scripts for training and deploying real-time object detection models using TensorFlow 6,132
voxel51/fiftyone Improves machine learning workflows by enabling faster and more effective data visualization and model interpretation 8,875
netflix/metaflow A platform that enables scientists and engineers to build, deploy, and manage complex data science projects efficiently 8,246
dmlc/gluon-cv A toolkit for building and deploying deep learning models in computer vision 5,833
rhiever/data-analysis-and-machine-learning-projects A repository of teaching materials, code, and data for various data analysis and machine learning projects. 6,128
dotnet/machinelearning-samples A collection of samples and examples demonstrating the usage of ML.NET for machine learning tasks in .NET applications. 4,490
mlflow/mlflow A platform to manage the entire machine learning lifecycle, from experiment tracking to model deployment. 18,781
dvlab-research/mgm An open-source framework for training large language models with vision capabilities. 3,211
openvinotoolkit/open_model_zoo A collection of pre-trained deep learning models and demo applications for accelerating inference tasks 4,098
eriklindernoren/ml-from-scratch Provides implementations of fundamental machine learning models and algorithms from scratch in Python 24,003