determined

Experiment manager

A platform for managing and optimizing deep learning experiments, with support for distributed training and hyperparameter tuning.

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. It works with PyTorch and TensorFlow.

GitHub

3k stars
85 watching
357 forks
Language: Go
Last commit: about 11 hours ago
Linked from 5 awesome lists

data-science, deep-learning, distributed-training, hyperparameter-optimization, hyperparameter-search, hyperparameter-tuning, keras, kubernetes, machine-learning, ml-infrastructure, ml-platform, mlops, pytorch, tensorflow

Related projects:

Repository | Description | Stars
luolc/adabound | An optimizer that combines the benefits of the Adam and SGD algorithms | 2,907
devsisters/dqn-tensorflow | An implementation of a deep reinforcement learning algorithm for human-level control in game environments using TensorFlow | 2,480
lightning-ai/pytorch-lightning | A framework for building and training AI models on multiple GPUs or TPUs with minimal code changes | 28,402
pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 84,371
google-parfait/tensorflow-federated | An open-source framework for training machine learning models across decentralized data | 2,322
ahkarami/deep-learning-in-production | A collection of notes and references on deploying deep learning models in production environments | 4,308
pytorch/ignite | A high-level library to help with training and evaluating neural networks in PyTorch | 4,531
pkmital/tensorflow_tutorials | A collection of examples and tutorials illustrating various applications and use cases of TensorFlow in Python | 5,639
nvidia/apex | Tools for streamlined mixed precision and distributed training in PyTorch | 8,417
pyg-team/pytorch_geometric | A PyTorch-based library for training and applying Graph Neural Networks to structured data | 21,382
catalyst-team/catalyst | A PyTorch framework for accelerating deep learning research and development by focusing on reproducibility, rapid experimentation, and codebase reuse | 3,295
facebookresearch/pytorch3d | A deep learning library for 3D data processing and computer vision research using PyTorch | 8,824
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects | 135,022
replicate/cog | A tool for packaging and deploying machine learning models in a standard, production-ready container environment | 8,081
victoresque/pytorch-template | An open-source PyTorch deep learning project template providing a structured environment for building and training various models | 4,754