determined

Experiment manager

A platform for managing and optimizing deep learning experiments with support for distributed training and hyperparameter tuning

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. It works with PyTorch and TensorFlow.

GitHub

3k stars
85 watching
359 forks
Language: Go
Last commit: about 1 month ago
Linked from 5 awesome lists

data-science, deep-learning, distributed-training, hyperparameter-optimization, hyperparameter-search, hyperparameter-tuning, keras, kubernetes, machine-learning, ml-infrastructure, ml-platform, mlops, pytorch, tensorflow

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| luolc/adabound | An optimizer that combines the benefits of Adam and SGD algorithms | 2,908 |
| devsisters/dqn-tensorflow | An implementation of a deep reinforcement learning algorithm for human-level control in game environments using TensorFlow | 2,491 |
| lightning-ai/pytorch-lightning | A framework for building and training AI models on multiple GPUs or TPUs with minimal code changes | 28,636 |
| pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 84,978 |
| google-parfait/tensorflow-federated | An open-source framework for training machine learning models across decentralized data | 2,323 |
| ahkarami/deep-learning-in-production | A collection of notes and references on deploying deep learning models in production environments | 4,313 |
| pytorch/ignite | A high-level library to help with training and evaluating neural networks in PyTorch | 4,554 |
| pkmital/tensorflow_tutorials | A collection of examples and tutorials illustrating various applications and use cases of TensorFlow in Python | 5,636 |
| nvidia/apex | Tools for streamlined mixed precision and distributed training in PyTorch | 8,460 |
| pyg-team/pytorch_geometric | A PyTorch-based library for training and applying Graph Neural Networks to structured data | 21,597 |
| catalyst-team/catalyst | A PyTorch framework for accelerating deep learning research and development by focusing on reproducibility, rapid experimentation, and codebase reuse | 3,300 |
| facebookresearch/pytorch3d | A deep learning library for 3D data processing and computer vision research using PyTorch | 8,889 |
| huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects | 136,357 |
| replicate/cog | A tool for packaging and deploying machine learning models in a standard, production-ready container environment | 8,169 |
| victoresque/pytorch-template | An open-source PyTorch deep learning project template providing a structured environment for building and training various models | 4,784 |