byteps

Distributed DL framework

A high-performance distributed deep learning framework supporting multiple frameworks and networks

A high performance and generic framework for distributed DNN training

GitHub

4k stars
84 watching
491 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

deep-learningdistributed-trainingkerasmachine-learningmxnetpytorchtensorflow

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
facebookresearch/fairscale A PyTorch extension library that provides high-performance and large-scale training techniques. 3,210
pytorch/pytorch A Python library providing tensors and dynamic neural networks with strong GPU acceleration 84,978
tensorpack/tensorpack A high-performance neural network training interface for TensorFlow that focuses on speed and flexibility. 6,303
lukemelas/efficientnet-pytorch A PyTorch implementation of EfficientNet convolutional neural networks 7,971
nvidia/apex Tools for streamlined mixed precision and distributed training in PyTorch 8,460
kevinmusgrave/pytorch-metric-learning A PyTorch library for implementing deep metric learning algorithms in computer vision applications. 6,045
kakaobrain/torchgpipe A PyTorch-based library for efficient training of large neural networks using pipeline parallelism and automatic recomputation of gradients. 820
donnemartin/data-science-ipython-notebooks A comprehensive collection of data science and machine learning notebooks using Python and various deep learning frameworks. 27,601
jfzhang95/pytorch-deeplab-xception A PyTorch implementation of the DeepLab-V3-Plus model with support for multiple backbones and datasets 2,919
paddlepaddle/parl A high-performance distributed training framework for Reinforcement Learning 3,296
p-christ/deep-reinforcement-learning-algorithms-with-pytorch PyTorch implementations of popular deep reinforcement learning algorithms and environments. 5,669
dmlc/xgboost An optimized distributed gradient boosting library designed to be highly efficient and flexible 26,396
pytorch/torchtitan A native PyTorch library for training large language models using distributed parallelism and optimization techniques. 2,765
cszn/kair Image restoration toolbox with training and testing codes for various deep learning-based methods 2,994
dmmiller612/sparktorch A PyTorch implementation on Apache Spark for distributed deep learning model training and inference. 339