byteps

Distributed DL framework

A high-performance distributed deep learning framework supporting multiple frameworks and networks

A high performance and generic framework for distributed DNN training

GitHub

4k stars
84 watching
488 forks
Language: Python
last commit: about 1 year ago
Linked from 1 awesome list

deep-learningdistributed-trainingkerasmachine-learningmxnetpytorchtensorflow

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
facebookresearch/fairscale A PyTorch extension library that provides high-performance and large-scale training techniques. 3,193
pytorch/pytorch A Python library providing tensors and dynamic neural networks with strong GPU acceleration 83,959
tensorpack/tensorpack A high-performance neural network training interface for TensorFlow that focuses on speed and flexibility. 6,303
lukemelas/efficientnet-pytorch A PyTorch implementation of EfficientNet convolutional neural networks 7,921
nvidia/apex Tools for streamlined mixed precision and distributed training in PyTorch 8,407
kevinmusgrave/pytorch-metric-learning A PyTorch-based library for easy implementation of deep metric learning in applications. 6,012
kakaobrain/torchgpipe A PyTorch-based library for efficient training of large neural networks using pipeline parallelism and automatic recomputation of gradients. 817
donnemartin/data-science-ipython-notebooks A comprehensive collection of data science and machine learning notebooks using Python and various deep learning frameworks. 27,470
jfzhang95/pytorch-deeplab-xception A PyTorch implementation of the DeepLab-V3-Plus model with support for multiple backbones and datasets 2,910
paddlepaddle/parl A high-performance distributed training framework for Reinforcement Learning 3,273
p-christ/deep-reinforcement-learning-algorithms-with-pytorch PyTorch implementations of popular deep reinforcement learning algorithms and environments. 5,640
dmlc/xgboost An optimized distributed gradient boosting library designed to be highly efficient and flexible 26,299
pytorch/torchtitan A native PyTorch library for large-scale language model training with distributed training capabilities 2,615
cszn/kair Image restoration toolbox with training and testing codes for various deep learning-based methods 2,957
dmmiller612/sparktorch A PyTorch implementation on Apache Spark for distributed deep learning model training and inference. 339