accelerate

Model trainer

A tool to simplify training and deployment of PyTorch models on various devices and configurations

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

GitHub

8k stars
97 watching
968 forks
Language: Python
last commit: 6 days ago
Linked from 3 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
nvidia/apex Tools for streamlined mixed precision and distributed training in PyTorch 8,407
pytorch/pytorch A Python library providing tensors and dynamic neural networks with strong GPU acceleration 83,959
pytorch/torchtitan A native PyTorch library for large-scale language model training with distributed training capabilities 2,615
huggingface/optimum A toolkit providing optimization tools and hardware acceleration for training and inference of machine learning models 2,572
pytorch/serve A tool for serving and scaling PyTorch models in production environments 4,217
facebookresearch/fairscale A PyTorch extension library that provides high-performance and large-scale training techniques. 3,193
rtqichen/torchdiffeq A PyTorch implementation of differentiable ODE solvers with GPU support and efficient backpropagation 5,588
pytorch/ignite A high-level library to help with training and evaluating neural networks in PyTorch 4,526
zllrunning/face-parsing.pytorch Implementation of a face parsing model using PyTorch and a modified BiSeNet architecture. 2,317
amdegroot/ssd.pytorch An implementation of a deep learning-based object detection system in PyTorch. 5,146
catalyst-team/catalyst A PyTorch framework for accelerating deep learning research and development by focusing on reproducibility, rapid experimentation, and codebase reuse. 3,295
moskomule/senet.pytorch An implementation of Squeeze-and-Excitation Networks (SE-Nets) for deep learning image classification tasks 2,285
huggingface/peft An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters 16,437
jwyang/faster-rcnn.pytorch An object detection implementation built on top of PyTorch, supporting multi-image batch training and multiple GPUs. 7,693
pytorch/glow A compiler and execution engine for neural networks that generates optimized code for hardware accelerators 3,235