torchtitan

LLM trainer

A native PyTorch library for training large language models using distributed parallelism and optimization techniques.

A native PyTorch library for large model training

GitHub

3k stars
44 watching
222 forks
Language: Python
Last commit: 1 day ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| pytorch/torchtune | A PyTorch library for easily authoring and experimenting with large language models | 4,479 |
| rasbt/llms-from-scratch | Developing and pretraining a GPT-like large language model from scratch | 35,405 |
| nvidia/apex | Tools for streamlined mixed precision and distributed training in PyTorch | 8,460 |
| ajbrock/biggan-pytorch | An unofficial PyTorch implementation of BigGAN for generating high-fidelity natural images | 2,874 |
| huggingface/accelerate | A tool to simplify training and deployment of PyTorch models on various devices and configurations | 8,056 |
| nvidia/megatron-lm | A framework for training large language models using scalable and optimized GPU techniques | 10,804 |
| turboderp/exllama | A re-implementation of Llama for efficient use with quantized weights on modern GPUs | 2,783 |
| pytorch/ignite | A high-level library to help with training and evaluating neural networks in PyTorch | 4,554 |
| pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 84,978 |
| ahmedfgad/torchga | Trains PyTorch models using a genetic algorithm | 96 |
| alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
| pytorchbearer/torchbearer | A PyTorch model fitting library designed to simplify the process of training deep learning models | 636 |
| kaiyangzhou/deep-person-reid | A PyTorch library for training and retraining deep neural networks for person re-identification in images and videos | 4,353 |
| lyhue1991/torchkeras | A PyTorch-based training framework that streamlines training workflows through a unified interface for loss functions, optimizers, and validation metrics | 1,822 |
| jfzhang95/pytorch-deeplab-xception | A PyTorch implementation of the DeepLab-V3-Plus model with support for multiple backbones and datasets | 2,919 |