torchtitan
LLM trainer
A native PyTorch library for training large language models using distributed parallelism and optimization techniques.
A native PyTorch library for large model training
3k stars
44 watching
222 forks
Language: Python
last commit: 1 day ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
pytorch/torchtune | A PyTorch library for easily authoring and experimenting with large language models | 4,479 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 35,405 |
nvidia/apex | Tools for streamlined mixed precision and distributed training in PyTorch | 8,460 |
ajbrock/biggan-pytorch | An unofficial PyTorch implementation of BigGAN for generating high-fidelity natural images | 2,874 |
huggingface/accelerate | A tool to simplify training and deployment of PyTorch models on various devices and configurations | 8,056 |
nvidia/megatron-lm | A framework for training large language models using scalable and optimized GPU techniques | 10,804 |
turboderp/exllama | A re-implementation of Llama for efficient use with quantized weights on modern GPUs | 2,783 |
pytorch/ignite | A high-level library to help with training and evaluating neural networks in PyTorch | 4,554 |
pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 84,978 |
ahmedfgad/torchga | Trains PyTorch models using a genetic algorithm | 96 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
pytorchbearer/torchbearer | A PyTorch model fitting library designed to simplify the process of training deep learning models | 636 |
kaiyangzhou/deep-person-reid | A PyTorch library for training and retraining deep neural networks for person re-identification in images and videos | 4,353 |
lyhue1991/torchkeras | A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics | 1,822 |
jfzhang95/pytorch-deeplab-xception | A PyTorch implementation of the DeepLab-V3-Plus model with support for multiple backbones and datasets | 2,919 |