veScale

LLM trainer

A PyTorch-based framework for training large language models in parallel on multiple devices

A PyTorch Native LLM Training Framework

GitHub

663 stars
34 watching
34 forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list

llm-trainingpytorch

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
vpgtrans/vpgtrans Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs 269
volcengine/verl A flexible and efficient reinforcement learning framework designed for large language models. 315
open-mmlab/mmengine Provides a flexible and configurable framework for training deep learning models with PyTorch. 1,179
bobazooba/xllm A tool for training and fine-tuning large language models using advanced techniques 380
erotemic/netharn A PyTorch framework for managing and automating deep learning training loops with features like hyperparameter tracking and single-file deployments. 39
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 508
rdspring1/pytorch_gbw_lm Trains a large-scale PyTorch language model on the 1-Billion Word dataset 123
lyhue1991/torchkeras A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics. 1,782
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,782
tristandeleu/pytorch-maml-rl Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks 827
xverse-ai/xverse-7b A multilingual large language model developed by XVERSE Technology Inc. 50
graal-research/poutyne A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities 569
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 588
bytedance/lynx-llm A framework for training GPT4-style language models with multimodal inputs using large datasets and pre-trained models 229
penghao-wu/vstar PyTorch implementation of guided visual search mechanism for multimodal LLMs 527