veScale

LLM trainer

A PyTorch-based framework for training large language models in parallel on multiple devices

A PyTorch Native LLM Training Framework

GitHub

679 stars
34 watching
34 forks
Language: Python
last commit: 5 months ago
Linked from 1 awesome list

llm-trainingpytorch

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
vpgtrans/vpgtrans Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs 270
volcengine/verl A flexible RL training framework designed for large language models 427
open-mmlab/mmengine Provides a flexible and configurable framework for training deep learning models with PyTorch. 1,196
bobazooba/xllm A tool for training and fine-tuning large language models using advanced techniques 387
erotemic/netharn A PyTorch framework for managing and automating deep learning training loops with features like hyperparameter tracking and single-file deployments. 40
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 513
rdspring1/pytorch_gbw_lm Trains a large-scale PyTorch language model on the 1-Billion Word dataset 123
lyhue1991/torchkeras A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics. 1,822
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,789
tristandeleu/pytorch-maml-rl Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks 830
xverse-ai/xverse-7b A multilingual large language model developed by XVERSE Technology Inc. 50
graal-research/poutyne A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities 572
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 601
bytedance/lynx-llm A framework for training GPT4-style language models with multimodal inputs using large datasets and pre-trained models 231
penghao-wu/vstar PyTorch implementation of guided visual search mechanism for multimodal LLMs 541