veScale

LLM trainer

A PyTorch-based framework for training large language models in parallel across multiple devices

A PyTorch Native LLM Training Framework

679 stars

34 watching

34 forks

Language: Python

Last commit: 11 months ago

Linked from 1 awesome list

llm-training, pytorch

Backlinks from these awesome lists:

ethicalml/awesome-production-machine-learning

Related projects:

Repository	Description	Stars
vpgtrans/vpgtrans	Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs	270
volcengine/verl	A flexible RL training framework designed for large language models	427
open-mmlab/mmengine	Provides a flexible and configurable framework for training deep learning models with PyTorch	1,196
bobazooba/xllm	A tool for training and fine-tuning large language models using advanced techniques	387
erotemic/netharn	A PyTorch framework for managing and automating deep learning training loops with features like hyperparameter tracking and single-file deployments	40
luogen1996/lavin	An open-source implementation of a vision-language instructed large language model	513
rdspring1/pytorch_gbw_lm	Trains a large-scale PyTorch language model on the 1-Billion Word dataset	123
lyhue1991/torchkeras	A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics	1,822
vhellendoorn/code-lms	A guide to using pre-trained large language models in source code analysis and generation	1,789
tristandeleu/pytorch-maml-rl	Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks	830
xverse-ai/xverse-7b	A multilingual large language model developed by XVERSE Technology Inc.	50
graal-research/poutyne	A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities	572
csuhan/onellm	A framework for training and fine-tuning multimodal language models on various data types	601
bytedance/lynx-llm	A framework for training GPT-4-style language models with multimodal inputs using large datasets and pre-trained models	231
penghao-wu/vstar	PyTorch implementation of guided visual search mechanism for multimodal LLMs	541