veScale
LLM trainer
A PyTorch-based framework for training large language models in parallel on multiple devices
A PyTorch Native LLM Training Framework
663 stars
34 watching
34 forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list
llm-trainingpytorch
Related projects:
Repository | Description | Stars |
---|---|---|
vpgtrans/vpgtrans | Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 269 |
volcengine/verl | A flexible and efficient reinforcement learning framework designed for large language models. | 315 |
open-mmlab/mmengine | Provides a flexible and configurable framework for training deep learning models with PyTorch. | 1,179 |
bobazooba/xllm | A tool for training and fine-tuning large language models using advanced techniques | 380 |
erotemic/netharn | A PyTorch framework for managing and automating deep learning training loops with features like hyperparameter tracking and single-file deployments. | 39 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |
rdspring1/pytorch_gbw_lm | Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 |
lyhue1991/torchkeras | A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics. | 1,782 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 827 |
xverse-ai/xverse-7b | A multilingual large language model developed by XVERSE Technology Inc. | 50 |
graal-research/poutyne | A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities | 569 |
csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 588 |
bytedance/lynx-llm | A framework for training GPT4-style language models with multimodal inputs using large datasets and pre-trained models | 229 |
penghao-wu/vstar | PyTorch implementation of guided visual search mechanism for multimodal LLMs | 527 |