veScale
LLM trainer
A PyTorch-based framework for training large language models in parallel on multiple devices
A PyTorch Native LLM Training Framework
679 stars
34 watching
34 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list
llm-training, pytorch
Related projects:
Repository | Description | Stars |
---|---|---|
vpgtrans/vpgtrans | Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 270 |
volcengine/verl | A flexible RL training framework designed for large language models | 427 |
open-mmlab/mmengine | Provides a flexible and configurable framework for training deep learning models with PyTorch | 1,196 |
bobazooba/xllm | A tool for training and fine-tuning large language models using advanced techniques | 387 |
erotemic/netharn | A PyTorch framework for managing and automating deep learning training loops with features like hyperparameter tracking and single-file deployments | 40 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 513 |
rdspring1/pytorch_gbw_lm | Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 |
lyhue1991/torchkeras | A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics | 1,822 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,789 |
tristandeleu/pytorch-maml-rl | A PyTorch replication of "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" for reinforcement learning tasks | 830 |
xverse-ai/xverse-7b | A multilingual large language model developed by XVERSE Technology Inc. | 50 |
graal-research/poutyne | A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities | 572 |
csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 601 |
bytedance/lynx-llm | A framework for training GPT-4-style language models with multimodal inputs using large datasets and pre-trained models | 231 |
penghao-wu/vstar | PyTorch implementation of guided visual search mechanism for multimodal LLMs | 541 |