VPGTrans

LLM trainer

Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

GitHub

270 stars
6 watching
25 forks
Language: Python
last commit: over 2 years ago
large-scale-language-modelingllmvision-language-modelvl-llm

Related projects:

Repository Description Stars
volcengine/vescale A PyTorch-based framework for training large language models in parallel on multiple devices 679
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,789
opengvlab/visionllm A large language model designed to process and generate visual information 956
evolvinglmms-lab/longva An open-source project that enables the transfer of language understanding to vision capabilities through long context processing. 347
llava-vl/llava-plus-codebase A platform for training and deploying large language and vision models that can use tools to perform tasks 717
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 513
shm007g/llama-cult-and-more Provides insights and practical guides for building and using large language models. 427
wisconsinaivision/vip-llava A system designed to enable large multimodal models to understand arbitrary visual prompts 302
volcengine/verl A flexible RL training framework designed for large language models 427
lxtgh/omg-seg Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. 1,336
rdspring1/pytorch_gbw_lm Trains a large-scale PyTorch language model on the 1-Billion Word dataset 123
gmftbygmftby/science-llm A large-scale language model for scientific domain training on redpajama arXiv split 125
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,167
chendelong1999/polite-flamingo Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models 63
xverse-ai/xverse-7b A multilingual large language model developed by XVERSE Technology Inc. 50