VPGTrans

LLM trainer

Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

GitHub

270 stars

6 watching

25 forks

Language: Python

last commit: over 2 years ago

large-scale-language-modelingllmvision-language-modelvl-llm

vpgtrans.github.io/

Related projects:

Repository	Description	Stars
volcengine/vescale	A PyTorch-based framework for training large language models in parallel on multiple devices	679
vhellendoorn/code-lms	A guide to using pre-trained large language models in source code analysis and generation	1,789
opengvlab/visionllm	A large language model designed to process and generate visual information	956
evolvinglmms-lab/longva	An open-source project that enables the transfer of language understanding to vision capabilities through long context processing.	347
llava-vl/llava-plus-codebase	A platform for training and deploying large language and vision models that can use tools to perform tasks	717
luogen1996/lavin	An open-source implementation of a vision-language instructed large language model	513
shm007g/llama-cult-and-more	Provides insights and practical guides for building and using large language models.	427
wisconsinaivision/vip-llava	A system designed to enable large multimodal models to understand arbitrary visual prompts	302
volcengine/verl	A flexible RL training framework designed for large language models	427
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336
rdspring1/pytorch_gbw_lm	Trains a large-scale PyTorch language model on the 1-Billion Word dataset	123
gmftbygmftby/science-llm	A large-scale language model for scientific domain training on redpajama arXiv split	125
openai/finetune-transformer-lm	This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture.	2,167
chendelong1999/polite-flamingo	Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models	63
xverse-ai/xverse-7b	A multilingual large language model developed by XVERSE Technology Inc.	50