LLaVA-Plus-Codebase
Model trainer
A platform for training and deploying large language and vision models that can use tools to perform tasks
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
717 stars
12 watching
53 forks
Language: Python
last commit: about 1 year ago
Topics: agent, large-language-models, large-multimodal-models, multimodal-large-language-models, tool-use
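The description above centers on multimodal models that "use tools to perform tasks." As a rough illustration of that idea only, here is a minimal, hypothetical sketch of a skill registry and a toy router; the function names (`caption_image`, `detect_objects`, `route_request`), the `SKILLS` dictionary, and the keyword-based selection are illustrative assumptions, not the actual LLaVA-Plus API.

```python
# Hypothetical sketch of the "plug and use skills" idea: register vision tools
# as callables, then dispatch a user request to one of them. In the real system
# the multimodal model itself decides which tool to invoke; here a keyword
# heuristic stands in for that decision purely for illustration.

from typing import Callable, Dict


def caption_image(image_path: str) -> str:
    """Placeholder skill: would call an image-captioning model."""
    return f"[caption for {image_path}]"


def detect_objects(image_path: str) -> str:
    """Placeholder skill: would call an object detector."""
    return f"[objects detected in {image_path}]"


# Skill registry: the "plug" part -- new tools are added by registering a callable.
SKILLS: Dict[str, Callable[[str], str]] = {
    "caption": caption_image,
    "detect": detect_objects,
}


def route_request(request: str, image_path: str) -> str:
    """Toy router: pick a registered skill and invoke it on the image."""
    skill_name = "detect" if "find" in request.lower() else "caption"
    return SKILLS[skill_name](image_path)


if __name__ == "__main__":
    print(route_request("Find the dogs in this photo", "photo.jpg"))
    print(route_request("Describe this photo", "photo.jpg"))
```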
Related projects:
| Repository | Description | Stars |
| --- | --- | --- |
| | A system designed to enable large multimodal models to understand arbitrary visual prompts | 302 |
| | An all-in-one demo for interactive image processing and generation | 353 |
| | An optimization technique for large-scale image models that reduces computational requirements while maintaining performance | 106 |
| | Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 270 |
| | A framework for training and fine-tuning multimodal language models on various data types | 601 |
| | A tool for training and fine-tuning large language models using advanced techniques | 387 |
| | A live training platform for large-scale deep learning models, allowing community participation and collaboration in model development and deployment | 511 |
| | Provides pre-trained language models and tools for fine-tuning and evaluation | 439 |
| | Develops training methods to improve the politeness and natural flow of multimodal large language models | 63 |
| | Debiasing techniques to minimize hallucinations in large visual language models | 75 |
| | A guide to using pre-trained large language models in source code analysis and generation | 1,789 |
| | A multimodal LLM designed to handle text-rich visual questions | 270 |
| | An open-source project proposing a method to train large-scale vision-language models with minimal resources and no fine-tuning required | 94 |
| | A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
| | A research project building large language and vision models for biomedical applications with capabilities comparable to GPT-4 | 1,622 |