LLaVA-Plus-Codebase

Model trainer

A platform for training and deploying large language and vision models that can use tools to perform tasks

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

GitHub

704 stars
11 watching
53 forks
Language: Python
last commit: 10 months ago
agentlarge-language-modelslarge-multimodal-modelsmultimodal-large-language-modelstool-use

Related projects:

Repository Description Stars
wisconsinaivision/vip-llava A system designed to enable large multimodal models to understand arbitrary visual prompts 294
llava-vl/llava-interactive-demo An all-in-one demo for interactive image processing and generation 351
alibaba/conv-llava This project presents an optimization technique for large-scale image models to reduce computational requirements while maintaining performance. 104
vpgtrans/vpgtrans Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs 269
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 588
bobazooba/xllm A tool for training and fine-tuning large language models using advanced techniques 380
openbmb/cpm-live A live training platform for large-scale deep learning models, allowing community participation and collaboration in model development and deployment. 511
flagai-open/aquila2 Provides pre-trained language models and tools for fine-tuning and evaluation 437
chendelong1999/polite-flamingo Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models 63
yfzhang114/llava-align Debiasing techniques to minimize hallucinations in large visual language models 71
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,782
mlpc-ucsd/bliva A multimodal LLM designed to handle text-rich visual questions 269
vishaal27/sus-x This is an open-source project that proposes a novel method to train large-scale vision-language models with minimal resources and no fine-tuning required. 94
volcengine/vescale A PyTorch-based framework for training large language models in parallel on multiple devices 663
microsoft/llava-med A research project aimed at building large language and vision models for biomedical applications with capabilities comparable to GPT-4. 1,556