LLaVA-Plus-Codebase
Model trainer
A platform for training and deploying large language and vision models that can use tools to perform tasks
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
704 stars
11 watching
53 forks
Language: Python
Last commit: 10 months ago
Topics: agent, large-language-models, large-multimodal-models, multimodal-large-language-models, tool-use
Related projects:
Repository | Description | Stars |
---|---|---|
wisconsinaivision/vip-llava | A system designed to enable large multimodal models to understand arbitrary visual prompts | 294 |
llava-vl/llava-interactive-demo | An all-in-one demo for interactive image processing and generation | 351 |
alibaba/conv-llava | An optimization technique for large-scale image models that reduces computational requirements while maintaining performance | 104 |
vpgtrans/vpgtrans | Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 269 |
csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 588 |
bobazooba/xllm | A tool for training and fine-tuning large language models using advanced techniques | 380 |
openbmb/cpm-live | A live training platform for large-scale deep learning models, allowing community participation and collaboration in model development and deployment. | 511 |
flagai-open/aquila2 | Provides pre-trained language models and tools for fine-tuning and evaluation | 437 |
chendelong1999/polite-flamingo | Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models | 63 |
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 71 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
mlpc-ucsd/bliva | A multimodal LLM designed to handle text-rich visual questions | 269 |
vishaal27/sus-x | A method for training large-scale vision-language models with minimal resources and no fine-tuning | 94 |
volcengine/vescale | A PyTorch-based framework for training large language models in parallel on multiple devices | 663 |
microsoft/llava-med | A research project aimed at building large language and vision models for biomedical applications with capabilities comparable to GPT-4. | 1,556 |