VLFeedback
Vision language model trainer
An annotated preference dataset and training framework for improving large vision language models.
88 stars
2 watching
2 forks
Language: Python
last commit: almost 2 years ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | This is an open-source project that proposes a novel method to train large-scale vision-language models with minimal resources and no fine-tuning required. | 94 |
| | Develops and trains models for vision-language learning with decoupled language pre-training | 24 |
| | Implementing a unified modal learning framework for generative vision-language models | 43 |
| | A deep learning framework for training multi-modal models with vision and language capabilities. | 1,299 |
| | A comprehensive computer vision library providing efficient algorithms for image analysis and feature extraction | 1,605 |
| | A framework for large-scale cross-modal benchmarks and vision-language tasks in Chinese | 157 |
| | A multimodal AI model that enables real-world vision-language understanding applications | 2,145 |
| | An implementation of a vision language model designed for mobile devices, utilizing a lightweight downsample projector and pre-trained language models. | 1,076 |
| | Pre-trains a multilingual model to bridge vision and language modalities for various downstream applications | 279 |
| | A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
| | A comprehensive dataset and evaluation framework for Vision-Language Instruction Tuning models | 11 |
| | Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models | 63 |
| | A framework for training and fine-tuning multimodal language models on various data types | 601 |
| | A platform for training and deploying large language and vision models that can use tools to perform tasks | 717 |
| | A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics. | 1,822 |