VLFeedback
Vision language model trainer
An annotated preference dataset and training framework for improving large vision language models.
88 stars
2 watching
2 forks
Language: Python
last commit: about 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
| This is an open-source project that proposes a novel method to train large-scale vision-language models with minimal resources and no fine-tuning required. | 94 |
| Develops and trains models for vision-language learning with decoupled language pre-training | 24 |
| Implementing a unified modal learning framework for generative vision-language models | 43 |
| A deep learning framework for training multi-modal models with vision and language capabilities. | 1,299 |
| A comprehensive computer vision library providing efficient algorithms for image analysis and feature extraction | 1,605 |
| A framework for large-scale cross-modal benchmarks and vision-language tasks in Chinese | 157 |
| A multimodal AI model that enables real-world vision-language understanding applications | 2,145 |
| An implementation of a vision language model designed for mobile devices, utilizing a lightweight downsample projector and pre-trained language models. | 1,076 |
| Pre-trains a multilingual model to bridge vision and language modalities for various downstream applications | 279 |
| A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
| A comprehensive dataset and evaluation framework for Vision-Language Instruction Tuning models | 11 |
| Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models | 63 |
| A framework for training and fine-tuning multimodal language models on various data types | 601 |
| A platform for training and deploying large language and vision models that can use tools to perform tasks | 717 |
| A PyTorch-based model training framework designed to simplify and streamline training workflows by providing a unified interface for various loss functions, optimizers, and validation metrics. | 1,822 |