MultiInstruct
Instruction dataset
A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning.
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
134 stars
7 watching
5 forks
Language: Python
last commit: over 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models | 173 |
| A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset | 131 |
| A project that extends large language models by integrating an additional region-level vision encoder to improve visual instruction tuning. | 37 |
| An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets. | 259 |
| A collection of multilingual language models trained on a dataset of instructions and responses in various languages. | 94 |
| Autonomously generates high-quality image-text instruction fine-tuning datasets | 91 |
| A collection of automatically generated instructions for training language models. | 176 |
| Creating a large-scale user-based instruction dataset for natural language processing research and development | 455 |
| A dataset and model designed to scale visual instruction tuning using language-only GPT-4 models. | 164 |
| A dataset and tools package designed to support the training and evaluation of large language models for molecular biology tasks | 255 |
| A collection of acoustic and phonetic speech data designed for training and evaluating automatic speech recognition systems | 297 |
| A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 328 |
| A large language model designed to understand and generate instructions with accompanying visual content | 360 |
| Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks | 18 |
| A method and toolkit for fine-tuning large language models to perform visual instruction tasks in multiple languages. | 34 |