MultiInstruct
Instruction dataset
A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning.
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
133 stars
7 watching
5 forks
Language: Python
last commit: over 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
flagopen/flaginstruct | A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models | 173 |
x2fd/lvis-instruct4v | A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset | 131 |
pvit-official/pvit | A project that extends large language models by integrating an additional region-level vision encoder to improve visual instruction tuning. | 36 |
salt-nlp/llavar | An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets. | 258 |
mbzuai-nlp/bactrian-x | A collection of multilingual language models trained on a dataset of instructions and responses in various languages. | 94 |
opendatalab/vigc | Autonomously generates high-quality image-text instruction fine-tuning datasets | 90 |
orhonovich/unnatural-instructions | A collection of automatically generated instructions for training language models. | 175 |
xuefuzhao/instructionwild | Creating a large-scale user-based instruction dataset for natural language processing research and development | 453 |
baai-dcai/visual-instruction-tuning | A dataset and model designed to scale visual instruction tuning using language-only GPT-4 models. | 163 |
zjunlp/mol-instructions | A dataset and tools package designed to support the training and evaluation of large language models for molecular biology tasks | 252 |
philipperemy/timit | A collection of acoustic and phonetic speech data designed for training and evaluating automatic speech recognition systems | 294 |
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 323 |
dcdmllm/cheetah | A large language model designed to understand and generate instructions with accompanying visual content | 356 |
rucaibox/comvint | Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks | 18 |
aidc-ai/parrot | A method and toolkit for fine-tuning large language models to perform visual instruction tasks in multiple languages. | 32 |