MultiInstruct

Instruction dataset

A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning.

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

GitHub

133 stars
7 watching
5 forks
Language: Python
last commit: over 1 year ago

Related projects:

Repository Description Stars
flagopen/flaginstruct A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models 173
x2fd/lvis-instruct4v A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset 131
pvit-official/pvit A project that extends large language models by integrating an additional region-level vision encoder to improve visual instruction tuning. 36
salt-nlp/llavar An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets. 258
mbzuai-nlp/bactrian-x A collection of multilingual language models trained on a dataset of instructions and responses in various languages. 94
opendatalab/vigc Autonomously generates high-quality image-text instruction fine-tuning datasets 90
orhonovich/unnatural-instructions A collection of automatically generated instructions for training language models. 175
xuefuzhao/instructionwild Creating a large-scale user-based instruction dataset for natural language processing research and development 453
baai-dcai/visual-instruction-tuning A dataset and model designed to scale visual instruction tuning using language-only GPT-4 models. 163
zjunlp/mol-instructions A dataset and tools package designed to support the training and evaluation of large language models for molecular biology tasks 252
philipperemy/timit A collection of acoustic and phonetic speech data designed for training and evaluating automatic speech recognition systems 294
michael-wzhu/promptcblue A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain 323
dcdmllm/cheetah A large language model designed to understand and generate instructions with accompanying visual content 356
rucaibox/comvint Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks 18
aidc-ai/parrot A method and toolkit for fine-tuning large language models to perform visual instruction tasks in multiple languages. 30