MultiInstruct

Instruction dataset

A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning.

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

GitHub

134 stars

7 watching

5 forks

Language: Python

last commit: about 3 years ago

Related projects:

Repository	Description	Stars
flagopen/flaginstruct	A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models	173
x2fd/lvis-instruct4v	A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset	131
pvit-official/pvit	A project that extends large language models by integrating an additional region-level vision encoder to improve visual instruction tuning.	37
salt-nlp/llavar	An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets.	259
mbzuai-nlp/bactrian-x	A collection of multilingual language models trained on a dataset of instructions and responses in various languages.	94
opendatalab/vigc	Autonomously generates high-quality image-text instruction fine-tuning datasets	91
orhonovich/unnatural-instructions	A collection of automatically generated instructions for training language models.	176
xuefuzhao/instructionwild	Creating a large-scale user-based instruction dataset for natural language processing research and development	455
baai-dcai/visual-instruction-tuning	A dataset and model designed to scale visual instruction tuning using language-only GPT-4 models.	164
zjunlp/mol-instructions	A dataset and tools package designed to support the training and evaluation of large language models for molecular biology tasks	255
philipperemy/timit	A collection of acoustic and phonetic speech data designed for training and evaluating automatic speech recognition systems	297
michael-wzhu/promptcblue	A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain	328
dcdmllm/cheetah	A large language model designed to understand and generate instructions with accompanying visual content	360
rucaibox/comvint	Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks	18
aidc-ai/parrot	A method and toolkit for fine-tuning large language models to perform visual instruction tasks in multiple languages.	34