LVIS-INSTRUCT4V

Visual Instructions

A dataset of fine-grained visual instructions generated by prompting a large language model with images from another dataset

131 stars

3 watching

0 forks

last commit: over 2 years ago

Related projects:

Repository	Description	Stars
xuefuzhao/instructionwild	Creating a large-scale user-based instruction dataset for natural language processing research and development	455
flagopen/flaginstruct	A collection of diverse instruction corpora for improving the development and tuning of Chinese Language Models	173
vt-nlp/multiinstruct	A multimodal benchmark dataset designed to evaluate the performance of vision-language foundation models through instruction tuning.	134
orhonovich/unnatural-instructions	A collection of automatically generated instructions for training language models.	176
jy0205/lavit	A unified framework for training large language models to understand and generate visual content	544
rucaibox/comvint	Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks	18
salt-nlp/llavar	An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets.	259
ffxsam/vue-typescript-cookbook	A cookbook and resource guide for developers learning Vue.js with TypeScript	273
pvit-official/pvit	A project that extends large language models by integrating an additional region-level vision encoder to improve visual instruction tuning.	37
ncsoft/cap2qa	A dataset and implementation of a method to generate instructions based on visual data	5
aidc-ai/parrot	A method and toolkit for fine-tuning large language models to perform visual instruction tasks in multiple languages.	34
vefstathiou/so_word2vec	This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks.	40
alexcode/vue2vis	Provides Vue.js 2 adapters for Vis.js libraries	217
freedomintelligence/allava	A collection of datasets and models designed to support the training of lite vision-language models.	249
vlf-silkie/vlfeedback	An annotated preference dataset and training framework for improving large vision language models.	88