VIGC

Instruction dataset generator

Autonomously generates high-quality image-text instruction fine-tuning datasets

AAAI 2024: Visual Instruction Generation and Correction

91 stars

5 watching

3 forks

Language: Python

Last commit: over 1 year ago

opendatalab.github.io/VIGC/

Related projects:

Repository	Description	Stars
rucaibox/comvint	Creates synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks	18
opendatalab/mllm-dataengine	Automates data generation and model training for improving MLLM capabilities	39
avi-d-coder/implicit-hie	Automates the creation of cabal or stack configuration files for multi-component Haskell projects.	204
cvlab-columbia/viper	A framework for generating and executing Python code to solve visual inference tasks using large language models	1,666
dense-analysis/neural	An AI-powered plugin for Vim and Neovim that generates code and performs tasks such as text completion and error checking.	472
bin123apple/autocoder	An AI model designed to generate and execute code automatically	816
iamwangyunkai/carla_py	Generates data for CARLA's visual navigation system using raw camera images and instructions.	8
ncsoft/cap2qa	A dataset and implementation of a method to generate instructions based on visual data	5
sauci/pydbc	Generates an Abstract Syntax Tree based on DBC-formatted strings	2
aidc-ai/parrot	A method and toolkit for fine-tuning large language models to perform visual instruction tasks in multiple languages.	34
h21lab/5gc_build	A tool for generating code from 5G API definitions using OpenAPI generators	14
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336
ranjaykrishna/visual_genome_python_driver	A Python wrapper providing access to the Visual Genome dataset by downloading and parsing local data	357
baai-dcai/visual-instruction-tuning	A dataset and model designed to scale visual instruction tuning using language-only GPT-4 models.	164
taoxugit/attngan	Reproduces text-to-image generation with attentional generative adversarial networks.	1,343