GPT4RoI

Region-of-Interest Training

Training and deploying large language models on computer vision tasks using region-of-interest inputs

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

GitHub

517 stars

8 watching

25 forks

Language: Python

last commit: about 1 year ago

computer-visiongptllmmultimodalityroi

Related projects:

Repository	Description	Stars
pku-yuangroup/languagebind	Extending pretraining models to handle multiple modalities by aligning language and video representations	751
openai/finetune-transformer-lm	This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture.	2,167
openai/lm-human-preferences	Training methods and tools for fine-tuning language models using human preferences	1,240
pku-yuangroup/moe-llava	A large vision-language model using a mixture-of-experts architecture to improve performance on multi-modal learning tasks	2,023
megvii-research/tlc	Improves image restoration performance by converting global operations to local ones during inference	231
shawn-ieitsystems/yuan-1.0	Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing	591
yiren-jian/blitext	Develops and trains models for vision-language learning with decoupled language pre-training	24
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
byungkwanlee/moai	Improves performance of vision language tasks by integrating computer vision capabilities into large language models	314
ailab-cvc/gpt4tools	An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings.	762
kaiyangzhou/dassl.pytorch	A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision.	1,236
rdspring1/pytorch_gbw_lm	Trains a large-scale PyTorch language model on the 1-Billion Word dataset	123
microsoft/megatron-deepspeed	Research tool for training large transformer language models at scale	1,926
pku-yuangroup/video-bench	Evaluates and benchmarks large language models' video understanding capabilities	121
ethanyanjiali/minchatgpt	This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2.	214