GPT4RoI

Region-of-Interest Training

Training and deploying large language models on computer vision tasks using region-of-interest inputs

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

GitHub

517 stars
8 watching
25 forks
Language: Python
last commit: 6 months ago
computer-visiongptllmmultimodalityroi

Related projects:

Repository Description Stars
pku-yuangroup/languagebind Extending pretraining models to handle multiple modalities by aligning language and video representations 751
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,167
openai/lm-human-preferences Training methods and tools for fine-tuning language models using human preferences 1,240
pku-yuangroup/moe-llava A large vision-language model using a mixture-of-experts architecture to improve performance on multi-modal learning tasks 2,023
megvii-research/tlc Improves image restoration performance by converting global operations to local ones during inference 231
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
yiren-jian/blitext Develops and trains models for vision-language learning with decoupled language pre-training 24
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
byungkwanlee/moai Improves performance of vision language tasks by integrating computer vision capabilities into large language models 314
ailab-cvc/gpt4tools An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. 762
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,236
rdspring1/pytorch_gbw_lm Trains a large-scale PyTorch language model on the 1-Billion Word dataset 123
microsoft/megatron-deepspeed Research tool for training large transformer language models at scale 1,926
pku-yuangroup/video-bench Evaluates and benchmarks large language models' video understanding capabilities 121
ethanyanjiali/minchatgpt This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. 214