GPT4RoI
Region-of-Interest Training
Training and deploying large language models on computer vision tasks using region-of-interest inputs
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
517 stars
8 watching
25 forks
Language: Python
last commit: 6 months ago computer-visiongptllmmultimodalityroi
Related projects:
Repository | Description | Stars |
---|---|---|
pku-yuangroup/languagebind | Extending pretraining models to handle multiple modalities by aligning language and video representations | 751 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
openai/lm-human-preferences | Training methods and tools for fine-tuning language models using human preferences | 1,240 |
pku-yuangroup/moe-llava | A large vision-language model using a mixture-of-experts architecture to improve performance on multi-modal learning tasks | 2,023 |
megvii-research/tlc | Improves image restoration performance by converting global operations to local ones during inference | 231 |
shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
yiren-jian/blitext | Develops and trains models for vision-language learning with decoupled language pre-training | 24 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
byungkwanlee/moai | Improves performance of vision language tasks by integrating computer vision capabilities into large language models | 314 |
ailab-cvc/gpt4tools | An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. | 762 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,236 |
rdspring1/pytorch_gbw_lm | Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 |
microsoft/megatron-deepspeed | Research tool for training large transformer language models at scale | 1,926 |
pku-yuangroup/video-bench | Evaluates and benchmarks large language models' video understanding capabilities | 121 |
ethanyanjiali/minchatgpt | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 214 |