Libra
Decoupled vision system
An implementation of a decoupled vision system using large language models
Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)
143 stars
2 watching
2 forks
Language: Python
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
yiren-jian/blitext | Develops and trains models for vision-language learning with decoupled language pre-training | 24 |
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 71 |
shizhediao/davinci | An implementation of vision-language models for multimodal learning tasks, enabling generative vision-language models to be fine-tuned for various applications. | 43 |
algolzw/daclip-uir | This project controls vision-language models to restore degraded images in various environments and conditions. | 662 |
byungkwanlee/moai | Improves performance of vision language tasks by integrating computer vision capabilities into large language models | 311 |
jiahuadong/fiss | Implementations of federated incremental semantic segmentation in PyTorch. | 33 |
byungkwanlee/collavo | Develops a PyTorch implementation of an enhanced vision language model | 93 |
luispedro/mahotas | A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays. | 844 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |
yunlongdong/fcn-pytorch | A PyTorch implementation of FCN for semantic segmentation with an easy-to-use interface and pre-trained models. | 160 |
fyu/dilation | This project provides a deep learning framework implementing dilated convolutions for semantic image segmentation | 781 |
nickjiang2378/vl-interp | This project provides an official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions. | 31 |
yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. | 134 |
codeslake/ifan | Implementation of an algorithm for single image deblurring in images with defocus blur | 227 |
baaivision/eve | A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities | 230 |