Libra

Decoupled vision system

An implementation of a decoupled vision system using large language models

A simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted at ICML 2024)

Stars: 143
Watching: 2
Forks: 2
Language: Python
Last commit: about 1 month ago

Related projects:

| Repository | Description | Stars |
|---|---|---|
| yiren-jian/blitext | Develops and trains models for vision-language learning with decoupled language pre-training | 24 |
| yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 71 |
| shizhediao/davinci | An implementation of vision-language models for multimodal learning tasks, enabling generative vision-language models to be fine-tuned for various applications | 43 |
| algolzw/daclip-uir | This project controls vision-language models to restore degraded images in various environments and conditions | 662 |
| byungkwanlee/moai | Improves performance of vision language tasks by integrating computer vision capabilities into large language models | 311 |
| jiahuadong/fiss | Implementations of federated incremental semantic segmentation in PyTorch | 33 |
| byungkwanlee/collavo | Develops a PyTorch implementation of an enhanced vision language model | 93 |
| luispedro/mahotas | A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays | 844 |
| luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |
| yunlongdong/fcn-pytorch | A PyTorch implementation of FCN for semantic segmentation with an easy-to-use interface and pre-trained models | 160 |
| fyu/dilation | This project provides a deep learning framework implementing dilated convolutions for semantic image segmentation | 781 |
| nickjiang2378/vl-interp | This project provides an official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions | 31 |
| yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability | 134 |
| codeslake/ifan | Implementation of an algorithm for single image deblurring in images with defocus blur | 227 |
| baaivision/eve | A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities | 230 |