Libra

Decoupled vision system

An implementation of a decoupled vision system built on large language models

A simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted at ICML 2024)

GitHub

153 stars
2 watching
2 forks
Language: Python
Last commit: about 2 months ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| yiren-jian/blitext | Develops and trains models for vision-language learning with decoupled language pre-training | 24 |
| yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 75 |
| shizhediao/davinci | Implementing a unified modal learning framework for generative vision-language models | 43 |
| algolzw/daclip-uir | This project controls vision-language models to restore degraded images in various environments and conditions | 673 |
| byungkwanlee/moai | Improves performance of vision language tasks by integrating computer vision capabilities into large language models | 314 |
| jiahuadong/fiss | Implementations of federated incremental semantic segmentation in PyTorch | 34 |
| byungkwanlee/collavo | Develops a PyTorch implementation of an enhanced vision language model | 93 |
| luispedro/mahotas | A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays | 855 |
| luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 513 |
| yunlongdong/fcn-pytorch | A PyTorch implementation of FCN for semantic segmentation with an easy-to-use interface and pre-trained models | 161 |
| fyu/dilation | This project provides a deep learning framework implementing dilated convolutions for semantic image segmentation | 782 |
| nickjiang2378/vl-interp | This project provides an official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions | 46 |
| yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability | 136 |
| codeslake/ifan | Implementation of an algorithm for single image deblurring in images with defocus blur | 228 |
| baaivision/eve | A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities | 246 |