Libra

Decoupled vision system

An implementation of a decoupled vision system using large language models

Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)

153 stars

2 watching

2 forks

Language: Python

last commit: 8 months ago

Related projects:

Repository	Description	Stars
yiren-jian/blitext	Develops and trains models for vision-language learning with decoupled language pre-training	24
yfzhang114/llava-align	Debiasing techniques to minimize hallucinations in large visual language models	75
shizhediao/davinci	Implementing a unified modal learning framework for generative vision-language models	43
algolzw/daclip-uir	Controls vision-language models to restore degraded images under a variety of conditions	673
byungkwanlee/moai	Improves performance of vision language tasks by integrating computer vision capabilities into large language models	314
jiahuadong/fiss	Implementations of federated incremental semantic segmentation in PyTorch.	34
byungkwanlee/collavo	Develops a PyTorch implementation of an enhanced vision language model	93
luispedro/mahotas	A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays.	855
luogen1996/lavin	An open-source implementation of a vision-language instructed large language model	513
yunlongdong/fcn-pytorch	A PyTorch implementation of FCN for semantic segmentation with an easy-to-use interface and pre-trained models.	161
fyu/dilation	A deep learning framework implementing dilated convolutions for semantic image segmentation	782
nickjiang2378/vl-interp	Official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions	46
yiyangzhou/lure	Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability.	136
codeslake/ifan	Implementation of an algorithm for single-image defocus deblurring	228
baaivision/eve	A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities	246