vl-interp
Hallucination mitigation
This project provides the official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations", a method for interpreting and editing vision-language representations to mitigate hallucinations in image captions.
42 stars
5 watching
5 forks
Language: Python
Last commit: 15 days ago
Related projects:
Repository | Description | Stars |
---|---|---|
xiadingz/video-caption.pytorch | PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 401 |
fartashf/vsepp | A PyTorch implementation of visual-semantic embedding methods for image-caption retrieval | 491 |
kacky24/stylenet | A PyTorch implementation of a framework for generating captions with styles for images and videos. | 63 |
ruotianluo/self-critical.pytorch | An implementation of Self-critical Sequence Training for Image Captioning and related techniques. | 998 |
yiwuzhong/sub-gc | A PyTorch implementation of image captioning models via scene graph decomposition. | 96 |
ruotianluo/imagecaptioning.pytorch | A Python-based framework for training and testing image captioning models using PyTorch. | 1,456 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,227 |
billchan226/halc | An implementation of an object hallucination reduction method using a PyTorch framework and various decoding algorithms. | 70 |
byungkwanlee/collavo | A PyTorch implementation of an enhanced vision-language model. | 93 |
woozzu/dong_iccv_2017 | An implementation of semantic image synthesis via adversarial learning using PyTorch. | 145 |
mrzhu-cool/pix2pix-pytorch | A PyTorch implementation of image-to-image translation using conditional adversarial networks. | 428 |
byungkwanlee/moai | Improves performance on vision-language tasks by integrating computer vision capabilities into large language models. | 311 |
boyuanjiang/context_encoder_pytorch | A PyTorch implementation of a deep learning model for inpainting images using contextual information. | 366 |
yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. | 135 |
lalbj/pai | Reduces hallucinations in large language models by intervening in their internal workings. | 74 |