vl-interp

Hallucination mitigation

This project provides an official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions.

Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"

GitHub

42 stars
5 watching
5 forks
Language: Python
last commit: 15 days ago

Related projects:

Repository Description Stars
xiadingz/video-caption.pytorch PyTorch implementation of video captioning, combining deep learning and computer vision techniques. 401
fartashf/vsepp A PyTorch implementation of visual-semantic embedding methods for image-caption retrieval 491
kacky24/stylenet A PyTorch implementation of a framework for generating captions with styles for images and videos. 63
ruotianluo/self-critical.pytorch An implementation of Self-critical Sequence Training for Image Captioning and related techniques. 998
yiwuzhong/sub-gc A PyTorch implementation of image captioning models via scene graph decomposition. 96
ruotianluo/imagecaptioning.pytorch A Python-based framework for training and testing image captioning models using PyTorch. 1,456
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,227
billchan226/halc An implementation of an object hallucination reduction method using a PyTorch framework and various decoding algorithms. 70
byungkwanlee/collavo Develops a PyTorch implementation of an enhanced vision language model 93
woozzu/dong_iccv_2017 An implementation of semantic image synthesis via adversarial learning using PyTorch 145
mrzhu-cool/pix2pix-pytorch Implementing Image-to-Image Translation Using Conditional Adversarial Networks in PyTorch 428
byungkwanlee/moai Improves performance of vision language tasks by integrating computer vision capabilities into large language models 311
boyuanjiang/context_encoder_pytorch A PyTorch implementation of a deep learning model for inpainting images using contextual information 366
yiyangzhou/lure Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. 135
lalbj/pai Improves the performance of large language models by intervening in their internal workings to reduce hallucinations 74