VCD

Object hallucination mitigation method

An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
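The description above mentions contrasting output distributions from original and distorted visual inputs. The following is a minimal sketch of that idea at a single decoding step, not the repository's actual implementation. It assumes the model has already produced next-token logits twice, once with the original image and once with a distorted copy. The function names, the `alpha` contrast weight, the `beta` plausibility cutoff, and the toy logits are all hypothetical and chosen for illustration only.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats (-inf entries map to 0)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def contrastive_distribution(logits_original, logits_distorted, alpha=1.0, beta=0.1):
    """Contrast next-token logits from the original and the distorted image.

    alpha (assumed name): how strongly the distorted-input logits are subtracted.
    beta  (assumed name): tokens whose original-input probability falls below
                          beta * (max probability) are masked out, so the
                          contrast cannot promote implausible tokens.
    """
    if len(logits_original) != len(logits_distorted):
        raise ValueError("both logit vectors must cover the same vocabulary")
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")

    probs_original = softmax(logits_original)
    cutoff = beta * max(probs_original)

    contrasted = []
    for l_orig, l_dist, p_orig in zip(logits_original, logits_distorted, probs_original):
        if p_orig >= cutoff:
            # Amplify what the real image supports, penalize what survives distortion.
            contrasted.append((1.0 + alpha) * l_orig - alpha * l_dist)
        else:
            contrasted.append(-math.inf)
    return softmax(contrasted)


# Toy vocabulary and made-up logits, for illustration only.
vocab = ["dog", "cat", "frisbee"]
logits_original = [2.0, 0.5, 1.8]   # conditioned on the real image
logits_distorted = [1.0, 0.4, 2.5]  # conditioned on the distorted image

plain = softmax(logits_original)
contrasted = contrastive_distribution(logits_original, logits_distorted)
for token, p_plain, p_contrast in zip(vocab, plain, contrasted):
    print(f"{token}: plain={p_plain:.3f} contrasted={p_contrast:.3f}")
```

In this toy setup, "frisbee" stays likely even when the image is distorted, which suggests it is driven by language priors rather than visual evidence, so the contrast lowers its probability relative to plain decoding. How the distorted image is produced and how the two forward passes are wired into a real model are outside the scope of this sketch.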

GitHub

209 stars
6 watching
9 forks
Language: Python
last commit: about 2 months ago

Related projects:

Repository | Description | Stars
szq0214/dsod | A deep learning object detection system trained from scratch | 706
lartpang/ovcamo | An object segmentation algorithm for detecting camouflaged objects in images with varying backgrounds and contexts | 20
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large vision-language models | 71
yuhangzang/contextdet | An approach to detecting objects in images using multimodal large language models and contextual information | 202
bcdnlp/faithscore | Evaluates answers generated by large vision-language models to assess hallucinations | 25
rucaibox/pope | An evaluation framework for detecting object hallucinations in vision-language models | 179
junyangwang0410/haelm | A framework for detecting hallucinations in large language models | 17
yangxuntu/vrd | A framework for visual relationship detection using deep learning models and pre-processing tools | 94
shengcao-cao/hassod | A neural network architecture for object detection and instance segmentation without labeled data | 51
chenyuntc/dsod.pytorch | A PyTorch implementation of DSOD (Deeply Supervised Object Detectors), trained from scratch | 70
jingzhang617/cod-rank-localize-and-segment | A system to detect, segment, and rank camouflaged objects in images | 74
yiyangzhou/lure | Analyzes and mitigates object hallucination in large vision-language models to improve their accuracy and reliability | 134
ethanhe42/softer-nms | An object detection technique that uses bounding box regression with uncertainty estimation to improve detection accuracy | 367
1zhou-wang/memvr | An implementation of a method to mitigate hallucinations in large language models using visual re-tracing | 27
ibm/max-object-detector | A tool for detecting objects in images using deep learning models | 290