FAITHSCORE

Hallucination detector

Evaluates answers generated by large vision-language models to assess hallucinations

27 stars

2 watching

4 forks

Language: Python

last commit: over 1 year ago

Related projects:

Repository	Description	Stars
amazon-science/refchecker	Automates fine-grained hallucination detection in large language model outputs	325
bradyfu/woodpecker	A method to correct hallucinations in multimodal large language models without requiring retraining	617
damo-nlp-sg/vcd	An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs	222
junyangwang0410/haelm	A framework for detecting hallucinations in large language models	17
tianyi-lab/hallusionbench	An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy	259
rucaibox/pope	An evaluation framework for detecting object hallucinations in vision-language models	187
yiyangzhou/lure	Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability.	136
x-plug/mplug-halowl	Evaluates and mitigates hallucinations in multimodal large language models	82
openkg-org/easydetect	A framework to detect and mitigate hallucinations in multimodal large language models	48
assafbk/mocha_code	A unified framework and benchmark for detecting and mitigating hallucinations in open-vocabulary image captioning models	13
yfzhang114/llava-align	Debiasing techniques to minimize hallucinations in large visual language models	75
openmoss/halluqa	An evaluation framework for assessing the performance of large language models on question-answering tasks with hallucination detection	111
lalbj/pai	Improves the performance of large language models by intervening in their internal workings to reduce hallucinations	83
yuqifan1117/hallucidoctor	This project provides tools and frameworks to mitigate hallucinatory toxicity in visual instruction data, allowing researchers to fine-tune MLLM models on specific datasets.	41
junyangwang0410/amber	An LLM-free benchmark suite for evaluating MLLMs' hallucination capabilities in various tasks and dimensions	98