mPLUG-HalOwl

Hallucination tester

Evaluates and mitigates hallucinations in multimodal large language models

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating

GitHub

82 stars

1 watching

2 forks

Language: Python

Last commit: almost 2 years ago

Tags: benchmark, contrastive-learning, hallucinations, mllm, multimodal-hallucination, multimodal-large-language-models

Related projects:

Repository	Description	Stars
bradyfu/woodpecker	A method to correct hallucinations in multimodal large language models without requiring retraining	617
junyangwang0410/amber	An LLM-free benchmark suite for evaluating hallucinations in MLLMs across various tasks and dimensions	98
junyangwang0410/haelm	A framework for detecting hallucinations in large language models	17
mshukor/evalign-icl	Evaluating and improving large multimodal models through in-context learning	21
fuxiaoliu/lrv-instruction	A research project that mitigates hallucinations in large multimodal models through more robust instruction tuning	262
1zhou-wang/memvr	An implementation of a method to mitigate hallucinations in multimodal large language models using visual re-tracing	28
lalbj/pai	Improves the performance of large language models by intervening in their internal workings to reduce hallucinations	83
tianyi-lab/hallusionbench	An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy	259
amazon-science/refchecker	Automates fine-grained hallucination detection in large language model outputs	325
yuqifan1117/hallucidoctor	Tools and frameworks to mitigate hallucinatory toxicity in visual instruction data, allowing researchers to fine-tune MLLMs on specific datasets	41
billchan226/halc	A PyTorch implementation of an object hallucination reduction method, with support for various decoding algorithms	72
yiyangzhou/lure	Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability	136
yuezih/less-is-more	Mitigating multimodal hallucination by improving EOS (end-of-sequence) decisions through selective supervision of training data	39
yfzhang114/llava-align	Debiasing techniques to minimize hallucinations in large visual language models	75
openmoss/halluqa	An evaluation framework for assessing the performance of large language models on question-answering tasks with hallucination detection	111