mPLUG-HalOwl

Hallucination tester

Evaluates and mitigates hallucinations in multimodal large language models

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating

GitHub

82 stars
1 watching
2 forks
Language: Python
last commit: 12 months ago
benchmarkcontrastive-learninghallucinationsmllmmultimodal-hallucinationmultimodal-large-language-models

Related projects:

Repository Description Stars
bradyfu/woodpecker A method to correct hallucinations in multimodal large language models without requiring retraining 617
junyangwang0410/amber An LLM-free benchmark suite for evaluating MLLMs' hallucination capabilities in various tasks and dimensions 98
junyangwang0410/haelm A framework for detecting hallucinations in large language models 17
mshukor/evalign-icl Evaluating and improving large multimodal models through in-context learning 21
fuxiaoliu/lrv-instruction A research project focused on mitigating hallucinations in large multi-modal models by improving instruction tuning through robust training methods. 262
1zhou-wang/memvr An implementation of a method to mitigate hallucinations in large language models using visual re-tracing 28
lalbj/pai Improves the performance of large language models by intervening in their internal workings to reduce hallucinations 83
tianyi-lab/hallusionbench An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy 259
amazon-science/refchecker Automates fine-grained hallucination detection in large language model outputs 325
yuqifan1117/hallucidoctor This project provides tools and frameworks to mitigate hallucinatory toxicity in visual instruction data, allowing researchers to fine-tune MLLM models on specific datasets. 41
billchan226/halc An implementation of an object hallucination reduction method using a PyTorch framework and various decoding algorithms. 72
yiyangzhou/lure Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. 136
yuezih/less-is-more Improving multimodal hallucination mitigation in EOS decision-making by selectively supervising training data 39
yfzhang114/llava-align Debiasing techniques to minimize hallucinations in large visual language models 75
openmoss/halluqa An evaluation framework for assessing the performance of large language models on question-answering tasks with hallucination detection 111