R-Bench
Relationship validation dataset
A repository providing preprocessed data and tools for evaluating and analyzing relationship hallucinations in large vision-language models.
[ICML2024] Repo for the paper `Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models'
20 stars
2 watching
0 forks
Language: Python
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
yangxuntu/vrd | This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools. | 94 |
prof-lu-cewu/visual-relationship-detection | Detects relationships and predicts predicates in images using language priors | 214 |
yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. | 134 |
benmarwick/rrtools | Tools for creating reproducible research projects in R using Quarto and version control. | 674 |
doubledaibo/drnet_cvpr2017 | This project presents a deep neural network architecture designed to detect visual relationships in images. | 202 |
stanfordvl/referringrelationships | This repository provides code and tools for training and evaluating models of referring relationships in computer vision | 260 |
tianyi-lab/hallusionbench | An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy | 243 |
tensorflow/data-validation | A library for exploring and validating machine learning data in TensorFlow | 765 |
rrrlw/icon | Provides access to complex systems datasets from the Index of Complex Networks (ICON) database. | 7 |
yuweihao/mm-vet | Evaluates the capabilities of large multimodal models using a set of diverse tasks and metrics | 267 |
tomoakin/rpostgresql | A C library that provides an R interface to PostgreSQL | 64 |
psirenny/derby-validate | A validation library designed to simplify the use of Derby JS in JavaScript applications. | 1 |
arthur151/relative_human | Provides a toolbox for loading, visualizing, and evaluating a dataset of images with human annotations, including depth layers and age group classification. | 138 |
ymcui/cmrc2018 | A collection of data for evaluating Chinese machine reading comprehension systems | 415 |
modeloriented/drwhy | A collection of tools and guidelines for building responsible machine learning models | 680 |