Bongard-HOI

Visual Reasoning Benchmark

A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models.

[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning

GitHub

64 stars

7 watching

7 forks

Language: Python

last commit: almost 3 years ago

cvpr2022few-shot-learningpytorchvisual-reasoning

Related projects:

Repository	Description	Stars
nvlabs/relvit	A deep learning framework designed to improve visual reasoning capabilities by utilizing concepts and semantic relations.	64
tianyi-lab/hallusionbench	An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy	259
nvlabs/bongard-logo	Generates synthetic Bongard problems with minimal human intervention.	51
davidmascharka/tbd-nets	An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks.	348
ailab-cvc/seed-bench	A benchmark for evaluating large language models' ability to process multimodal input	322
rowanz/r2c	An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning.	466
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336
yuqifan1117/hallucidoctor	This project provides tools and frameworks to mitigate hallucinatory toxicity in visual instruction data, allowing researchers to fine-tune MLLM models on specific datasets.	41
nvlabs/prismer	A deep learning framework for training multi-modal models with vision and language capabilities.	1,299
lavi-lab/visual-table	A project that generates visual representations tailored for general visual reasoning, leveraging hierarchical scene descriptions and instance-level world knowledge.	14
kunpengli1994/vsrn	An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching	294
nv-tlabs/steal	Develops a method to create high-quality training data from noisy labels in semantic segmentation tasks.	478
bonlime/keras-deeplab-v3-plus	An implementation of Deeplabv3+ in Keras with pretrained weights and customization options for semantic image segmentation.	1,360
hms-dbmi/viv	A toolkit for interactive visualization of high-resolution bioimaging data.	290
vcciv/blvd	A large-scale 5D semantics benchmark for autonomous driving	171