Bongard-HOI

Visual Reasoning Benchmark

A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models.

[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning

GitHub

64 stars
7 watching
7 forks
Language: Python
last commit: about 2 years ago
cvpr2022few-shot-learningpytorchvisual-reasoning

Related projects:

Repository Description Stars
nvlabs/relvit A deep learning framework designed to improve visual reasoning capabilities by utilizing concepts and semantic relations. 64
tianyi-lab/hallusionbench An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy 243
nvlabs/bongard-logo Generates synthetic Bongard problems with minimal human intervention. 51
davidmascharka/tbd-nets An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks. 348
ailab-cvc/seed-bench A benchmark for evaluating large language models' ability to process multimodal input 315
rowanz/r2c An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning. 466
lxtgh/omg-seg Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. 1,300
yuqifan1117/hallucidoctor This project provides tools and frameworks to mitigate hallucinatory toxicity in visual instruction data, allowing researchers to fine-tune MLLM models on specific datasets. 41
nvlabs/prismer A deep learning framework for training multi-modal models with vision and language capabilities. 1,298
lavi-lab/visual-table A project that generates visual representations tailored for general visual reasoning, leveraging hierarchical scene descriptions and instance-level world knowledge. 14
kunpengli1994/vsrn An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching 294
nv-tlabs/steal Develops a method to create high-quality training data from noisy labels in semantic segmentation tasks. 478
bonlime/keras-deeplab-v3-plus An implementation of Deeplabv3+ in Keras with pretrained weights and customization options for semantic image segmentation. 1,358
hms-dbmi/viv A toolkit for interactive visualization of high-resolution bioimaging data. 286
vcciv/blvd A large-scale 5D semantics benchmark for autonomous driving 170