Bongard-HOI
Visual Reasoning Benchmark
A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models.
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
64 stars
7 watching
7 forks
Language: Python
last commit: about 2 years ago cvpr2022few-shot-learningpytorchvisual-reasoning
Related projects:
Repository | Description | Stars |
---|---|---|
nvlabs/relvit | A deep learning framework designed to improve visual reasoning capabilities by utilizing concepts and semantic relations. | 64 |
tianyi-lab/hallusionbench | An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy | 243 |
nvlabs/bongard-logo | Generates synthetic Bongard problems with minimal human intervention. | 51 |
davidmascharka/tbd-nets | An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks. | 348 |
ailab-cvc/seed-bench | A benchmark for evaluating large language models' ability to process multimodal input | 315 |
rowanz/r2c | An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning. | 466 |
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,300 |
yuqifan1117/hallucidoctor | This project provides tools and frameworks to mitigate hallucinatory toxicity in visual instruction data, allowing researchers to fine-tune MLLM models on specific datasets. | 41 |
nvlabs/prismer | A deep learning framework for training multi-modal models with vision and language capabilities. | 1,298 |
lavi-lab/visual-table | A project that generates visual representations tailored for general visual reasoning, leveraging hierarchical scene descriptions and instance-level world knowledge. | 14 |
kunpengli1994/vsrn | An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching | 294 |
nv-tlabs/steal | Develops a method to create high-quality training data from noisy labels in semantic segmentation tasks. | 478 |
bonlime/keras-deeplab-v3-plus | An implementation of Deeplabv3+ in Keras with pretrained weights and customization options for semantic image segmentation. | 1,358 |
hms-dbmi/viv | A toolkit for interactive visualization of high-resolution bioimaging data. | 286 |
vcciv/blvd | A large-scale 5D semantics benchmark for autonomous driving | 170 |