r2c

Visual Reasoning Model

An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning.

Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)

GitHub

466 stars

16 watching

91 forks

Language: Python

last commit: about 4 years ago

commonsensereasoningvcrvisionvisualvisual-commonsense-reasoning

visualcommonsense.com

Related projects:

Repository	Description	Stars
davidmascharka/tbd-nets	An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks.	348
deepcs233/visual-cot	A framework for training multi-modal language models with a focus on visual inputs and providing interpretable thoughts.	162
nexusapoorvacus/deepvariationstructuredrl	An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch.	63
kunpengli1994/vsrn	An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching	294
cadene/vqa.pytorch	A PyTorch implementation of visual question answering with multimodal representation learning	718
rucaibox/comvint	Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks	18
kimhc6028/relational-networks	A PyTorch implementation of a neural network module for relational reasoning in computer vision tasks	809
mrgemy95/visual-interaction-networks-pytorch	An implementation of Deepmind's Visual Interaction Networks using PyTorch to predict future events in physical scenes.	166
l0sg/relational-rnn-pytorch	An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling	245
kaiyangzhou/dassl.pytorch	A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision.	1,236
anuragranj/back2future.pytorch	An implementation of unsupervised learning for multi-frame optical flow with occlusions using PyTorch.	112
nvlabs/bongard-hoi	A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models.	64
rowanz/neural-motifs	A software framework for scene graph parsing with global context using PyTorch and Visual Genome data.	526
javeywang/pyramid-attention-networks-pytorch	An implementation of a deep learning model using PyTorch for semantic segmentation tasks.	237
randl/shufflenetv2-pytorch	An implementation of a lightweight convolutional neural network architecture for mobile devices	191