r2c

Visual Reasoning Model

An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning.

Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)

GitHub

466 stars
16 watching
91 forks
Language: Python
last commit: over 3 years ago
commonsensereasoningvcrvisionvisualvisual-commonsense-reasoning

Related projects:

Repository Description Stars
davidmascharka/tbd-nets An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks. 348
deepcs233/visual-cot Develops a multi-modal language model with a comprehensive dataset and benchmark for chain-of-thought reasoning 134
nexusapoorvacus/deepvariationstructuredrl An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch. 63
kunpengli1994/vsrn An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching 294
cadene/vqa.pytorch A PyTorch implementation of visual question answering with multimodal representation learning 716
rucaibox/comvint Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks 18
kimhc6028/relational-networks A PyTorch implementation of a neural network module for relational reasoning in computer vision tasks 812
mrgemy95/visual-interaction-networks-pytorch An implementation of Deepmind's Visual Interaction Networks using PyTorch to predict future events in physical scenes. 166
l0sg/relational-rnn-pytorch An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling 244
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,217
anuragranj/back2future.pytorch An implementation of unsupervised learning for multi-frame optical flow with occlusions using PyTorch. 111
nvlabs/bongard-hoi A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models. 64
rowanz/neural-motifs A software framework for scene graph parsing with global context using PyTorch and Visual Genome data. 525
javeywang/pyramid-attention-networks-pytorch An implementation of a deep learning model using PyTorch for semantic segmentation tasks. 235
randl/shufflenetv2-pytorch An implementation of a lightweight convolutional neural network architecture for mobile devices 191