r2c
Visual Reasoning Model
An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning.
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
466 stars
16 watching
91 forks
Language: Python
last commit: over 3 years ago commonsensereasoningvcrvisionvisualvisual-commonsense-reasoning
Related projects:
Repository | Description | Stars |
---|---|---|
davidmascharka/tbd-nets | An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks. | 348 |
deepcs233/visual-cot | Develops a multi-modal language model with a comprehensive dataset and benchmark for chain-of-thought reasoning | 134 |
nexusapoorvacus/deepvariationstructuredrl | An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch. | 63 |
kunpengli1994/vsrn | An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching | 294 |
cadene/vqa.pytorch | A PyTorch implementation of visual question answering with multimodal representation learning | 716 |
rucaibox/comvint | Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks | 18 |
kimhc6028/relational-networks | A PyTorch implementation of a neural network module for relational reasoning in computer vision tasks | 812 |
mrgemy95/visual-interaction-networks-pytorch | An implementation of Deepmind's Visual Interaction Networks using PyTorch to predict future events in physical scenes. | 166 |
l0sg/relational-rnn-pytorch | An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 244 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,217 |
anuragranj/back2future.pytorch | An implementation of unsupervised learning for multi-frame optical flow with occlusions using PyTorch. | 111 |
nvlabs/bongard-hoi | A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models. | 64 |
rowanz/neural-motifs | A software framework for scene graph parsing with global context using PyTorch and Visual Genome data. | 525 |
javeywang/pyramid-attention-networks-pytorch | An implementation of a deep learning model using PyTorch for semantic segmentation tasks. | 235 |
randl/shufflenetv2-pytorch | An implementation of a lightweight convolutional neural network architecture for mobile devices | 191 |