r2c
Visual Reasoning Model
An open-source project providing PyTorch code and data for a deep learning model that enables visual commonsense reasoning.
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
466 stars
16 watching
91 forks
Language: Python
last commit: almost 4 years ago commonsensereasoningvcrvisionvisualvisual-commonsense-reasoning
Related projects:
Repository | Description | Stars |
---|---|---|
| An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks. | 348 |
| A framework for training multi-modal language models with a focus on visual inputs and providing interpretable thoughts. | 162 |
| An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch. | 63 |
| An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching | 294 |
| A PyTorch implementation of visual question answering with multimodal representation learning | 718 |
| Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks | 18 |
| A PyTorch implementation of a neural network module for relational reasoning in computer vision tasks | 809 |
| An implementation of Deepmind's Visual Interaction Networks using PyTorch to predict future events in physical scenes. | 166 |
| An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 245 |
| A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,236 |
| An implementation of unsupervised learning for multi-frame optical flow with occlusions using PyTorch. | 112 |
| A benchmarking tool and software framework for evaluating few-shot visual reasoning capabilities in computer vision models. | 64 |
| A software framework for scene graph parsing with global context using PyTorch and Visual Genome data. | 526 |
| An implementation of a deep learning model using PyTorch for semantic segmentation tasks. | 237 |
| An implementation of a lightweight convolutional neural network architecture for mobile devices | 191 |