Semantic_Compositional_Nets
Image Captioning Model
A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
70 stars
6 watching
24 forks
Language: Python
last commit: almost 7 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| A deep learning model for generating image captions with semantic attention | 51 |
| This implementation allows users to generate captions from images using a neural network model with visual attention. | 790 |
| An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. | 1,432 |
| Implementations of deep learning architectures for semantic segmentation of images in various datasets. | 6 |
| Deep learning models for semantic segmentation of images | 101 |
| This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks. | 920 |
| A PyTorch implementation of a deep learning model for semantic image segmentation | 1,598 |
| An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. | 1,484 |
| An implementation of a dense video captioning model with attention-based fusion and context gating | 149 |
| A framework for training multi-modal language models with a focus on visual inputs and providing interpretable thoughts. | 162 |
| Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) | 887 |
| A Python implementation of a deep neural network architecture for semantic image segmentation | 48 |
| Trains an autoencoder to learn generic sentence representations using convolutional neural networks | 34 |
| An implementation of an efficient deep neural network architecture | 189 |
| An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |