Semantic_Compositional_Nets
Image Captioning Model
A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
70 stars
6 watching
24 forks
Language: Python
last commit: almost 7 years ago Related projects:
Repository | Description | Stars |
---|---|---|
chapternewscu/image-captioning-with-semantic-attention | A deep learning model for generating image captions with semantic attention | 51 |
deeprnn/image_captioning | This implementation allows users to generate captions from images using a neural network model with visual attention. | 790 |
speedinghzl/ccnet | An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. | 1,432 |
zhujun98/semantic_segmentation | Implementations of deep learning architectures for semantic segmentation of images in various datasets. | 6 |
preritj/segmentation | Deep learning models for semantic segmentation of images | 101 |
nv-tlabs/gscnn | This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks. | 920 |
hszhao/pspnet | A PyTorch implementation of a deep learning model for semantic image segmentation | 1,598 |
zhengpeng7/birefnet | An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. | 1,484 |
jaywongwang/densevideocaptioning | An implementation of a dense video captioning model with attention-based fusion and context gating | 149 |
deepcs233/visual-cot | A framework for training multi-modal language models with a focus on visual inputs and providing interpretable thoughts. | 162 |
pathak22/context-encoder | Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) | 887 |
k3nt0w/fcn_via_keras | A Python implementation of a deep neural network architecture for semantic image segmentation | 48 |
zhegan27/convsent | Trains an autoencoder to learn generic sentence representations using convolutional neural networks | 34 |
homles11/igcv3 | An implementation of an efficient deep neural network architecture | 189 |
cshizhe/asg2cap | An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |