Semantic_Compositional_Nets
Image Captioning Model
A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
70 stars
6 watching
24 forks
Language: Python
last commit: over 6 years ago Related projects:
Repository | Description | Stars |
---|---|---|
chapternewscu/image-captioning-with-semantic-attention | A deep learning model for generating image captions with semantic attention | 51 |
deeprnn/image_captioning | This implementation allows users to generate captions from images using a neural network model with visual attention. | 786 |
speedinghzl/ccnet | An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. | 1,426 |
zhujun98/semantic_segmentation | Implementations of deep learning architectures for semantic segmentation of images in various datasets. | 6 |
preritj/segmentation | Deep learning models for semantic segmentation of images | 100 |
nv-tlabs/gscnn | This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks. | 920 |
hszhao/pspnet | A PyTorch implementation of a deep learning model for semantic image segmentation | 1,593 |
zhengpeng7/birefnet | An implementation of a deep learning-based image segmentation model for high-resolution images | 1,319 |
jaywongwang/densevideocaptioning | An implementation of a dense video captioning model with attention-based fusion and context gating | 148 |
deepcs233/visual-cot | Develops a multi-modal language model with a comprehensive dataset and benchmark for chain-of-thought reasoning | 134 |
pathak22/context-encoder | Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) | 885 |
k3nt0w/fcn_via_keras | A Python implementation of a deep neural network architecture for semantic image segmentation | 48 |
zhegan27/convsent | Trains an autoencoder to learn generic sentence representations using convolutional neural networks | 34 |
homles11/igcv3 | An implementation of an efficient deep neural network architecture | 189 |
cshizhe/asg2cap | An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |