Semantic_Compositional_Nets

Image Captioning Model

A deep learning framework that provides a model architecture and training code for image captioning with semantic compositional networks.

The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"

70 stars

6 watching

24 forks

Language: Python

Last commit: over 8 years ago

Related projects:

Repository	Description	Stars
chapternewscu/image-captioning-with-semantic-attention	A deep learning model for generating image captions with semantic attention	51
deeprnn/image_captioning	This implementation allows users to generate captions from images using a neural network model with visual attention.	790
speedinghzl/ccnet	An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images.	1,432
zhujun98/semantic_segmentation	Implementations of deep learning architectures for semantic segmentation of images in various datasets.	6
preritj/segmentation	Deep learning models for semantic segmentation of images	101
nv-tlabs/gscnn	This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks.	920
hszhao/pspnet	A PyTorch implementation of a deep learning model for semantic image segmentation	1,598
zhengpeng7/birefnet	An open-source implementation of an image segmentation model that combines background removal and object detection capabilities.	1,484
jaywongwang/densevideocaptioning	An implementation of a dense video captioning model with attention-based fusion and context gating	149
deepcs233/visual-cot	A framework for training multi-modal language models with a focus on visual inputs and providing interpretable thoughts.	162
pathak22/context-encoder	Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs)	887
k3nt0w/fcn_via_keras	A Python implementation of a deep neural network architecture for semantic image segmentation	48
zhegan27/convsent	Trains an autoencoder to learn generic sentence representations using convolutional neural networks	34
homles11/igcv3	An implementation of an efficient deep neural network architecture	189
cshizhe/asg2cap	An image caption generation model that uses abstract scene graphs for fine-grained control over the generated captions	200