Semantic_Compositional_Nets

Image Captioning Model

A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks

The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"

GitHub

70 stars
6 watching
24 forks
Language: Python
last commit: over 6 years ago

Related projects:

Repository Description Stars
chapternewscu/image-captioning-with-semantic-attention A deep learning model for generating image captions with semantic attention 51
deeprnn/image_captioning This implementation allows users to generate captions from images using a neural network model with visual attention. 786
speedinghzl/ccnet An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. 1,426
zhujun98/semantic_segmentation Implementations of deep learning architectures for semantic segmentation of images in various datasets. 6
preritj/segmentation Deep learning models for semantic segmentation of images 100
nv-tlabs/gscnn This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks. 920
hszhao/pspnet A PyTorch implementation of a deep learning model for semantic image segmentation 1,593
zhengpeng7/birefnet An implementation of a deep learning-based image segmentation model for high-resolution images 1,319
jaywongwang/densevideocaptioning An implementation of a dense video captioning model with attention-based fusion and context gating 148
deepcs233/visual-cot Develops a multi-modal language model with a comprehensive dataset and benchmark for chain-of-thought reasoning 134
pathak22/context-encoder Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) 885
k3nt0w/fcn_via_keras A Python implementation of a deep neural network architecture for semantic image segmentation 48
zhegan27/convsent Trains an autoencoder to learn generic sentence representations using convolutional neural networks 34
homles11/igcv3 An implementation of an efficient deep neural network architecture 189
cshizhe/asg2cap An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions 200