image-captioning-with-semantic-attention

Image captioning model

A deep learning model for generating image captions with semantic attention

GitHub

51 stars
6 watching
17 forks
Language: Jupyter Notebook
last commit: about 9 years ago

Related projects:

Repository Description Stars
zhegan27/semantic_compositional_nets A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks 70
deeprnn/image_captioning This implementation allows users to generate captions from images using a neural network model with visual attention. 790
jiasenlu/adaptiveattention Adaptive attention mechanism for image captioning using visual sentinels 335
cshizhe/asg2cap An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions 200
fengyang0317/unsupervised_captioning An unsupervised image captioning framework that allows generating captions from images without paired data. 215
contextualai/lens Enhances language models to generate text based on visual descriptions of images 352
ibm/max-image-caption-generator An image caption generation system utilizing machine learning models and deep neural networks. 84
zhujun98/semantic_segmentation Implementations of deep learning architectures for semantic segmentation of images in various datasets. 6
rmokady/clip_prefix_caption An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation. 1,326
yunjey/show-attend-and-tell Generates captions for images using an attention-based neural network 907
apple2373/chainer-caption An image caption generation system using a neural network architecture with pre-trained models. 64
jaywongwang/densevideocaptioning An implementation of a dense video captioning model with attention-based fusion and context gating 149
speedinghzl/ccnet An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. 1,432
zhengpeng7/birefnet An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. 1,484
lukemelas/image-paragraph-captioning Trains image paragraph captioning models to generate diverse and accurate captions 90