AdaptiveAttention

Image Captioning Model

Adaptive attention mechanism for image captioning using visual sentinels

Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"

GitHub

334 stars
13 watching
74 forks
Language: Jupyter Notebook
last commit: almost 7 years ago
Linked from 2 awesome lists

attention-mechanismimage-captioningtorch

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
chapternewscu/image-captioning-with-semantic-attention A deep learning model for generating image captions with semantic attention 51
yunjey/show-attend-and-tell Generates captions for images using an attention-based neural network 907
contextualai/lens Enhances language models to generate text based on visual descriptions of images 351
jazzsaxmafia/show_attend_and_tell.tensorflow A TensorFlow implementation of a neural caption generator using attention mechanisms. 506
peteanderson80/bottom-up-attention Trains a bottom-up attention model using Faster R-CNN and Visual Genome annotations for image captioning and VQA tasks 1,433
lancopku/iais This project proposes a novel method for calibrating attention distributions in multimodal models to improve contextualized representations of image-text pairs. 30
deeprnn/image_captioning This implementation allows users to generate captions from images using a neural network model with visual attention. 786
luoweizhou/vlp A project for pre-training models to support image captioning and question answering tasks. 412
mikeswanson/jbwatchactivityindicator Generates Apple Watch activity indicator images for animation 529
lukemelas/image-paragraph-captioning Trains image paragraph captioning models to generate diverse and accurate captions 90
cshizhe/asg2cap An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions 200
fengyang0317/unsupervised_captioning An unsupervised image captioning framework that allows generating captions from images without paired data. 215
jiasenlu/hiecoattenvqa A framework for training Hierarchical Co-Attention models for Visual Question Answering using preprocessed data and a specific image model. 349
xiadingz/video-caption.pytorch PyTorch implementation of video captioning, combining deep learning and computer vision techniques. 401
yiwuzhong/sub-gc A PyTorch implementation of image captioning models via scene graph decomposition. 96