AdaptiveAttention
Image Captioning Model
Adaptive attention mechanism for image captioning using visual sentinels
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
334 stars
13 watching
74 forks
Language: Jupyter Notebook
last commit: almost 7 years ago
Linked from 2 awesome lists
attention-mechanismimage-captioningtorch
Related projects:
Repository | Description | Stars |
---|---|---|
chapternewscu/image-captioning-with-semantic-attention | A deep learning model for generating image captions with semantic attention | 51 |
yunjey/show-attend-and-tell | Generates captions for images using an attention-based neural network | 907 |
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 351 |
jazzsaxmafia/show_attend_and_tell.tensorflow | A TensorFlow implementation of a neural caption generator using attention mechanisms. | 506 |
peteanderson80/bottom-up-attention | Trains a bottom-up attention model using Faster R-CNN and Visual Genome annotations for image captioning and VQA tasks | 1,433 |
lancopku/iais | This project proposes a novel method for calibrating attention distributions in multimodal models to improve contextualized representations of image-text pairs. | 30 |
deeprnn/image_captioning | This implementation allows users to generate captions from images using a neural network model with visual attention. | 786 |
luoweizhou/vlp | A project for pre-training models to support image captioning and question answering tasks. | 412 |
mikeswanson/jbwatchactivityindicator | Generates Apple Watch activity indicator images for animation | 529 |
lukemelas/image-paragraph-captioning | Trains image paragraph captioning models to generate diverse and accurate captions | 90 |
cshizhe/asg2cap | An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |
fengyang0317/unsupervised_captioning | An unsupervised image captioning framework that allows generating captions from images without paired data. | 215 |
jiasenlu/hiecoattenvqa | A framework for training Hierarchical Co-Attention models for Visual Question Answering using preprocessed data and a specific image model. | 349 |
xiadingz/video-caption.pytorch | PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 401 |
yiwuzhong/sub-gc | A PyTorch implementation of image captioning models via scene graph decomposition. | 96 |