image-captioning-with-semantic-attention
Image captioning model
A deep learning model for generating image captions with semantic attention
51 stars
6 watching
17 forks
Language: Jupyter Notebook
last commit: about 8 years ago Related projects:
Repository | Description | Stars |
---|---|---|
zhegan27/semantic_compositional_nets | A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks | 70 |
deeprnn/image_captioning | This implementation allows users to generate captions from images using a neural network model with visual attention. | 786 |
jiasenlu/adaptiveattention | Adaptive attention mechanism for image captioning using visual sentinels | 334 |
cshizhe/asg2cap | An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |
fengyang0317/unsupervised_captioning | An unsupervised image captioning framework that allows generating captions from images without paired data. | 215 |
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 351 |
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
zhujun98/semantic_segmentation | Implementations of deep learning architectures for semantic segmentation of images in various datasets. | 6 |
rmokady/clip_prefix_caption | An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation. | 1,318 |
yunjey/show-attend-and-tell | Generates captions for images using an attention-based neural network | 907 |
apple2373/chainer-caption | An image caption generation system using a neural network architecture with pre-trained models. | 64 |
jaywongwang/densevideocaptioning | An implementation of a dense video captioning model with attention-based fusion and context gating | 148 |
speedinghzl/ccnet | An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. | 1,426 |
zhengpeng7/birefnet | This repository provides a software framework and implementation of a neural network model for high-resolution image segmentation tasks | 1,379 |
lukemelas/image-paragraph-captioning | Trains image paragraph captioning models to generate diverse and accurate captions | 90 |