 image-captioning-with-semantic-attention
 Image captioning model
 A deep learning model for generating image captions with semantic attention
51 stars
 6 watching
 17 forks
 
Language: Jupyter Notebook 
Last commit: about 9 years ago

Related projects:

| Repository | Description | Stars | 
|---|---|---|
|  | A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks | 70 | 
|  | This implementation allows users to generate captions from images using a neural network model with visual attention. | 790 | 
|  | Adaptive attention mechanism for image captioning using visual sentinels | 335 | 
|  | An image caption generation model that uses abstract scene graphs for fine-grained control over the generated captions | 200 | 
|  | An unsupervised image captioning framework that allows generating captions from images without paired data. | 215 | 
|  | Enhances language models to generate text based on visual descriptions of images | 352 | 
|  | An image caption generation system utilizing machine learning models and deep neural networks. | 84 | 
|  | Implementations of deep learning architectures for semantic segmentation of images in various datasets. | 6 | 
|  | An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation. | 1,326 | 
|  | Generates captions for images using an attention-based neural network | 907 | 
|  | An image caption generation system using a neural network architecture with pre-trained models. | 64 | 
|  | An implementation of a dense video captioning model with attention-based fusion and context gating | 149 | 
|  | An implementation of a deep learning model for semantic segmentation using a novel attention mechanism to capture long-range dependencies in images. | 1,432 | 
|  | An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. | 1,484 | 
|  | Trains image paragraph captioning models to generate diverse and accurate captions | 90 |