asg2cap
Image captioning model
An image caption generation model that uses abstract scene graphs for fine-grained control over the generated captions
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., CVPR 2020, Oral).
200 stars
9 watching
29 forks
Language: Python
Last commit: about 2 years ago

Related projects:
Repository | Description | Stars |
---|---|---|
yiwuzhong/sub-gc | A PyTorch implementation of image captioning models via scene graph decomposition. | 96 |
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
chapternewscu/image-captioning-with-semantic-attention | A deep learning model for generating image captions with semantic attention | 51 |
jaywongwang/densevideocaptioning | An implementation of a dense video captioning model with attention-based fusion and context gating | 149 |
fengyang0317/unsupervised_captioning | An unsupervised image captioning framework that allows generating captions from images without paired data. | 215 |
kacky24/stylenet | A PyTorch implementation of a framework for generating captions with styles for images and videos. | 63 |
yangxuntu/sgae | Automatically generates scene graphs from images to aid in image captioning tasks | 220 |
zhegan27/semantic_compositional_nets | A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks | 70 |
yongliang-wu/explorecfg | Develops strategies to optimize in-context sequence configurations for vision-language few-shot learning, exploring the effects of varying configurations of image-text pairs. | 33 |
eladhoffer/captiongen | A PyTorch-based tool for generating captions from images | 128 |
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 352 |
apple2373/chainer-caption | An image caption generation system using a neural network architecture with pre-trained models. | 64 |
hszhao/pspnet | A PyTorch implementation of a deep learning model for semantic image segmentation | 1,598 |
mansimov/text2image | A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. | 594 |
hasinhayder/imagecaptionhoveranimation | A CSS3-based solution to create hover animations for image captions | 354 |