asg2cap

Image captioning model

An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions

Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., CVPR 2020, Oral).

GitHub

200 stars
9 watching
29 forks
Language: Python
last commit: about 2 years ago

Related projects:

Repository Description Stars
yiwuzhong/sub-gc A PyTorch implementation of image captioning models via scene graph decomposition. 96
ibm/max-image-caption-generator An image caption generation system utilizing machine learning models and deep neural networks. 84
chapternewscu/image-captioning-with-semantic-attention A deep learning model for generating image captions with semantic attention 51
jaywongwang/densevideocaptioning An implementation of a dense video captioning model with attention-based fusion and context gating 149
fengyang0317/unsupervised_captioning An unsupervised image captioning framework that allows generating captions from images without paired data. 215
kacky24/stylenet A PyTorch implementation of a framework for generating captions with styles for images and videos. 63
yangxuntu/sgae Automatically generates scene graphs from images to aid in image captioning tasks 220
zhegan27/semantic_compositional_nets A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks 70
yongliang-wu/explorecfg This project develops strategies to optimize in-context sequence configurations for Vision-Language few-shot learning, with a focus on exploring the effects of varying configurations on image-text pairs. 33
eladhoffer/captiongen A PyTorch-based tool for generating captions from images 128
contextualai/lens Enhances language models to generate text based on visual descriptions of images 352
apple2373/chainer-caption An image caption generation system using a neural network architecture with pre-trained models. 64
hszhao/pspnet A PyTorch implementation of a deep learning model for semantic image segmentation 1,598
mansimov/text2image A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. 594
hasinhayder/imagecaptionhoveranimation A CSS3-based solution to create hover animations for image captions 354