image-paragraph-captioning
Caption generator
Trains image paragraph captioning models to generate diverse and accurate captions
[EMNLP 2018] Training for Diversity in Image Paragraph Captioning
90 stars
6 watching
23 forks
Language: Python
last commit: over 5 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| A PyTorch-based tool for generating captions from images | 128 |
| An investigation into bias in image captioning systems using a dataset and a new model design to mitigate this bias | 13 |
| A project for pre-training models to support image captioning and question answering tasks. | 416 |
| Enhances language models to generate text based on visual descriptions of images | 352 |
| An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
| An image caption generation system using a neural network architecture with pre-trained models. | 64 |
| An unsupervised image captioning framework that allows generating captions from images without paired data. | 215 |
| A PyTorch implementation of a framework for generating captions with styles for images and videos. | 63 |
| A deep learning model for generating image captions with semantic attention | 51 |
| An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation. | 1,326 |
| This implementation allows users to generate captions from images using a neural network model with visual attention. | 790 |
| A method for generating and evaluating video captions using adversarial inference, trained on large datasets of text and multimedia features. | 34 |
| An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |
| A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. | 594 |
| Automates the process of generating multiple rewritten image captions by fine-tuning large vision-language models | 8 |