CLIP_prefix_caption
Image captioning method
An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation.
Simple image captioning model
1k stars
7 watching
220 forks
Language: Jupyter Notebook
last commit: 6 months ago Related projects:
Repository | Description | Stars |
---|---|---|
chapternewscu/image-captioning-with-semantic-attention | A deep learning model for generating image captions with semantic attention | 51 |
fengyang0317/unsupervised_captioning | An unsupervised image captioning framework that allows generating captions from images without paired data. | 215 |
luoweizhou/vlp | A project for pre-training models to support image captioning and question answering tasks. | 416 |
apple2373/chainer-caption | An image caption generation system using a neural network architecture with pre-trained models. | 64 |
lukemelas/image-paragraph-captioning | Trains image paragraph captioning models to generate diverse and accurate captions | 90 |
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 352 |
deeprnn/image_captioning | This implementation allows users to generate captions from images using a neural network model with visual attention. | 790 |
lumingyin/quickcaption | Automated captioning and transcription tool for video and audio files | 74 |
anonymousanoy/fohe | Automates the process of generating multiple rewritten image captions by fine-tuning large vision-language models | 8 |
cshizhe/asg2cap | An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions | 200 |
kdexd/virtex | A pretraining approach that uses semantically dense captions to learn visual representations and improve image understanding tasks. | 556 |
ruotianluo/imagecaptioning.pytorch | A Python-based framework for training and testing image captioning models using PyTorch. | 1,458 |
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
kacky24/stylenet | A PyTorch implementation of a framework for generating captions with styles for images and videos. | 63 |
yiwuzhong/sub-gc | A PyTorch implementation of image captioning models via scene graph decomposition. | 96 |