CLIP_prefix_caption

Image captioning method

An image captioning approach that uses the CLIP model to encode images and fine-tunes a language model to generate captions, without requiring additional supervision or object annotations.
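As the project name suggests, the image encoding is passed to the language model as a "prefix" placed in front of the caption tokens. The following is a minimal sketch of that idea, not the project's implementation: it assumes a single linear mapping, uses toy dimensions, and replaces the CLIP encoder and the language model's token embeddings with random stand-ins. All names (`make_linear`, `map_to_prefix`, `build_lm_input`) and sizes are hypothetical.

```python
import random


def make_linear(in_dim, out_dim, rng):
    """Random weight matrix (out_dim rows, in_dim columns).

    Stands in for a trained mapping layer; a real model would learn these.
    """
    return [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]


def map_to_prefix(image_embedding, weights, prefix_length, token_dim):
    """Project one image embedding to prefix_length vectors of size token_dim."""
    flat = [sum(w * x for w, x in zip(row, image_embedding)) for row in weights]
    return [flat[i * token_dim:(i + 1) * token_dim] for i in range(prefix_length)]


def build_lm_input(prefix, caption_token_embeddings):
    """Prepend the prefix vectors to the caption token embeddings."""
    return prefix + caption_token_embeddings


rng = random.Random(0)
CLIP_DIM, TOKEN_DIM, PREFIX_LEN = 8, 4, 3  # toy sizes, assumed for illustration

# Stand-in for the output of a CLIP image encoder (assumption: random values).
image_embedding = [rng.gauss(0, 1) for _ in range(CLIP_DIM)]

weights = make_linear(CLIP_DIM, PREFIX_LEN * TOKEN_DIM, rng)
prefix = map_to_prefix(image_embedding, weights, PREFIX_LEN, TOKEN_DIM)

# Stand-in for the embeddings of a 5-token caption (assumption: random values).
caption = [[rng.gauss(0, 1) for _ in range(TOKEN_DIM)] for _ in range(5)]

lm_input = build_lm_input(prefix, caption)
print(len(lm_input), len(lm_input[0]))  # 8 4
```

In a real setup the language model would be trained to predict the caption tokens given this combined sequence, so the only paired data needed are images and their captions.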

Simple image captioning model

GitHub

1k stars
7 watching
220 forks
Language: Jupyter Notebook
Last commit: 6 months ago

Related projects:

Repository | Description | Stars
chapternewscu/image-captioning-with-semantic-attention | A deep learning model for generating image captions with semantic attention | 51
fengyang0317/unsupervised_captioning | An unsupervised image captioning framework that generates captions from images without paired data | 215
luoweizhou/vlp | A project for pre-training models to support image captioning and question answering tasks | 416
apple2373/chainer-caption | An image caption generation system using a neural network architecture with pre-trained models | 64
lukemelas/image-paragraph-captioning | Trains image paragraph captioning models to generate diverse and accurate captions | 90
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 352
deeprnn/image_captioning | An implementation that generates captions from images using a neural network model with visual attention | 790
lumingyin/quickcaption | An automated captioning and transcription tool for video and audio files | 74
anonymousanoy/fohe | Automates the generation of multiple rewritten image captions by fine-tuning large vision-language models | 8
cshizhe/asg2cap | An image caption generation model that uses abstract scene graphs for fine-grained control over the generated captions | 200
kdexd/virtex | A pretraining approach that uses semantically dense captions to learn visual representations and improve image understanding tasks | 556
ruotianluo/imagecaptioning.pytorch | A Python-based framework for training and testing image captioning models using PyTorch | 1,458
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks | 84
kacky24/stylenet | A PyTorch implementation of a framework for generating captions with styles for images and videos | 63
yiwuzhong/sub-gc | A PyTorch implementation of image captioning models via scene graph decomposition | 96