FOHE

Caption rewriting

Automates the process of generating multiple rewritten image captions by fine-tuning large vision-language models

8 stars

1 watching

1 forks

Language: Python

last commit: over 1 year ago

Related projects:

Repository	Description	Stars
fengyang0317/unsupervised_captioning	An unsupervised image captioning framework that allows generating captions from images without paired data.	215
nickjiang2378/vl-interp	This project provides an official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions.	46
chapternewscu/image-captioning-with-semantic-attention	A deep learning model for generating image captions with semantic attention	51
cshizhe/asg2cap	An image caption generation model that uses abstract scene graphs to fine-grained control and generate captions	200
contextualai/lens	Enhances language models to generate text based on visual descriptions of images	352
a1ext/auto_re	Automates renaming and tagging of IDA PRO functions based on API calls or jumps	621
apple2373/chainer-caption	An image caption generation system using a neural network architecture with pre-trained models.	64
luoweizhou/vlp	A project for pre-training models to support image captioning and question answering tasks.	416
aboev/arae-tf	Automates generation of discrete sequence text using adversarially regularized autoencoders	20
rmokady/clip_prefix_caption	An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation.	1,326
lukemelas/image-paragraph-captioning	Trains image paragraph captioning models to generate diverse and accurate captions	90
kacky24/stylenet	A PyTorch implementation of a framework for generating captions with styles for images and videos.	63
jakezhaojb/arae	An implementation of Adversarially Regularized Autoencoders for language generation and discrete structure modeling.	400
terrynoya/asimagelib	A decoder library for PNG and BMP images in ActionScript 3.	10
deeprnn/image_captioning	This implementation allows users to generate captions from images using a neural network model with visual attention.	790