lens

Image captioner

Enhances language models so they can generate text from visual descriptions of images

This is the official repository for the LENS (Large Language Models Enhanced to See) system.

GitHub

352 stars

9 watching

12 forks

Language: Jupyter Notebook

last commit: over 2 years ago

Related projects:

Repository	Description	Stars
jiasenlu/adaptiveattention	Adaptive attention mechanism for image captioning using visual sentinels	335
chapternewscu/image-captioning-with-semantic-attention	A deep learning model for generating image captions with semantic attention	51
luoweizhou/vlp	A project for pre-training models to support image captioning and question answering tasks	416
lukemelas/image-paragraph-captioning	Trains image paragraph captioning models to generate diverse and accurate captions	90
vision-cair/chatcaptioner	Enables automatic generation of descriptive text from images and videos based on user input	457
fengyang0317/unsupervised_captioning	An unsupervised image captioning framework that generates captions from images without paired data	215
cshizhe/asg2cap	An image caption generation model that uses abstract scene graphs for fine-grained control over the generated captions	200
anonymousanoy/fohe	Automates the process of generating multiple rewritten image captions by fine-tuning large vision-language models	8
rmokady/clip_prefix_caption	An approach to image captioning that leverages the CLIP model and fine-tunes a language model without requiring additional supervision or object annotation	1,326
byungkwanlee/moai	Improves performance of vision language tasks by integrating computer vision capabilities into large language models	314
isekai-portal/link-context-learning	An implementation of a multimodal learning approach to improve language models' ability to recognize unseen images and understand novel concepts	91
nickjiang2378/vl-interp	An official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions	46
luciancaetano/lens-ui	A React UI component library designed to be simple and customizable	8
stevenfontanella/microlens	A lightweight alternative to the lens library with fewer dependencies and no Template Haskell support	286
commaai/commacoloring	An online coloring book with interactive features	101