densecap

Image describer

A deep learning framework for generating natural language descriptions of images by detecting objects and their attributes

Dense image captioning in Torch

GitHub

2k stars

68 watching

429 forks

Language: Jupyter Notebook

last commit: about 7 years ago

Linked from 2 awesome lists

Backlinks from these awesome lists:

Related projects:

Repository	Description	Stars
jcjohnson/cnn-vis	This project enables users to generate images using convolutional neural networks (CNNs) and visualize their activations.	499
reinhardh/supplement_deep_decoder	This repository provides code for an image generating deep neural network designed to produce concise representations of images from few parameters.	96
jaywongwang/densevideocaptioning	An implementation of a dense video captioning model with attention-based fusion and context gating	149
vision-cair/chatcaptioner	Enables automatic generation of descriptive text from images and videos based on user input.	457
zhegan27/semantic_compositional_nets	A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks	70
ibm/max-image-caption-generator	An image caption generation system utilizing machine learning models and deep neural networks.	84
chapternewscu/image-captioning-with-semantic-attention	A deep learning model for generating image captions with semantic attention	51
deeprnn/image_captioning	This implementation allows users to generate captions from images using a neural network model with visual attention.	790
ucbdrive/dla	A software framework for deep learning-based image classification and segmentation tasks	434
vision-cair/longvu	An artificial intelligence system designed to understand and describe long-form video content	329
chxj1992/captcha_cracker	An image recognition system using a deep learning model to classify characters from verification codes	189
jhcho99/coformer	An implementation of a deep learning model for grounding situation recognition in images	45
zhujun98/semantic_segmentation	Implementations of deep learning architectures for semantic segmentation of images in various datasets.	6
codingjoe/django-pictures	A Django package for responsive image handling using modern formats like AVIF and WebP	251
contextualai/lens	Enhances language models to generate text based on visual descriptions of images	352