densecap

Image describer

A deep learning framework for generating natural language descriptions of images by detecting objects and their attributes

Dense image captioning in Torch

GitHub

2k stars
68 watching
429 forks
Language: Jupyter Notebook
last commit: over 6 years ago
Linked from 2 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
jcjohnson/cnn-vis This project enables users to generate images using convolutional neural networks (CNNs) and visualize their activations. 496
reinhardh/supplement_deep_decoder A Python codebase for generating images from few parameters using an untrained non-convolutional deep neural network. 96
jaywongwang/densevideocaptioning An implementation of a dense video captioning model with attention-based fusion and context gating 148
vision-cair/chatcaptioner Enables automatic generation of descriptive text from images and videos based on user input. 452
zhegan27/semantic_compositional_nets A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks 70
ibm/max-image-caption-generator An image caption generation system utilizing machine learning models and deep neural networks. 84
chapternewscu/image-captioning-with-semantic-attention A deep learning model for generating image captions with semantic attention 51
deeprnn/image_captioning This implementation allows users to generate captions from images using a neural network model with visual attention. 786
ucbdrive/dla A software framework for deep learning-based image classification and segmentation tasks 434
vision-cair/longvu An artificial intelligence system designed to understand and describe long-form video content 270
chxj1992/captcha_cracker An image recognition system using a deep learning model to classify characters from verification codes 188
jhcho99/coformer An implementation of a deep learning model for grounding situation recognition in images 43
zhujun98/semantic_segmentation Implementations of deep learning architectures for semantic segmentation of images in various datasets. 6
codingjoe/django-pictures A Django package for responsive image handling using modern formats like AVIF and WebP 249
contextualai/lens Enhances language models to generate text based on visual descriptions of images 351