neuraltalk2
Captioning model
Efficient image captioning model using a CNN followed by an RNN in deep learning on GPU
Efficient Image Captioning code in Torch, runs on GPU
6k stars
274 watching
1k forks
Language: Jupyter Notebook
last commit: about 7 years ago
Linked from 4 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
karpathy/neuraltalk | A system for generating image descriptions using neural networks | 5,411 |
donnyyou/torchcv | A comprehensive PyTorch-based framework for computer vision tasks | 2,250 |
openai/clip | A neural network trained on image and text pairs to predict the most relevant text snippet given an image | 25,919 |
lucidrains/dalle2-pytorch | An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,148 |
conchylicultor/deepqa | A deep learning-based chatbot model using TensorFlow and RNNs to generate responses to user queries. | 2,934 |
lucidrains/imagen-pytorch | Implements Google's Text-to-Image Neural Network in PyTorch using a cascading DDPM architecture with dynamic clipping and noise level conditioning. | 8,088 |
karpathy/char-rnn | A framework for training and generating text sequences using multi-layer Recurrent Neural Networks | 11,632 |
nvlabs/instant-ngp | A software toolkit for training and rendering neural graphics primitives | 16,033 |
ai-forever/kandinsky-2 | A multilingual text2image latent diffusion model with improved aesthetics and controllability | 2,766 |
tensorpack/tensorpack | A high-performance neural network training interface for TensorFlow that focuses on speed and flexibility. | 6,303 |
jwyang/faster-rcnn.pytorch | An object detection implementation built on top of PyTorch, supporting multi-image batch training and multiple GPUs. | 7,693 |
haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,232 |
clovaai/stargan-v2 | A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. | 3,500 |
systemerrorwang/white-box-cartoonization | An implementation of a deep learning-based facial cartoonization system using TensorFlow | 3,958 |
apple/corenet | A deep neural network toolkit allowing researchers and engineers to train various models | 6,981 |