CAVP

Policy Network Framework

A software framework for fine-grained image captioning and sequence-level image captioning, utilizing policy networks to incorporate contextual information into image captions.

Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)

GitHub

46 stars

4 watching

3 forks

Language: Python

last commit: almost 6 years ago

image-captioningpolicy-network

Related projects:

Repository	Description	Stars
jimmy-ren/vcnn_double-bladed	An implementation of convolutional neural networks in Matlab, providing GPU-enabled vectorized processing for image recognition and processing tasks.	136
zhegan27/semantic_compositional_nets	A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks	70
cbfinn/gps	An implementation of guided policy search and LQG-based trajectory optimization for reinforcement learning	599
fomorians/highway-cnn	A deep learning framework for training highway networks on image data using convolutional neural networks	57
guanghan/darknet	An implementation of a neural network framework for computer vision tasks, supporting both CPU and GPU computation.	244
hciilab/derpn	Improves object detection by generating region proposals with increased adaptivity.	156
deeprnn/image_captioning	This implementation allows users to generate captions from images using a neural network model with visual attention.	790
netflix-skunkworks/policyuniverse	A Python package for parsing and processing AWS IAM policies and statements.	427
hxyou/idealgpt	A deep learning framework for iteratively decomposing vision and language reasoning via large language models.	32
taolei87/rcnn	An implementation of neural network components and optimization methods for text analysis, including rationales for neural predictions.	355
nv-tlabs/gscnn	This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks.	920
tobypde/frrn	A software framework for training and evaluating full-resolution residual networks for semantic image segmentation tasks	280
cypw/dpns	A deep learning framework implementing a specific network architecture for image localization tasks.	537
pathak22/context-encoder	Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs)	887
hellonico/origami	A framework for building computer vision and neural networks applications on the JavaVM	122