CAVP
Policy Network Framework
A software framework for fine-grained image captioning and sequence-level image captioning, utilizing policy networks to incorporate contextual information into image captions.
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)
46 stars
4 watching
3 forks
Language: Python
last commit: over 5 years ago image-captioningpolicy-network
Related projects:
Repository | Description | Stars |
---|---|---|
| An implementation of convolutional neural networks in Matlab, providing GPU-enabled vectorized processing for image recognition and processing tasks. | 136 |
| A deep learning framework providing a model architecture and training code for image captioning using semantic compositional networks | 70 |
| An implementation of guided policy search and LQG-based trajectory optimization for reinforcement learning | 599 |
| A deep learning framework for training highway networks on image data using convolutional neural networks | 57 |
| An implementation of a neural network framework for computer vision tasks, supporting both CPU and GPU computation. | 244 |
| Improves object detection by generating region proposals with increased adaptivity. | 156 |
| This implementation allows users to generate captions from images using a neural network model with visual attention. | 790 |
| A Python package for parsing and processing AWS IAM policies and statements. | 427 |
| A deep learning framework for iteratively decomposing vision and language reasoning via large language models. | 32 |
| An implementation of neural network components and optimization methods for text analysis, including rationales for neural predictions. | 355 |
| This code implements a neural network architecture designed to perform semantic segmentation in computer vision tasks. | 920 |
| A software framework for training and evaluating full-resolution residual networks for semantic image segmentation tasks | 280 |
| A deep learning framework implementing a specific network architecture for image localization tasks. | 537 |
| Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) | 887 |
| A framework for building computer vision and neural networks applications on the JavaVM | 122 |