dual_encoding
Video text retrieval model
A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
154 stars
7 watching
31 forks
Language: Python
last commit: about 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| Develops a deep learning framework for video retrieval using text and computer vision | 87 |
| A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69 |
| A deep learning-based video search system using pre-trained models and datasets | 28 |
| A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise. | 337 |
| A deep learning model that jointly performs video frame interpolation and deblurring with unknown exposure time | 69 |
| Develops a cross-modal architecture for video retrieval by combining multiple types of features from videos and text | 259 |
| An efficient framework for end-to-end learning on image-text and video-text tasks | 709 |
| An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks. | 118 |
| An open-source implementation of a deep learning model for video deblurring and motion estimation. | 114 |
| An implementation of a video-text retrieval model using hierarchical graph reasoning with PyTorch. | 210 |
| An implementation of a dense video captioning model with attention-based fusion and context gating | 149 |
| Implementations of various deep learning algorithms in Python using Theano and Lasagne. | 24 |
| Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks. | 114 |
| A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
| A Python framework for building deep learning models with optimized encoding layers and batch normalization. | 2,044 |