hybrid_space
Video retrieval framework
A deep learning framework for text-based video retrieval
Source code for our TPAMI'21 paper "Dual Encoding for Video Retrieval by Text" and our CVPR'19 paper "Dual Encoding for Zero-Example Video Retrieval".
87 stars
5 watching
17 forks
Language: Python
Last commit: almost 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
danieljf24/dual_encoding | A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset | 155 |
albanie/collaborative-experts | A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise. | 336 |
li-xirong/w2vvpp | A deep learning-based video search system using pre-trained models and datasets | 28 |
gabeur/mmt | Develops a cross-modal architecture for video retrieval by combining multiple types of features from videos and text | 258 |
cshizhe/hgr_v2t | An implementation of a video-text retrieval model using hierarchical graph reasoning with PyTorch. | 209 |
opengvlab/internvideo | Developing video foundation models and datasets for multimodal understanding and applications | 1,413 |
google-research/visu3d | An abstraction layer between various deep learning frameworks and your program. | 147 |
nicholas-leonard/dp | A deep learning library for streamlining research and development using the Torch7 distribution. | 343 |
gitcvfb/cvr | Reconstructs high-quality video frames from two adjacent rolling shutter camera frames | 31 |
yasar-rehman/fedvssl | Implementation of Federated Self-Supervised Learning for video understanding | 24 |
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
danieljf24/w2vv | A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69 |
dokterbob/youtube2ipfs | Tools for downloading and publishing video files from YouTube to IPFS | 21 |
hannes-brt/hebel | A deep learning library that provides GPU acceleration and various neural network models and training methods. | 1,169 |
sergiooramas/tartarus | A Python module for Deep Learning experiments on Audio and Text data combining classification, recommendation, and matrix factorization techniques. | 101 |