hybrid_space

Video retrieval framework

Develops a deep learning framework for video retrieval using text and computer vision

Source code for our TPAMI'21 paper "Dual Encoding for Video Retrieval by Text" and our CVPR'19 paper "Dual Encoding for Zero-Example Video Retrieval".

GitHub

87 stars
5 watching
17 forks
Language: Python
Last commit: almost 2 years ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
danieljf24/dual_encoding | A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset | 155
albanie/collaborative-experts | A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise | 336
li-xirong/w2vvpp | A deep learning-based video search system using pre-trained models and datasets | 28
gabeur/mmt | Develops a cross-modal architecture for video retrieval by combining multiple types of features from videos and text | 258
cshizhe/hgr_v2t | An implementation of a video-text retrieval model using hierarchical graph reasoning with PyTorch | 209
opengvlab/internvideo | Developing video foundation models and datasets for multimodal understanding and applications | 1,413
google-research/visu3d | An abstraction layer between various deep learning frameworks and your program | 147
nicholas-leonard/dp | A deep learning library for streamlining research and development using the Torch7 distribution | 343
gitcvfb/cvr | Reconstructs high-quality video frames from two adjacent rolling shutter camera frames | 31
yasar-rehman/fedvssl | Implementation of Federated Self-Supervised Learning for video understanding | 24
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications | 276
danieljf24/w2vv | A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69
dokterbob/youtube2ipfs | Tools for downloading and publishing video files from YouTube to IPFS | 21
hannes-brt/hebel | A deep learning library that provides GPU acceleration and various neural network models and training methods | 1,169
sergiooramas/tartarus | A Python module for Deep Learning experiments on Audio and Text data combining classification, recommendation, and matrix factorization techniques | 101