collaborative-experts

Video retrieval framework

A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise.

Video embeddings for retrieval with natural language queries

GitHub

336 stars
10 watching
55 forks
Language: Python
last commit: almost 2 years ago
Linked from 1 awesome list

deep-neural-networksvideo-retrieval

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
danieljf24/hybrid_space Develops a deep learning framework for video retrieval using text and computer vision 87
danieljf24/dual_encoding A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset 155
li-xirong/w2vvpp A deep learning-based video search system using pre-trained models and datasets 28
antoine77340/mixture-of-embedding-experts An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks. 118
gabeur/mmt Develops a cross-modal architecture for video retrieval by combining multiple types of features from videos and text 258
opengvlab/internvideo Developing video foundation models and datasets for multimodal understanding and applications 1,413
idsia/brainstorm A neural network framework designed to make working with neural networks fast and flexible. 1,303
shangwei5/d2net Develops a framework to deblur video by leveraging non-consecutively blurry frames and proposes an event fusion module for improving deblurring performance. 35
google-research/visu3d An abstraction layer between various deep learning frameworks and your program. 148
allegro/allrank A framework for training neural models to rank data items based on relevance 874
xiangwang1223/neural_graph_collaborative_filtering A Python implementation of a graph neural network-based collaborative filtering framework for personalized recommendation systems 806
gsig/pyvideoresearch A collection of video analysis methods and datasets for research and development 533
rese1f/moviechat A deep learning model designed to efficiently process and analyze long videos using large language models 525
alejandro-isaza/caffe A C++ implementation of a deep learning framework designed for speed and modularity. 59
vision-cair/longvu An artificial intelligence system designed to understand and describe long-form video content 270