collaborative-experts
Video retrieval framework
A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise.
Video embeddings for retrieval with natural language queries
336 stars
10 watching
55 forks
Language: Python
last commit: almost 2 years ago
Linked from 1 awesome list
deep-neural-networksvideo-retrieval
Related projects:
Repository | Description | Stars |
---|---|---|
danieljf24/hybrid_space | Develops a deep learning framework for video retrieval using text and computer vision | 87 |
danieljf24/dual_encoding | A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset | 155 |
li-xirong/w2vvpp | A deep learning-based video search system using pre-trained models and datasets | 28 |
antoine77340/mixture-of-embedding-experts | An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks. | 118 |
gabeur/mmt | Develops a cross-modal architecture for video retrieval by combining multiple types of features from videos and text | 258 |
opengvlab/internvideo | Developing video foundation models and datasets for multimodal understanding and applications | 1,413 |
idsia/brainstorm | A neural network framework designed to make working with neural networks fast and flexible. | 1,303 |
shangwei5/d2net | Develops a framework to deblur video by leveraging non-consecutively blurry frames and proposes an event fusion module for improving deblurring performance. | 35 |
google-research/visu3d | An abstraction layer between various deep learning frameworks and your program. | 148 |
allegro/allrank | A framework for training neural models to rank data items based on relevance | 874 |
xiangwang1223/neural_graph_collaborative_filtering | A Python implementation of a graph neural network-based collaborative filtering framework for personalized recommendation systems | 806 |
gsig/pyvideoresearch | A collection of video analysis methods and datasets for research and development | 533 |
rese1f/moviechat | A deep learning model designed to efficiently process and analyze long videos using large language models | 525 |
alejandro-isaza/caffe | A C++ implementation of a deep learning framework designed for speed and modularity. | 59 |
vision-cair/longvu | An artificial intelligence system designed to understand and describe long-form video content | 270 |