w2vvpp

Video search system

A deep learning-based video search system using pre-trained models and datasets

W2VV++: A fully deep learning solution for ad-hoc video search

GitHub

28 stars
8 watching
15 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list

Tags: avs, deep-learning, query-representation, video-retrieval

Related projects:

Repository | Description | Stars
nus-hpc-ai-lab/videosys | A comprehensive toolkit for high-performance video generation and processing | 1,819
danieljf24/hybrid_space | Develops a deep learning framework for video retrieval using text and computer vision | 87
danieljf24/dual_encoding | A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset | 154
dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 748
vision-cair/longvu | An artificial intelligence system designed to understand and describe long-form video content | 329
winlinvip/srs | A high-performance, real-time video server supporting multiple protocols and platforms | 716
xjunko/mpv-v | A basic video player written in Vlang using mpv | 28
danieljf24/w2vv | A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69
albanie/collaborative-experts | A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise | 337
opengvlab/internvideo | Develops general video foundation models and related datasets for multimodal understanding and generation through generative and discriminative learning | 1,467
libvips/lua-vips | A Lua binding for a fast image processing library with low memory needs | 129
liuzhao1225/youdub-webui | A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis | 1,980
cvondrick/vatic | Tools for efficiently scaling up video annotation using crowdsourced marketplaces | 609
vcciv/blvd | A large-scale 5D semantics benchmark for autonomous driving | 171
huaizhengzhang/awsome-deep-learning-for-video-analysis | A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques | 767