w2vvpp

Video search system

A deep learning-based video search system using pre-trained models and datasets

W2VV++: A fully deep learning solution for ad-hoc video search

28 stars

8 watching

15 forks

Language: Python

Last commit: about 1 year ago

Linked from 1 awesome list

Tags: avs, deep-learning, query-representation, video-retrieval

Backlinks from these awesome lists:

danieljf24/awesome-video-text-retrieval

Related projects:

Repository	Description	Stars
nus-hpc-ai-lab/videosys	A comprehensive toolkit for high-performance video generation and processing	1,819
danieljf24/hybrid_space	Develops a deep learning framework for video retrieval using text and computer vision	87
danieljf24/dual_encoding	A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset	154
dvlab-research/llama-vid	An image-based language model that uses large language models to generate visual and text features from videos	748
vision-cair/longvu	An artificial intelligence system designed to understand and describe long-form video content	329
winlinvip/srs	A high-performance, real-time video server supporting multiple protocols and platforms.	716
xjunko/mpv-v	A basic video player written in Vlang using mpv.	28
danieljf24/w2vv	A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval	69
albanie/collaborative-experts	A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise.	337
opengvlab/internvideo	Develops general video foundation models and related datasets for multimodal understanding and generation through generative and discriminative learning.	1,467
libvips/lua-vips	A Lua binding for a fast image processing library with low memory needs.	129
liuzhao1225/youdub-webui	A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis.	1,980
cvondrick/vatic	Tools for efficiently scaling up video annotation using crowdsourced marketplaces.	609
vcciv/blvd	A large-scale 5D semantics benchmark for autonomous driving	171
huaizhengzhang/awsome-deep-learning-for-video-analysis	A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques.	767