w2vvpp
Video search system
A deep learning-based video search system using pre-trained models and datasets
W2VV++: A fully deep learning solution for ad-hoc video search
28 stars
8 watching
15 forks
Language: Python
last commit: 5 months ago
Linked from 1 awesome list
avsdeep-learningquery-representationvideo-retrieval
Related projects:
Repository | Description | Stars |
---|---|---|
nus-hpc-ai-lab/videosys | A comprehensive toolkit for high-performance video generation and processing | 1,819 |
danieljf24/hybrid_space | Develops a deep learning framework for video retrieval using text and computer vision | 87 |
danieljf24/dual_encoding | A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset | 154 |
dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 748 |
vision-cair/longvu | An artificial intelligence system designed to understand and describe long-form video content | 329 |
winlinvip/srs | A high-performance, real-time video server supporting multiple protocols and platforms. | 716 |
xjunko/mpv-v | A basic video player written in Vlang using mpv. | 28 |
danieljf24/w2vv | A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69 |
albanie/collaborative-experts | A framework for improving video retrieval by leveraging multiple text encoders and their collaborative expertise. | 337 |
opengvlab/internvideo | Develops general video foundation models and related datasets for multimodal understanding and generation through generative and discriminative learning. | 1,467 |
libvips/lua-vips | A Lua binding for a fast image processing library with low memory needs. | 129 |
liuzhao1225/youdub-webui | A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis. | 1,980 |
cvondrick/vatic | Tools for efficiently scaling up video annotation using crowdsourced marketplaces. | 609 |
vcciv/blvd | A large-scale 5D semantics benchmark for autonomous driving | 171 |
huaizhengzhang/awsome-deep-learning-for-video-analysis | A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. | 767 |