TDD

Video descriptor extractor

A tool for extracting features from videos using deep convolutional descriptors

Trajectory-pooled Deep-Convolutional Descriptors

GitHub

104 stars
15 watching
75 forks
Language: Matlab
last commit: over 7 years ago
Linked from 1 awesome list

action-recognitioncaffe

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
wanglimin/untrimmednet A system for recognizing and detecting actions in untrimmed videos using a weakly supervised learning approach. 162
damo-nlp-sg/vcd An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs 222
csdms-contrib/dreich_algorithm A C++ algorithm for extracting channel networks from high resolution topographic data 1
reedscot/cvpr2016 A system for learning deep representations of fine-grained visual descriptions from images 336
neil-wu/swiftdump A tool that extracts information about Swift objects from Mach-O files. 400
xuchaoxi/video-cnn-feat Extracts CNN features from video frames using pre-trained MXNet models 31
wangboml/bp_features_extraction A Matlab program for extracting features from three physiological signals (PPG, ECG, and BP) collected in synchronization. 44
huaizhengzhang/awsome-deep-learning-for-video-analysis A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. 767
drewnoakes/metadata-extractor-dotnet A .NET library for extracting metadata from various image, video, and audio file formats. 953
rozumden/defmo A deep learning framework for deblurring and recovering the shape of fast-moving objects from blurred images 171
tinghuiz/sfmlearner A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning 1,977
vision-cair/longvu An artificial intelligence system designed to understand and describe long-form video content 329
liuzhao1225/youdub-webui A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis. 1,980
wangguanzhi/ladn A deep learning-based framework for facial makeup transfer and removal using adversarial disentangling networks 182
adbedada/ts-raster Extracts and analyzes time-series characteristics from raster data using Python. 4