TDD
Video descriptor extractor
A tool for extracting features from videos using deep convolutional descriptors
Trajectory-pooled Deep-Convolutional Descriptors
104 stars
15 watching
75 forks
Language: Matlab
last commit: over 7 years ago
Linked from 1 awesome list
action-recognition caffe
Related projects:
Repository | Description | Stars |
---|---|---|
wanglimin/untrimmednet | A system for recognizing and detecting actions in untrimmed videos using a weakly supervised learning approach. | 162 |
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 222 |
csdms-contrib/dreich_algorithm | A C++ algorithm for extracting channel networks from high resolution topographic data | 1 |
reedscot/cvpr2016 | A system for learning deep representations of fine-grained visual descriptions from images | 336 |
neil-wu/swiftdump | A tool that extracts information about Swift objects from Mach-O files. | 400 |
xuchaoxi/video-cnn-feat | Extracts CNN features from video frames using pre-trained MXNet models | 31 |
wangboml/bp_features_extraction | A Matlab program for extracting features from three physiological signals (PPG, ECG, and BP) collected in synchronization. | 44 |
huaizhengzhang/awsome-deep-learning-for-video-analysis | A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. | 767 |
drewnoakes/metadata-extractor-dotnet | A .NET library for extracting metadata from various image, video, and audio file formats. | 953 |
rozumden/defmo | A deep learning framework for deblurring and recovering the shape of fast-moving objects from blurred images | 171 |
tinghuiz/sfmlearner | A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning | 1,977 |
vision-cair/longvu | An artificial intelligence system designed to understand and describe long-form video content | 329 |
liuzhao1225/youdub-webui | A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis. | 1,980 |
wangguanzhi/ladn | A deep learning-based framework for facial makeup transfer and removal using adversarial disentangling networks | 182 |
adbedada/ts-raster | Extracts and analyzes time-series characteristics from raster data using Python. | 4 |