Lipreading-DenseNet3D
Lip movement analyzer
An implementation of the DenseNet3D deep learning model for recognizing speech from lip movements in videos
DenseNet3D model from "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990
117 stars
6 watching
21 forks
Language: Python
Last commit: almost 4 years ago
Tags: arxiv, deeplearning, lipreading, pytorch
Related projects:
Repository | Description | Stars |
---|---|---|
astorfi/lip-reading-deeplearning | Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks. | 1,836 |
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
hualin95/deeplab-v3plus | A high-performance PyTorch implementation of semantic image segmentation using a custom encoder-decoder architecture. | 334 |
astorfi/3d-convolutional-speaker-recognition | Develops deep learning models using 3D convolutional neural networks for speaker verification tasks. | 783 |
devendrachaplot/deeprl-grounding | Trains an RL agent to execute natural language instructions in a 3D environment using a combination of A3C and gated attention mechanisms. | 237 |
vita-epfl/crowdnav | Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms. | 598 |
codeslake/pvdnet | An open-source implementation of a deep learning model for video deblurring and motion estimation. | 114 |
dvlab-research/prompt-highlighter | An interactive control system for text generation in multi-modal language models. | 132 |
clementpinard/sfmlearner-pytorch | PyTorch implementation of unsupervised depth and ego-motion learning from video sequences. | 1,014 |
foamliu/deep-image-matting-pytorch | A PyTorch implementation of deep image matting. | 817 |
engineering-course/lip_ssl | A deep learning framework for human parsing that learns to detect human structures without explicit joint labeling. | 229 |
dvlab-research/lisa | A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge. | 1,861 |
deepwisdom/autodl | Automated deep learning algorithm that performs feature engineering, model selection, and hyperparameter tuning without human intervention. | 1,140 |
vita-epfl/monoloco | A library for 3D vision tasks using 2D keypoints. | 428 |
nvidia-merlin/nvtabular | A library that provides a high-level abstraction for feature engineering and preprocessing of tabular data to accelerate deep learning recommender systems on GPUs. | 1,049 |