Lipreading-DenseNet3D

Lip movement analyzer

A software implementation of a deep learning model designed to understand lip movements in videos

DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

GitHub

117 stars
6 watching
21 forks
Language: Python
last commit: almost 4 years ago
arxivdeeplearninglipreadingpytorch

Related projects:

Repository Description Stars
astorfi/lip-reading-deeplearning Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks. 1,833
millionintegrals/vel A collection of modular deep learning components that can be easily configured and reused in various applications. 276
hualin95/deeplab-v3plus A high-performance PyTorch implementation of semantic image segmentation using a custom encoder-decoder architecture. 334
astorfi/3d-convolutional-speaker-recognition Develops deep learning models using 3D convolutional neural networks for speaker verification tasks 782
devendrachaplot/deeprl-grounding Trains an RL agent to execute natural language instructions in a 3D environment using a combination of A3C and gated attention mechanisms. 237
vita-epfl/crowdnav Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms. 598
codeslake/pvdnet An open-source implementation of a deep learning model for video deblurring and motion estimation. 114
dvlab-research/prompt-highlighter An interactive control system for text generation in multi-modal language models 132
clementpinard/sfmlearner-pytorch PyTorch implementation of unsupervised depth and ego-motion learning from video sequences 1,014
foamliu/deep-image-matting-pytorch An implementation of deep image matting in PyTorch using a neural network architecture. 817
engineering-course/lip_ssl A deep learning framework for human parsing that learns to detect human structures without explicit joint labeling. 229
dvlab-research/lisa A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge. 1,861
deepwisdom/autodl Automated deep learning algorithm that performs feature engineering, model selection, and hyperparameter tuning without human intervention. 1,140
vita-epfl/monoloco A library for 3D vision tasks using 2D keypoints 428
nvidia-merlin/nvtabular A library that provides a high-level abstraction for feature engineering and preprocessing of tabular data to accelerate deep learning recommender systems on GPUs. 1,049