Lipreading-DenseNet3D

Lip movement analyzer

A software implementation of a deep learning model designed to understand lip movements in videos

DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

GitHub

117 stars

6 watching

21 forks

Language: Python

last commit: over 4 years ago

arxivdeeplearninglipreadingpytorch

Related projects:

Repository	Description	Stars
astorfi/lip-reading-deeplearning	Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks.	1,840
millionintegrals/vel	A collection of modular deep learning components that can be easily configured and reused in various applications.	276
hualin95/deeplab-v3plus	A high-performance PyTorch implementation of semantic image segmentation using a custom encoder-decoder architecture.	334
astorfi/3d-convolutional-speaker-recognition	Develops deep learning models using 3D convolutional neural networks for speaker verification tasks	783
devendrachaplot/deeprl-grounding	Trains an RL agent to execute natural language instructions in a 3D environment using a combination of A3C and gated attention mechanisms.	237
vita-epfl/crowdnav	Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms.	607
codeslake/pvdnet	An open-source implementation of a deep learning model for video deblurring and motion estimation.	114
dvlab-research/prompt-highlighter	An interactive control system for text generation in multi-modal language models	135
clementpinard/sfmlearner-pytorch	Pytorch implementation of unsupervised depth and ego-motion learning from video sequences	1,022
foamliu/deep-image-matting-pytorch	An implementation of deep image matting in PyTorch using a neural network architecture.	821
engineering-course/lip_ssl	A deep learning framework for human parsing that learns to detect human structures without explicit joint labeling.	229
dvlab-research/lisa	A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge.	1,923
deepwisdom/autodl	Automated deep learning algorithm that performs feature engineering, model selection, and hyperparameter tuning without human intervention.	1,140
vita-epfl/monoloco	A software framework for 3D vision and computer vision tasks using deep learning and 2D keypoints.	431
nvidia-merlin/nvtabular	A library that provides a high-level abstraction for feature engineering and preprocessing of tabular data to accelerate deep learning recommender systems on GPUs.	1,057