 lip-reading-deeplearning
 lip-reading-deeplearning 
 Lip reader
 Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks.
 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
2k stars
 55 watching
 324 forks
 
Language: Python 
last commit: almost 3 years ago 
Linked from   1 awesome list  
  3d-convolutional-networkcomputer-visiondeep-learningspeech-recognitiontensorflow 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | Develops deep learning models using 3D convolutional neural networks for speaker verification tasks | 783 | 
|  | A software implementation of a deep learning model designed to understand lip movements in videos | 117 | 
|  | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 | 
|  | A deep learning framework for scene text recognition with rectification and attention mechanisms. | 639 | 
|  | A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining | 1,457 | 
|  | Provides tools and libraries for extracting speech features from audio data. | 881 | 
|  | A deep learning library for streamlining research and development using the Torch7 distribution. | 343 | 
|  | Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. | 7 | 
|  | A deep learning framework for human parsing that learns to detect human structures without explicit joint labeling. | 229 | 
|  | A software framework for 3D vision and computer vision tasks using deep learning and 2D keypoints. | 431 | 
|  | A deep learning library for image segmentation and object detection using PyTorch. | 1,054 | 
|  | A Python library that enables seamless interaction between deep learning frameworks and Lua/Torch libraries. | 234 | 
|  | A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning | 1,977 | 
|  | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 | 
|  | An implementation of a deep learning system for semantic image segmentation using a combination of convolutional neural networks and conditional random fields. | 239 |