lip-reading-deeplearning
Lip reader
Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks.
Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
2k stars
55 watching
324 forks
Language: Python
last commit: over 2 years ago
Linked from 1 awesome list
3d-convolutional-networkcomputer-visiondeep-learningspeech-recognitiontensorflow
Related projects:
Repository | Description | Stars |
---|---|---|
| Develops deep learning models using 3D convolutional neural networks for speaker verification tasks | 783 |
| A software implementation of a deep learning model designed to understand lip movements in videos | 117 |
| Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
| A deep learning framework for scene text recognition with rectification and attention mechanisms. | 639 |
| A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining | 1,457 |
| Provides tools and libraries for extracting speech features from audio data. | 881 |
| A deep learning library for streamlining research and development using the Torch7 distribution. | 343 |
| Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. | 7 |
| A deep learning framework for human parsing that learns to detect human structures without explicit joint labeling. | 229 |
| A software framework for 3D vision and computer vision tasks using deep learning and 2D keypoints. | 431 |
| A deep learning library for image segmentation and object detection using PyTorch. | 1,054 |
| A Python library that enables seamless interaction between deep learning frameworks and Lua/Torch libraries. | 234 |
| A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning | 1,977 |
| A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
| An implementation of a deep learning system for semantic image segmentation using a combination of convolutional neural networks and conditional random fields. | 239 |