pytorch-kaldi
Speech recognition toolkit
A toolkit for developing state-of-the-art deep learning-based speech recognition systems using PyTorch and Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
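In hybrid DNN-HMM systems of this kind, the network typically produces per-frame posteriors over HMM states, which are divided by the state priors before being passed to the decoder. The sketch below illustrates that conversion in plain Python rather than PyTorch. The function names and all numeric values are hypothetical and chosen for illustration; this is not pytorch-kaldi's actual code.

```python
import math


def softmax(logits):
    """Turn raw network outputs for one frame into state posteriors."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_log_likelihoods(logits, state_priors):
    """Hypothetical helper: log p(state | frame) - log p(state).

    By Bayes' rule this equals log p(frame | state) up to a constant
    that is the same for every state, which is the kind of score an
    HMM decoder consumes.
    """
    posteriors = softmax(logits)
    return [math.log(p) - math.log(q) for p, q in zip(posteriors, state_priors)]


# Assumed toy values: 3 HMM states, one frame of network outputs.
frame_logits = [2.0, 0.5, -1.0]
state_priors = [0.5, 0.3, 0.2]  # e.g. relative state counts in the training labels

scores = scaled_log_likelihoods(frame_logits, state_priors)
print(scores)
```

In a real pipeline the logits would come from the trained network and the priors from the aligned training labels; only the prior-normalisation step is shown here.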
2k stars
94 watching
446 forks
Language: Python
last commit: over 2 years ago
tags: asr, deep-learning, deep-neural-networks, dnn, dnn-hmm, gru, kaldi, lstm, lstm-neural-networks, multilayer-perceptron-network, pytorch, recurrent-neural-networks, rnn, rnn-model, speech, speech-recognition, timit
Related projects:
Repository | Description | Stars |
---|---|---|
speechbrain/speechbrain | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 8,922 |
kaldi-asr/kaldi | A toolkit for speech recognition and processing of audio data | 14,287 |
opensource-spraakherkenning-nl/kaldi_nl | This project provides scripts and tools for speech recognition in Dutch using the Kaldi toolkit | 66 |
pyannote/pyannote-audio | A PyTorch toolkit for speaker diarization and speech activity detection. | 6,333 |
pytorch/audio | A PyTorch module providing tools and functions for audio signal processing | 2,538 |
cszn/kair | Image restoration toolbox with training and testing codes for various deep learning-based methods | 2,957 |
rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 6,576 |
seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,104 |
karpathy/neuraltalk | A system for generating image descriptions using neural networks | 5,411 |
kaiyangzhou/deep-person-reid | A PyTorch library for training and retraining deep neural networks for person re-identification in images and videos. | 4,318 |
layumi/person_reid_baseline_pytorch | A PyTorch implementation of an Object Re-ID baseline with various training methods and architectures | 4,126 |
namisan/mt-dnn | A PyTorch package implementing multi-task deep neural networks for natural language understanding | 2,238 |
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032 |
bentrevett/pytorch-sentiment-analysis | A tutorial repository for building sentiment analysis models with PyTorch using various architectures and techniques. | 4,381 |
yunjey/pytorch-tutorial | A comprehensive tutorial project that provides code examples for learning PyTorch by implementing various deep learning models and demonstrating their usage. | 30,251 |