pytorch-kaldi

Speech recognition toolkit

A toolkit for developing state-of-the-art deep learning-based speech recognition systems using PyTorch and Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
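
The hand-off between the two halves of a hybrid system can be illustrated with a minimal sketch in plain Python. In a DNN-HMM hybrid, the network typically produces per-frame posteriors over HMM states, and these are divided by the state priors before the scores go to the decoder. The function names, toy scores, and state counts below are assumptions made for illustration and are not part of the pytorch-kaldi API; in the toolkit itself the network would be a PyTorch model and decoding would be done by Kaldi.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one frame's output scores."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return [v - log_norm for v in logits]


def scaled_log_likelihoods(frame_logits, state_counts):
    """Convert per-frame network outputs into pseudo log-likelihoods.

    frame_logits: list of frames, each a list of scores (one per HMM state).
    state_counts: how often each state occurs in the training alignments
                  (hypothetical numbers in this sketch).
    """
    total = sum(state_counts)
    log_priors = [math.log(count / total) for count in state_counts]
    result = []
    for logits in frame_logits:
        log_post = log_softmax(logits)
        # log p(x | s) is proportional to log p(s | x) - log p(s)
        result.append([lp - pr for lp, pr in zip(log_post, log_priors)])
    return result


# Toy input: 2 frames, 3 HMM states (all values are made up).
frame_logits = [[2.0, 0.5, -1.0], [0.0, 0.0, 0.0]]
state_counts = [50, 30, 20]
scores = scaled_log_likelihoods(frame_logits, state_counts)
```

Subtracting the log prior keeps frequent states from dominating the decoder purely because they were common in the training labels.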

GitHub

2k stars
94 watching
446 forks
Language: Python
Last commit: over 2 years ago
asr, deep-learning, deep-neural-networks, dnn, dnn-hmm, gru, kaldi, lstm, lstm-neural-networks, multilayer-perceptron-network, pytorch, recurrent-neural-networks, rnn, rnn-model, speech, speech-recognition, timit

Related projects:

Repository | Description | Stars
speechbrain/speechbrain | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities | 8,922
kaldi-asr/kaldi | A toolkit for speech recognition and processing of audio data | 14,287
opensource-spraakherkenning-nl/kaldi_nl | Scripts and tools for speech recognition in Dutch using the Kaldi toolkit | 66
pyannote/pyannote-audio | A PyTorch toolkit for speaker diarization and speech activity detection | 6,333
pytorch/audio | A PyTorch module providing tools and functions for audio signal processing | 2,538
cszn/kair | Image restoration toolbox with training and testing codes for various deep learning-based methods | 2,957
rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 6,576
seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning | 2,104
karpathy/neuraltalk | A system for generating image descriptions using neural networks | 5,411
kaiyangzhou/deep-person-reid | A PyTorch library for training and retraining deep neural networks for person re-identification in images and videos | 4,318
layumi/person_reid_baseline_pytorch | A PyTorch implementation of an Object Re-ID baseline with various training methods and architectures | 4,126
namisan/mt-dnn | A PyTorch package implementing multi-task deep neural networks for natural language understanding | 2,238
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032
bentrevett/pytorch-sentiment-analysis | A tutorial repository for building sentiment analysis models with PyTorch using various architectures and techniques | 4,381
yunjey/pytorch-tutorial | A tutorial project with code examples for learning PyTorch by implementing various deep learning models | 30,251