pytorch-kaldi

Speech recognizer

Develops state-of-the-art speech recognition systems using PyTorch and Kaldi toolkits

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

GitHub

2k stars
94 watching
446 forks
Language: Python
last commit: almost 3 years ago
asrdeep-learningdeep-neural-networksdnndnn-hmmgrukaldilstmlstm-neural-networksmultilayer-perceptron-networkpytorchrecurrent-neural-networksrnnrnn-modelspeechspeech-recognitiontimit

Related projects:

Repository Description Stars
speechbrain/speechbrain A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. 9,066
kaldi-asr/kaldi A software toolkit for building speech recognition systems using various programming languages and platforms. 14,362
opensource-spraakherkenning-nl/kaldi_nl This project provides scripts and tools for speech recognition in Dutch using the Kaldi toolkit 66
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,508
pytorch/audio A PyTorch module providing tools and functions for audio signal processing 2,561
cszn/kair Image restoration toolbox with training and testing codes for various deep learning-based methods 2,994
rhasspy/piper A fast local neural text-to-speech system optimized for small devices 7,002
seannaren/deepspeech.pytorch A deep learning-based speech recognition system built on top of PyTorch Lightning. 2,109
karpathy/neuraltalk A system for generating image descriptions using neural networks 5,414
kaiyangzhou/deep-person-reid A PyTorch library for training and retraining deep neural networks for person re-identification in images and videos. 4,353
layumi/person_reid_baseline_pytorch A PyTorch implementation of an Object Re-ID baseline with various training methods and architectures 4,149
namisan/mt-dnn A PyTorch package implementing multi-task deep neural networks for natural language understanding 2,238
kinwaicheuk/nnaudio An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data 1,036
bentrevett/pytorch-sentiment-analysis A tutorial repository for building sentiment analysis models with PyTorch using various architectures and techniques. 4,419
yunjey/pytorch-tutorial A comprehensive tutorial project that provides code examples for learning PyTorch by implementing various deep learning models and demonstrating their usage. 30,401