deepspeech.pytorch

Speech recognition system

A deep learning-based speech recognition system built on top of PyTorch Lightning.

Speech Recognition using DeepSpeech2.


2k stars
51 watching
619 forks
Language: Python
Last commit: almost 2 years ago

Related projects:

Repository | Description | Stars
seannaren/deepspeech.torch | A speech recognition system based on the DeepSpeech2 architecture | 259
4uiiurz1/pytorch-res2net | Implementations of deep learning architectures using PyTorch for image classification tasks on various datasets | 112
l0sg/relational-rnn-pytorch | An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 244
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 288
r9y9/deepvoice3_pytorch | An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,969
eduardoleao052/js-pytorch | A JavaScript library that provides GPU-accelerated deep learning capabilities with automatic differentiation and neural network layers | 1,084
zhanghang1989/pytorch-encoding | A Python framework for building deep learning models with optimized encoding layers and batch normalization | 2,041
javeywang/pyramid-attention-networks-pytorch | An implementation of a deep learning model using PyTorch for semantic segmentation tasks | 235
tstandley/xception-pytorch | An implementation of a deep learning model using PyTorch and depthwise separable convolutions for image classification | 250
awni/speech | A PyTorch implementation of end-to-end speech recognition models | 754
kazuto1011/pspnet-pytorch | Re-implementation of a deep learning model for semantic segmentation using PyTorch | 52
potterhsu/svhnclassifier-pytorch | A PyTorch implementation of multi-digit number recognition from street view imagery using deep convolutional neural networks | 200
isht7/pytorch-deeplab-resnet | A deep learning model implementation of the DeepLab ResNet architecture for image segmentation tasks | 602
tramac/fast-scnn-pytorch | A PyTorch implementation of a deep learning model for semantic segmentation tasks in computer vision | 381
seannaren/warp-ctc | PyTorch bindings for the Warp-CTC loss function used in speech recognition | 757