deepspeech.pytorch
Speech recognition system
A deep learning-based speech recognition system built on top of PyTorch Lightning.
Speech Recognition using DeepSpeech2.
2k stars
51 watching
619 forks
Language: Python
last commit: almost 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
seannaren/deepspeech.torch | A speech recognition system based on the DeepSpeech2 architecture | 259 |
4uiiurz1/pytorch-res2net | Implementations of deep learning architectures using PyTorch for image classification tasks on various datasets. | 112 |
l0sg/relational-rnn-pytorch | An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 244 |
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 288 |
r9y9/deepvoice3_pytorch | An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,969 |
eduardoleao052/js-pytorch | A JavaScript library that provides GPU-accelerated deep learning capabilities with automatic differentiation and neural network layers. | 1,084 |
zhanghang1989/pytorch-encoding | A Python framework for building deep learning models with optimized encoding layers and batch normalization. | 2,041 |
javeywang/pyramid-attention-networks-pytorch | An implementation of a deep learning model using PyTorch for semantic segmentation tasks. | 235 |
tstandley/xception-pytorch | An implementation of a deep learning model using PyTorch and depthwise separable convolutions for image classification | 250 |
awni/speech | A PyTorch implementation of end-to-end speech recognition models. | 754 |
kazuto1011/pspnet-pytorch | Re-implementation of a deep learning model for semantic segmentation using PyTorch. | 52 |
potterhsu/svhnclassifier-pytorch | A PyTorch implementation of multi-digit number recognition from street view imagery using deep convolutional neural networks | 200 |
isht7/pytorch-deeplab-resnet | A deep learning model implementation of the DeepLab ResNet architecture for image segmentation tasks. | 602 |
tramac/fast-scnn-pytorch | A PyTorch implementation of a deep learning model for semantic segmentation tasks in computer vision. | 381 |
seannaren/warp-ctc | PyTorch bindings for the Warp-CTC loss function used in speech recognition. | 757 |