deepspeech.pytorch
Speech recognition system
A deep learning-based speech recognition system built on top of PyTorch Lightning.
Speech Recognition using DeepSpeech2.
2k stars
52 watching
620 forks
Language: Python
last commit: about 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
seannaren/deepspeech.torch | A speech recognition system based on the DeepSpeech2 architecture | 259 |
4uiiurz1/pytorch-res2net | Implementations of deep learning architectures using PyTorch for image classification tasks on various datasets. | 112 |
l0sg/relational-rnn-pytorch | An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 245 |
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 290 |
r9y9/deepvoice3_pytorch | An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,970 |
eduardoleao052/js-pytorch | A JavaScript library that provides GPU-accelerated deep learning capabilities with automatic differentiation and neural network layers. | 1,093 |
zhanghang1989/pytorch-encoding | A Python framework for building deep learning models with optimized encoding layers and batch normalization. | 2,044 |
javeywang/pyramid-attention-networks-pytorch | An implementation of a deep learning model using PyTorch for semantic segmentation tasks. | 237 |
tstandley/xception-pytorch | An implementation of a deep learning model using PyTorch and depthwise separable convolutions for image classification | 249 |
awni/speech | A PyTorch implementation of end-to-end speech recognition models. | 756 |
kazuto1011/pspnet-pytorch | Re-implementation of a deep learning model for semantic segmentation using PyTorch. | 52 |
potterhsu/svhnclassifier-pytorch | A PyTorch implementation of multi-digit number recognition from street view imagery using deep convolutional neural networks | 200 |
isht7/pytorch-deeplab-resnet | A deep learning model implementation of the DeepLab ResNet architecture for image segmentation tasks. | 602 |
tramac/fast-scnn-pytorch | A PyTorch implementation of a deep learning model for semantic segmentation tasks in computer vision. | 380 |
seannaren/warp-ctc | PyTorch bindings for the Warp-CTC loss function used in speech recognition. | 757 |