espresso

ASR toolkit

An end-to-end neural speech recognition toolkit based on PyTorch and fairseq.

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

GitHub

942 stars
42 watching
116 forks
Language: Python
last commit: 3 months ago
asrend-to-endfairseqkaldipythonpytorchspeech-recognition

Related projects:

Repository Description Stars
asyml/texar-pytorch A toolkit providing easy-to-use machine learning modules and functionalities for natural language processing and text generation tasks 745
arjo129/uspeech A toolkit for speech recognition on Arduino using C++ 473
awni/speech A PyTorch implementation of end-to-end speech recognition models. 754
linto-ai/whisper-timestamped An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy 2,045
thecodrr/vspeech Provides an interface to Mozilla's DeepSpeech TensorFlow-based Speech-to-Text library using V bindings. 50
fuelen/owl A toolkit for building and customizing command-line user interfaces in Elixir. 436
seannaren/deepspeech.pytorch A deep learning-based speech recognition system built on top of PyTorch Lightning. 2,104
artfwo/aiosc A minimalistic Open Sound Control communication module using asyncio for network operations. 37
kinwaicheuk/nnaudio An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data 1,032
abitdodgy/gibran A natural language processing toolkit with tokenization and Levenshtein distance functionality 65
misaogura/flashtorch Toolkit for visualizing neural network behavior in PyTorch 734
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,217
marl/pysox A Python wrapper around an audio signal processing library. 519
openseg-group/openseg.pytorch Provides a PyTorch implementation of several computer vision tasks including object detection, segmentation and parsing. 1,190
flagai-open/aquila2 Provides pre-trained language models and tools for fine-tuning and evaluation 437