espnet
Speech processing toolkit
A toolkit for end-to-end speech processing with deep learning and Kaldi-style data processing
End-to-End Speech Processing Toolkit
9k stars
181 watching
2k forks
Language: Python
last commit: over 1 year ago
Linked from 2 awesome lists
chainerdeep-learningend-to-endkaldimachine-translationpytorchsinging-voice-synthesisspeaker-diarizationspeech-enhancementspeech-recognitionspeech-separationspeech-synthesisspeech-translationspoken-language-understandingtext-to-speechvoice-conversion
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A PyTorch module providing tools and functions for audio signal processing | 2,561 |
| | An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,970 |
| | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 9,066 |
| | An approach to leveraging pre-trained models for efficient speech processing tasks by using prompt tuning | 97 |
| | An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks | 6,251 |
| | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,508 |
| | A toolkit providing easy-to-use machine learning modules and functionalities for natural language processing and text generation tasks | 745 |
| | Develops state-of-the-art speech recognition systems using PyTorch and Kaldi toolkits | 2,370 |
| | This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture. | 5,123 |
| | An end-to-end neural speech recognition toolkit based on PyTorch and fairseq. | 941 |
| | A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,918 |
| | A comprehensive toolkit for natural language processing tasks in Python. | 13,694 |
| | A toolkit for training custom sequence-to-sequence models for various NLP tasks | 30,675 |
| | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,109 |
| | A PyTorch-based toolbox for building and training semantic segmentation models | 408 |