tortoise-tts

TTS system

An open-source text-to-speech system trained with high-quality audio capabilities

A multi-voice TTS system trained with an emphasis on quality

GitHub

13k stars
174 watching
2k forks
Language: Jupyter Notebook
last commit: 2 months ago
Linked from 2 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mozilla/tts An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. 9,466
jasonppy/voicecraft A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio. 7,744
rvc-boss/gpt-sovits An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. 36,977
tensorspeech/tensorflowtts Real-time speech synthesis using state-of-the-art architectures 3,855
coqui-ai/tts A deep learning toolkit for generating human-like speech from text 36,118
metavoiceio/metavoice-src A deep learning model for generating human-like speech 3,936
huggingface/text-generation-inference A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation 9,456
camb-ai/mars5-tts A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. 2,551
coqui-ai/stt A toolkit for building and deploying speech-to-text models using deep learning techniques 2,302
google/sentencepiece An unsupervised text tokenizer that segments input text into subwords and detokenizes output based on a predefined vocabulary size. 10,366
openai/whisper A general-purpose speech recognition system trained on large-scale weak supervision 72,752
nvidia/tacotron2 This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture. 5,123
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,997
opennmt/ctranslate2 A high-performance inference engine for transformer models 3,467
jaywalnut310/vits Develops an end-to-end text-to-speech system that generates more natural audio than existing models 6,947