tortoise-tts

TTS system

An open-source text-to-speech system trained with high-quality audio capabilities

A multi-voice TTS system trained with an emphasis on quality

GitHub

13k stars

174 watching

2k forks

Language: Jupyter Notebook

last commit: over 1 year ago

Linked from 2 awesome lists

Backlinks from these awesome lists:

Related projects:

Repository	Description	Stars
mozilla/tts	An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis.	9,466
jasonppy/voicecraft	A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio.	7,744
rvc-boss/gpt-sovits	An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models.	36,977
tensorspeech/tensorflowtts	Real-time speech synthesis using state-of-the-art architectures	3,855
coqui-ai/tts	A deep learning toolkit for generating human-like speech from text	36,118
metavoiceio/metavoice-src	A deep learning model for generating human-like speech	3,936
huggingface/text-generation-inference	A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation	9,456
camb-ai/mars5-tts	A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody.	2,551
coqui-ai/stt	A toolkit for building and deploying speech-to-text models using deep learning techniques	2,302
google/sentencepiece	An unsupervised text tokenizer that segments input text into subwords and detokenizes output based on a predefined vocabulary size.	10,366
openai/whisper	A general-purpose speech recognition system trained on large-scale weak supervision	72,752
nvidia/tacotron2	This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture.	5,123
eleutherai/gpt-neox	Provides a framework for training large-scale language models on GPUs with advanced features and optimizations.	6,997
opennmt/ctranslate2	A high-performance inference engine for transformer models	3,467
jaywalnut310/vits	Develops an end-to-end text-to-speech system that generates more natural audio than existing models	6,947