TTS
Speech generator
A deep learning toolkit for generating human-like speech from text
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
35k stars
294 watching
4k forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list
deep-learningglow-ttshifiganmelganmulti-speaker-ttspythonpytorchspeaker-encoderspeaker-encodingsspeechspeech-synthesistacotrontext-to-speechttstts-modelvocodervoice-cloningvoice-conversionvoice-synthesis
Related projects:
Repository | Description | Stars |
---|---|---|
coqui-ai/stt | A toolkit for building and deploying speech-to-text models using deep learning techniques | 2,283 |
mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,401 |
camb-ai/mars5-tts | A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. | 2,530 |
rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 35,728 |
ai-shifu/chatall | A platform that enables concurrent interaction with multiple AI chatbots to find the best answers. | 15,241 |
jasonppy/voicecraft | A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio. | 7,638 |
openvinotoolkit/openvino | A toolkit for optimizing and deploying artificial intelligence models in various applications | 7,279 |
plachtaa/vall-e-x | A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,670 |
tensorspeech/tensorflowtts | Real-time speech synthesis using state-of-the-art architectures | 3,839 |
deepseek-ai/deepseek-v2 | A high-performance mixture-of-experts language model with strong performance and efficient inference capabilities. | 3,590 |
huggingface/text-generation-inference | A toolkit for deploying and serving Large Language Models. | 9,106 |
microsoft/deepspeed | A deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 35,463 |
neonbjb/tortoise-tts | A multi-voice text-to-speech system trained on high-quality data | 13,225 |
conchylicultor/deepqa | A deep learning-based chatbot model using TensorFlow and RNNs to generate responses to user queries. | 2,934 |
parisneo/lollms-webui | An all-encompassing tool providing a web interface to access and utilize various AI models for tasks such as text generation, image analysis, music generation, and more. | 4,344 |