metavoice-src
Speech synthesizer
A deep learning model for generating human-like speech
Foundational model for human-like, expressive TTS
4k stars
80 watching
664 forks
Language: Python
last commit: 7 months ago aideep-learningpytorchspeechspeech-synthesistext-to-speechttsvoice-clonezero-shot-tts
Related projects:
Repository | Description | Stars |
---|---|---|
| An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 36,977 |
| An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,466 |
| A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio. | 7,744 |
| A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,719 |
| An open-source text-to-speech engine with emotion synthesis and multiple voice options | 7,522 |
| An open-source text-to-speech system trained with high-quality audio capabilities | 13,373 |
| Real-time speech synthesis using state-of-the-art architectures | 3,855 |
| A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. | 2,551 |
| A fast local neural text-to-speech system optimized for small devices | 7,002 |
| A deep learning toolkit for generating human-like speech from text | 36,118 |
| A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 9,066 |
| Generating photorealistic avatars from audio | 2,715 |
| A text-to-audio model that generates realistic speech and other audio | 36,433 |
| An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,970 |
| A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,519 |