VALL-E-X
TTS model
An open-source research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning. A demo is available at https://plachtaa.github.io/vallex/
8k stars
82 watching
761 forks
Language: Python
Last commit: 9 months ago
Topics: emotional-speech, gpt, text-to-speech, transformer-architecture, tts, vall-e, voice-clone
Related projects:
Repository | Description | Stars |
---|---|---|
lifeiteng/vall-e | A PyTorch implementation of a text-to-speech synthesizer based on large language models | 2,049 |
enhuiz/vall-e | An implementation of VALL-E in PyTorch for text-to-speech synthesis | 2,964 |
rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 35,728 |
mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,401 |
coqui-ai/tts | A deep learning toolkit for generating human-like speech from text | 35,453 |
camb-ai/mars5-tts | A deep learning-based speech synthesis model that generates natural-sounding audio with controlled prosody. | 2,530 |
metavoiceio/metavoice-src | A deep learning model for generating human-like speech | 3,891 |
netease-youdao/emotivoice | An open-source text-to-speech engine with emotion synthesis and multiple voice options | 7,436 |
rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 6,576 |
damo-nlp-sg/video-llama | An audio-visual language model designed to understand and respond to video content with improved instruction-following capabilities | 2,802 |
jackmort/chatgpt.nvim | A plugin for Neovim that integrates with the ChatGPT API to generate natural language responses and assist with coding tasks. | 3,779 |
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,328 |
espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,224 |
eleutherai/lm-evaluation-harness | Provides a unified framework to test generative language models on various evaluation tasks. | 6,970 |
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,440 |