vall-e
Text-to-speech system
A PyTorch implementation of a text-to-speech synthesizer based on large language models
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
2k stars
49 watching
320 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
chatgptin-context-learninglarge-language-modelstext-to-speechttsvall-evalle
Related projects:
Repository | Description | Stars |
---|---|---|
| A PyTorch implementation of an end-to-end text-to-speech synthesis model. | 207 |
| A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,719 |
| An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,970 |
| An implementation of a deep learning-based text-to-speech model using the FFTNet architecture. | 81 |
| An implementation of Tacotron speech synthesis model using PyTorch. | 309 |
| A PyTorch implementation of end-to-end speech recognition models. | 756 |
| Real-time end-to-end singing voice conversion system using DDSP models and various machine learning techniques | 1,920 |
| A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. | 28 |
| A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities | 246 |
| A generative speech model designed to synthesize natural and expressive dialogue in interactive conversations. | 32,941 |
| Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 270 |
| An implementation of the GPT-2 language model in PyTorch for generating text | 976 |
| A toolkit providing easy-to-use machine learning modules and functionalities for natural language processing and text generation tasks | 745 |
| Generates images from Russian texts using AI models | 1,643 |
| An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 245 |