vall-e
VALL-E implementation
An implementation of VALL-E in PyTorch for text-to-speech synthesis
An unofficial PyTorch implementation of the audio LM VALL-E
3k stars
87 watching
419 forks
Language: Python
last commit: over 2 years ago audio-lmpytorchtext-to-speechttsvall-evalle
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,719 |
| | A deep learning library for generating high-quality audio | 21,134 |
| | Developing and pretraining a GPT-like Large Language Model from scratch | 35,405 |
| | A PyTorch module providing tools and functions for audio signal processing | 2,561 |
| | A PyTorch implementation of a text-to-speech synthesizer based on large language models | 2,062 |
| | Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,189 |
| | A collection of Variational AutoEncoder implementations in PyTorch | 6,776 |
| | This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture. | 5,123 |
| | Implementation of a face parsing model using PyTorch and a modified BiSeNet architecture. | 2,346 |
| | A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,918 |
| | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,508 |
| | A PyTorch library for implementing deep metric learning algorithms in computer vision applications. | 6,045 |
| | A collection of machine learning algorithms implemented in NumPy for rapid experimentation and prototyping. | 15,789 |
| | A comprehensive library for training and applying deep learning models for image segmentation | 9,829 |
| | A PyTorch implementation of the DeepLab-V3-Plus model with support for multiple backbones and datasets | 2,919 |