vall-e
VALL-E implementation
An implementation of VALL-E in PyTorch for text-to-speech synthesis
An unofficial PyTorch implementation of the audio LM VALL-E
3k stars
87 watching
419 forks
Language: Python
last commit: almost 2 years ago audio-lmpytorchtext-to-speechttsvall-evalle
Related projects:
Repository | Description | Stars |
---|---|---|
| A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,719 |
| A deep learning library for generating high-quality audio | 21,134 |
| Developing and pretraining a GPT-like Large Language Model from scratch | 35,405 |
| A PyTorch module providing tools and functions for audio signal processing | 2,561 |
| A PyTorch implementation of a text-to-speech synthesizer based on large language models | 2,062 |
| Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,189 |
| A collection of Variational AutoEncoder implementations in PyTorch | 6,776 |
| This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture. | 5,123 |
| Implementation of a face parsing model using PyTorch and a modified BiSeNet architecture. | 2,346 |
| A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,918 |
| A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,508 |
| A PyTorch library for implementing deep metric learning algorithms in computer vision applications. | 6,045 |
| A collection of machine learning algorithms implemented in NumPy for rapid experimentation and prototyping. | 15,789 |
| A comprehensive library for training and applying deep learning models for image segmentation | 9,829 |
| A PyTorch implementation of the DeepLab-V3-Plus model with support for multiple backbones and datasets | 2,919 |