waveglow
Speech Synthesis Network
A PyTorch implementation of a speech synthesis network based on flow-based generative architecture.
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
206 stars
16 watching
35 forks
Language: Python
last commit: about 6 years ago neural-vocodertext-to-speechwaveglow
Related projects:
Repository | Description | Stars |
---|---|---|
fgxaos/pytorch-innvestigate | PyTorch implementation of an explainability technique for deep neural networks | 9 |
nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture | 2,285 |
seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,104 |
hanzhanggit/stackgan | A PyTorch implementation of a generative adversarial network for image synthesis from text descriptions | 1,860 |
ypxie/pytorch-neucom | An implementation of the Differentiable Neural Computer architecture in PyTorch | 94 |
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 288 |
archinetai/audio-diffusion-pytorch | An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input | 1,967 |
r9y9/tacotron_pytorch | An implementation of Tacotron speech synthesis model using PyTorch. | 309 |
jayleicn/animegan | A PyTorch implementation of Generative Adversarial Networks for anime face drawing | 1,280 |
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032 |
chingyaoc/ggnn.pytorch | An implementation of a neural network architecture for processing graph-structured data and making predictions on nodes. | 465 |
gram-ai/radio-transformer-networks | An implementation of a machine learning-based communications system using deep learning techniques. | 127 |
michaelklachko/pnn.pytorch | A PyTorch implementation of a neural network architecture that modifies the input to its layers by applying noise masks. | 57 |
peihaochen/regnet | An implementation of a neural network for generating sound from video sequences | 52 |
awni/speech | A PyTorch implementation of end-to-end speech recognition models. | 754 |