waveglow

Speech Synthesis Network

A PyTorch implementation of a speech synthesis network based on flow-based generative architecture.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

GitHub

206 stars
16 watching
35 forks
Language: Python
last commit: about 6 years ago
neural-vocodertext-to-speechwaveglow

Related projects:

Repository Description Stars
fgxaos/pytorch-innvestigate PyTorch implementation of an explainability technique for deep neural networks 9
nvidia/waveglow Generates high-quality speech from mel-spectrograms using a flow-based network architecture 2,285
seannaren/deepspeech.pytorch A deep learning-based speech recognition system built on top of PyTorch Lightning. 2,104
hanzhanggit/stackgan A PyTorch implementation of a generative adversarial network for image synthesis from text descriptions 1,860
ypxie/pytorch-neucom An implementation of the Differentiable Neural Computer architecture in PyTorch 94
deepsound-project/samplernn-pytorch An implementation of an audio generation model using PyTorch 288
archinetai/audio-diffusion-pytorch An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input 1,967
r9y9/tacotron_pytorch An implementation of Tacotron speech synthesis model using PyTorch. 309
jayleicn/animegan A PyTorch implementation of Generative Adversarial Networks for anime face drawing 1,280
kinwaicheuk/nnaudio An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data 1,032
chingyaoc/ggnn.pytorch An implementation of a neural network architecture for processing graph-structured data and making predictions on nodes. 465
gram-ai/radio-transformer-networks An implementation of a machine learning-based communications system using deep learning techniques. 127
michaelklachko/pnn.pytorch A PyTorch implementation of a neural network architecture that modifies the input to its layers by applying noise masks. 57
peihaochen/regnet An implementation of a neural network for generating sound from video sequences 52
awni/speech A PyTorch implementation of end-to-end speech recognition models. 754