pytorch-wavenet

Audio generator

An implementation of WaveNet in PyTorch for generating audio, with fast generation

979 stars
30 watching
229 forks
Language: Jupyter Notebook
Last commit: over 4 years ago
Linked from 2 awesome lists


Related projects:

Repository | Description | Stars
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 290
peihaochen/regnet | An implementation of a neural network for generating sound from video sequences | 52
chrisdonahue/wavegan | An open-source machine learning algorithm for generating raw audio waveforms from raw data | 1,334
archinetai/audio-diffusion-pytorch | An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input | 1,975
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,036
soroushmehr/samplernn_iclr2017 | An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture | 537
ycollet/psi-plugins | A collection of LV2 plugins for generating sound in electronic music | 9
ypxie/hdgan | A PyTorch implementation of an adversarial network for generating high-definition images from text descriptions | 150
orhun/linuxwave | Generates music from random data | 541
tomrunia/pytorchwavelets | A PyTorch implementation of wavelet analysis for signal processing | 307
npuichigo/waveglow | A PyTorch implementation of a speech synthesis network based on a flow-based generative architecture | 206
ksw0306/clarinet | An implementation of a neural network-based vocoder using the parallel WaveNet architecture and autoregressive flow | 290
jayleicn/animegan | A PyTorch implementation of Generative Adversarial Networks for anime face drawing | 1,282
r9y9/tacotron_pytorch | An implementation of the Tacotron speech synthesis model using PyTorch | 309
v-iashin/specvqgan | A project to train an audio generation model that uses visual cues to produce high-quality sounds from a reduced dataset of representative vectors | 353