ClariNet

Vocoder

An implementation of a neural network-based vocoder using parallel-wavenet architecture and autoregressive flow

A Pytorch Implementation of ClariNet

GitHub

289 stars
23 watching
64 forks
Language: Python
last commit: over 5 years ago
clarinetparallel-wavenetpytorchwavenet

Related projects:

Repository Description Stars
vocalpy/vak A Python framework for training and applying neural networks to acoustic communication research 78
stenzel/simplevocoder A simple audio processing algorithm using pitch synchronous synthesis 18
gemelo-ai/vocos An audio synthesis framework that uses neural networks to generate high-quality audio waveforms from acoustic features. 824
bytedance/salmonn A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities 1,053
kinwaicheuk/nnaudio An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data 1,032
jeremycchsu/python-wrapper-for-world-vocoder A Python wrapper for a high-quality vocoder that parameterizes speech into three components: 725
vincentherrmann/pytorch-wavenet An implementation of WaveNet for generating audio using PyTorch 975
dbaylies/jucevocoder A software implementation of a vocoder plug-in with built-in synthesis capabilities 18
deepsound-project/samplernn-pytorch An implementation of an audio generation model using PyTorch 288
kefirski/bytenet A Pytorch implementation of a neural network model for machine translation 47
keunwoochoi/auralisation Reconstructs audio features learned by convolutional neural networks into audible sounds 42
soerenab/audiomnist This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. 347
fancompute/wavetorch A Python package for solving the wave equation using recurrent neural networks and automatic differentiation. 518
npuichigo/waveglow A PyTorch implementation of a speech synthesis network based on flow-based generative architecture. 206
emmet-hayes/batsynth A polyphonic synthesizer with adjustable waveforms and tone controls. 13