audio-diffusion-pytorch
Audio generator
An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input.
Audio generation using diffusion models, in PyTorch.
2k stars
40 watching
168 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list
artificial-intelligence, audio-generation, deep-learning, denoising-diffusion
Related projects:
Repository | Description | Stars |
---|---|---|
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 288 |
superkogito/pydiogment | A Python library for generating multiple audio files based on a starting mono audio file with various effects such as speed change, tone alteration and noise addition. | 83 |
birch-san/diffusers | A toolkit for creating and manipulating state-of-the-art diffusion models in PyTorch | 8 |
akanimax/pro_gan_pytorch | Implementation of a deep learning model for generating high-quality images with improved stability and variation. | 536 |
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032 |
chrisdonahue/wavegan | An open-source machine learning algorithm that learns to synthesize raw audio waveforms from examples of real audio | 1,330 |
vincentherrmann/pytorch-wavenet | An implementation of WaveNet for generating audio using PyTorch | 975 |
soroushmehr/samplernn_iclr2017 | An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture. | 537 |
peihaochen/regnet | An implementation of a neural network for generating sound from video sequences | 52 |
rbbrdckybk/ai-art-generator | Automates large batches of AI-generated artwork locally using GPU acceleration. | 634 |
ibm/max-audio-sample-generator | A tool to generate audio samples based on input commands and lo-fi instrumental music tracks. | 21 |
virtualanalogy/paraphrasis | A spectral modeling synthesizer for generating sound. | 31 |
bitgamma/synthex | A library for generating and processing audio signals | 44 |
ashual/scene_generation | A PyTorch implementation of a deep learning-based method for generating interactive scenes with specified object attributes and relations | 187 |
auto1111sdk/auto1111sdk | A Python library for generating images with stable diffusion models | 397 |