audiocraft

Audio generator

A deep learning library for generating high-quality audio

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

GitHub

21k stars

211 watching

2k forks

Language: Python

last commit: 9 months ago

Linked from 2 awesome lists

Backlinks from these awesome lists:

Related projects:

Repository	Description	Stars
pytorch/audio	A PyTorch module providing tools and functions for audio signal processing	2,561
lucidrains/musiclm-pytorch	Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning.	3,189
archinetai/audio-diffusion-pytorch	An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input	1,975
libaudioflux/audioflux	A deep learning tool library for extracting features from audio signals.	2,940
enhuiz/vall-e	An implementation of VALL-E in PyTorch for text-to-speech synthesis	2,970
deepsound-project/samplernn-pytorch	An implementation of an audio generation model using PyTorch	290
tyiannak/pyaudioanalysis	A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data.	5,918
facebookresearch/encodec	A deep learning-based audio codec that supports high-fidelity neural audio compression.	3,536
spotify/pedalboard	A Python library for processing and manipulating audio data	5,286
pyannote/pyannote-audio	A toolkit for speaker diarization using PyTorch and speech activity detection.	6,508
facebookresearch/audio2photoreal	Generating photorealistic avatars from audio	2,715
haoheliu/audioldm	A Python-based audio generation tool that can produce speech, sound effects, music, and more, using text as input or guided by user description.	2,483
jasonppy/voicecraft	A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio.	7,744
aigc-audio/audiogpt	An audio processing toolkit that provides pre-trained models and tools for tasks like speech synthesis, music generation, sound detection, and talking head creation.	10,061
nvidia/waveglow	Generates high-quality speech from mel-spectrograms using a flow-based network architecture	2,294