demucs

Audio source separator

A deep learning model that separates multiple audio sources from mixed music tracks

Code for the paper Hybrid Spectrogram and Waveform Source Separation

GitHub

8k stars
154 watching
1k forks
Language: Python
last commit: 7 months ago

Related projects:

Repository Description Stars
adefossez/mdx21_demucs A repository containing pre-trained models and code for music demixing using the Hybrid Demucs model 99
facebookresearch/encodec A deep learning-based audio codec that supports high-fidelity neural audio compression. 3,509
lucidrains/musiclm-pytorch Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. 3,166
mtg/deepconvsep A framework for training deep neural networks to separate music sources from audio files. 472
facebookresearch/metaseq A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. 6,515
js-mim/mss_pytorch This project provides a PyTorch implementation of a singing voice separation algorithm using recurrent inference and skip-filtering connections. 171
facebookresearch/detectron2 A platform for object detection and segmentation tasks using machine learning algorithms 30,539
abdullah-abuolaim/recurrent-defocus-deblurring-synth-dual-pixel This project provides tools and models to generate realistic data for camera systems with defocus blur, aiming to improve image deblurring techniques. 49
mubertai/mubert-text-to-music Generates music based on user input prompts using the Mubert API 2,733
libaudioflux/audioflux A deep learning tool library for extracting features from audio signals. 2,915
mlachmish/musicgenreclassification Classify music genre from a 10-second sound stream using a neural network. 562
facebookresearch/audio2photoreal Generating photorealistic avatars from audio 2,709
invokerer/deeprft Develops deep learning-based methods for removing blur and defocus from images 244
facebookresearch/mmf A modular framework for building vision and language multimodal research projects using PyTorch. 5,500
facebookresearch/spiritlm This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs. 777