demucs

Audio source separator

A deep learning model that separates multiple audio sources from mixed music tracks

Code for the paper Hybrid Spectrogram and Waveform Source Separation

GitHub

8k stars
155 watching
1k forks
Language: Python
last commit: 9 months ago

Related projects:

Repository Description Stars
adefossez/mdx21_demucs A repository containing pre-trained models and code for music demixing using the Hybrid Demucs model 103
facebookresearch/encodec A deep learning-based audio codec that supports high-fidelity neural audio compression. 3,536
lucidrains/musiclm-pytorch Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. 3,189
mtg/deepconvsep A framework for training deep neural networks to separate music sources from audio files. 474
facebookresearch/metaseq A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. 6,519
js-mim/mss_pytorch This project provides a PyTorch implementation of a singing voice separation algorithm using recurrent inference and skip-filtering connections. 171
facebookresearch/detectron2 A platform for object detection and segmentation tasks using machine learning algorithms 30,778
abdullah-abuolaim/recurrent-defocus-deblurring-synth-dual-pixel This project provides tools and models to generate realistic data for camera systems with defocus blur, aiming to improve image deblurring techniques. 49
mubertai/mubert-text-to-music Generates music based on user input prompts using the Mubert API 2,738
libaudioflux/audioflux A deep learning tool library for extracting features from audio signals. 2,940
mlachmish/musicgenreclassification Classify music genre from a 10-second sound stream using a neural network. 565
facebookresearch/audio2photoreal Generating photorealistic avatars from audio 2,715
invokerer/deeprft Deblurring technique using deep learning and Fourier transformation to remove image blur 248
facebookresearch/mmf A modular framework for building vision and language multimodal research projects using PyTorch. 5,510
facebookresearch/spiritlm This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs. 845