encodec

Audio codec

A deep learning-based audio codec that supports high-fidelity neural audio compression.

A state-of-the-art deep learning-based audio codec that supports both mono 24 kHz and stereo 48 kHz audio.

GitHub

4k stars

57 watching

309 forks

Language: Python

Last commit: over 1 year ago

Linked from 1 awesome list

Backlinks from these awesome lists:

amrzv/awesome-colab-notebooks

Related projects:

Repository	Description	Stars
facebookresearch/audiocraft	A deep learning library for generating high-quality audio	21,134
xiph/rnnoise	A deep learning-based audio noise reduction system using recurrent neural networks	4,191
intel/neural-compressor	Tools and techniques for optimizing large language models on various frameworks and hardware platforms.	2,257
facebookresearch/audio2photoreal	Generating photorealistic avatars from audio	2,715
libaudioflux/audioflux	A deep learning tool library for extracting features from audio signals.	2,940
facebookresearch/demucs	A deep learning model that separates multiple audio sources from mixed music tracks	8,453
enhuiz/vall-e	An implementation of VALL-E in PyTorch for text-to-speech synthesis	2,970
deep-floyd/if	A text-to-image synthesis model with a modular design, utilizing a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images.	7,699
lucidrains/musiclm-pytorch	Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning.	3,189
mubertai/mubert-text-to-music	Generates music based on user input prompts using the Mubert API	2,738
nvidia/waveglow	Generates high-quality speech from mel-spectrograms using a flow-based network architecture	2,294
oxford-cs-deepnlp-2017/lectures	An open-source repository containing lecture slides and course materials for an advanced natural language processing course.	15,702
soerenab/audiomnist	A deep learning framework for classifying audio signals, with explainable AI (XAI) techniques that give insight into the model's decision-making process.	351
csteinmetz1/micro-tcn	A software framework for efficient modeling of analog audio dynamic range compression using neural networks	150
pyannote/pyannote-audio	A PyTorch toolkit for speaker diarization and speech activity detection.	6,508