encodec

Audio codec

A deep learning-based audio codec that supports high-fidelity neural audio compression.

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

GitHub

4k stars
57 watching
304 forks
Language: Python
last commit: 11 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
facebookresearch/audiocraft A deep learning library for generating high-quality audio 20,969
xiph/rnnoise A deep learning-based audio noise reduction system using recurrent neural networks 4,127
intel/neural-compressor Tools and techniques for optimizing large language models on various frameworks and hardware platforms. 2,226
facebookresearch/audio2photoreal Generating photorealistic avatars from audio 2,709
libaudioflux/audioflux A deep learning tool library for extracting features from audio signals. 2,915
facebookresearch/demucs A deep learning model that separates multiple audio sources from mixed music tracks 8,347
enhuiz/vall-e An implementation of VALL-E in PyTorch for text-to-speech synthesis 2,964
deep-floyd/if A text-to-image synthesis model with a modular design, utilizing a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images. 7,688
lucidrains/musiclm-pytorch Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. 3,166
mubertai/mubert-text-to-music Generates music based on user input prompts using the Mubert API 2,733
nvidia/waveglow Generates high-quality speech from mel-spectrograms using a flow-based network architecture 2,285
oxford-cs-deepnlp-2017/lectures An open-source repository containing lecture slides and course materials for an advanced natural language processing course. 15,683
soerenab/audiomnist This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. 347
csteinmetz1/micro-tcn A software framework for efficient modeling of analog audio dynamic range compression using neural networks 151
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,333