encodec
Audio codec
A deep learning-based audio codec that supports high-fidelity neural audio compression.
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
4k stars
57 watching
304 forks
Language: Python
last commit: 11 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/audiocraft | A deep learning library for generating high-quality audio | 20,969 |
xiph/rnnoise | A deep learning-based audio noise reduction system using recurrent neural networks | 4,127 |
intel/neural-compressor | Tools and techniques for optimizing large language models on various frameworks and hardware platforms. | 2,226 |
facebookresearch/audio2photoreal | Generating photorealistic avatars from audio | 2,709 |
libaudioflux/audioflux | A deep learning tool library for extracting features from audio signals. | 2,915 |
facebookresearch/demucs | A deep learning model that separates multiple audio sources from mixed music tracks | 8,347 |
enhuiz/vall-e | An implementation of VALL-E in PyTorch for text-to-speech synthesis | 2,964 |
deep-floyd/if | A text-to-image synthesis model with a modular design, utilizing a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images. | 7,688 |
lucidrains/musiclm-pytorch | Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,166 |
mubertai/mubert-text-to-music | Generates music based on user input prompts using the Mubert API | 2,733 |
nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture | 2,285 |
oxford-cs-deepnlp-2017/lectures | An open-source repository containing lecture slides and course materials for an advanced natural language processing course. | 15,683 |
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 347 |
csteinmetz1/micro-tcn | A software framework for efficient modeling of analog audio dynamic range compression using neural networks | 151 |
pyannote/pyannote-audio | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,333 |