encodec
Audio codec
A deep-learning-based audio codec for high-fidelity neural audio compression.
State-of-the-art deep-learning-based audio codec supporting both mono 24 kHz and stereo 48 kHz audio.
4k stars
57 watching
309 forks
Language: Python
Last commit: about 1 year ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/audiocraft | A deep learning library for generating high-quality audio | 21,134 |
xiph/rnnoise | A deep learning-based audio noise reduction system using recurrent neural networks | 4,191 |
intel/neural-compressor | Tools and techniques for optimizing large language models on various frameworks and hardware platforms. | 2,257 |
facebookresearch/audio2photoreal | Generating photorealistic avatars from audio | 2,715 |
libaudioflux/audioflux | A deep learning tool library for extracting features from audio signals. | 2,940 |
facebookresearch/demucs | A deep learning model that separates multiple audio sources from mixed music tracks | 8,453 |
enhuiz/vall-e | An implementation of VALL-E in PyTorch for text-to-speech synthesis | 2,970 |
deep-floyd/if | A text-to-image synthesis model with a modular design, utilizing a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images. | 7,699 |
lucidrains/musiclm-pytorch | Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,189 |
mubertai/mubert-text-to-music | Generates music based on user input prompts using the Mubert API | 2,738 |
nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture | 2,294 |
oxford-cs-deepnlp-2017/lectures | An open-source repository containing lecture slides and course materials for an advanced natural language processing course. | 15,702 |
soerenab/audiomnist | A deep learning framework for classifying audio signals, with Explainable AI techniques that offer insight into the model's decision-making process. | 351 |
csteinmetz1/micro-tcn | A software framework for efficient modeling of analog audio dynamic range compression using neural networks | 150 |
pyannote/pyannote-audio | A PyTorch toolkit for speaker diarization and speech activity detection. | 6,508 |