audio
Audio toolkit
A PyTorch module providing tools and functions for audio signal processing
Data manipulation and transformation for audio signal processing, powered by PyTorch
3k stars
72 watching
653 forks
Language: Python
last commit: 6 days ago
Linked from 4 awesome lists
audioaudio-processingiomachine-learningpythonpytorchspeech
Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/audiocraft | A deep learning library for generating high-quality audio | 20,969 |
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032 |
pytorch/pytorch | A Python library providing tensors and dynamic neural networks with strong GPU acceleration | 83,959 |
lucidrains/musiclm-pytorch | Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,166 |
archinetai/audio-diffusion-pytorch | An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input | 1,961 |
tyiannak/pyaudioanalysis | A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. | 5,885 |
pyannote/pyannote-audio | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,333 |
pytorch/torchtune | A library that provides an easy-to-use interface for authoring, finetuning, and experimenting with large language models | 4,320 |
nvidia/tacotron2 | This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture. | 5,099 |
deepsound-project/samplernn-pytorch | An implementation of an audio generation model using PyTorch | 288 |
nvidia/apex | Tools for streamlined mixed precision and distributed training in PyTorch | 8,407 |
libaudioflux/audioflux | A deep learning tool library for extracting features from audio signals. | 2,915 |
pytorch/torchtitan | A native PyTorch library for large-scale language model training with distributed training capabilities | 2,615 |
microsoft/torchgeo | A PyTorch library providing datasets, samplers, transforms, and pre-trained models for working with geospatial data in machine learning and remote sensing applications. | 2,753 |
lcav/pyroomacoustics | Software package for rapid development and testing of audio array processing algorithms in indoor applications | 1,460 |