audio

Audio toolkit

A PyTorch module providing tools and functions for audio signal processing

Data manipulation and transformation for audio signal processing, powered by PyTorch

GitHub

3k stars
73 watching
659 forks
Language: Python
last commit: about 2 months ago
Linked from 4 awesome lists

audioaudio-processingiomachine-learningpythonpytorchspeech

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
facebookresearch/audiocraft A deep learning library for generating high-quality audio 21,134
kinwaicheuk/nnaudio An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data 1,036
pytorch/pytorch A Python library providing tensors and dynamic neural networks with strong GPU acceleration 84,978
lucidrains/musiclm-pytorch Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. 3,189
archinetai/audio-diffusion-pytorch An audio generation library that uses diffusion models to produce high-quality audio samples from noise or text input 1,975
tyiannak/pyaudioanalysis A comprehensive Python library for feature extraction, classification, segmentation, and applications of audio data. 5,918
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,508
pytorch/torchtune A PyTorch library for easily authoring and experimenting with large language models 4,479
nvidia/tacotron2 This PyTorch implementation provides a toolkit for speech synthesis using a deep neural network architecture. 5,123
deepsound-project/samplernn-pytorch An implementation of an audio generation model using PyTorch 290
nvidia/apex Tools for streamlined mixed precision and distributed training in PyTorch 8,460
libaudioflux/audioflux A deep learning tool library for extracting features from audio signals. 2,940
pytorch/torchtitan A native PyTorch library for training large language models using distributed parallelism and optimization techniques. 2,765
microsoft/torchgeo Provides tools and pre-trained models for working with geospatial data in machine learning applications 3,083
lcav/pyroomacoustics Software package for rapid development and testing of audio array processing algorithms in indoor applications 1,480