awesome-python-scientific-audio

Audio analysis library

A curated collection of Python packages and tools for scientific research in audio and music applications

Curated list of python software and packages related to scientific research in audio

GitHub

2k stars
77 watching
170 forks
last commit: over 1 year ago
Linked from 6 awesome lists

audioawesome-listpython

audiolazy 692 over 2 years ago Expressive Digital Signal Processing (DSP) package for Python
audioread 497 11 months ago Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding
mutagen Reads and writes all kind of audio metadata for various formats
pyAV PyAV is a Pythonic binding for FFmpeg or Libav
(Py)Soundfile Library based on libsndfile, CFFI, and NumPy
pySox 519 7 months ago Wrapper for sox
stempeg 96 5 months ago read/write of STEMS multistream audio
tinytag 714 about 1 month ago reading music meta data of MP3, OGG, FLAC and Wave files
acoustics useful tools for acousticians
AudioTK 252 about 4 years ago DSP filter toolbox (lots of filters)
AudioTSM real-time audio time-scale modification procedures
Gammatone 220 over 1 year ago Gammatone filterbank implementation
pyFFTW Wrapper for FFTW(3)
NSGT Non-stationary gabor transform, constant-q
matchering 1,842 about 2 months ago Automated reference audio mastering
MDCT 51 over 2 years ago MDCT transform
pydub Manipulate audio with a simple and easy high level interface
pytftb Implementation of the MATLAB Time-Frequency Toolbox
pyroomacoustics 1,480 about 1 month ago Room Acoustics Simulation (RIR generator)
PyRubberband 167 4 months ago Wrapper for to do pitch-shifting and time-stretching
PyWavelets Discrete Wavelet Transform in Python
Resampy Sample rate conversion
SFS-Python Sound Field Synthesis Toolbox
sound_field_analysis Analyze, visualize and process sound field data recorded by spherical microphone arrays
STFT Standalone package for Short-Time Fourier Transform
aubio Feature extractor, written in C, Python interface
audioFlux 2,940 8 months ago A library for audio and music analysis, feature extraction
audiolazy 692 over 2 years ago Realtime Audio Processing lib, general purpose
essentia Music related low level and high level feature extractor, C++ based, includes Python bindings
python_speech_features 2,379 about 3 years ago Common speech features for ASR
pyYAAFE 244 over 3 years ago Python bindings for YAAFE feature extractor
speechpy 881 about 3 years ago Library for Speech Processing and Recognition, mostly feature extraction for now
spafe 461 7 months ago Python library for features extraction from audio files
audiomentations 1,903 about 1 month ago Audio Data Augmentation
muda Musical Data Augmentation
pydiogment 83 over 1 year ago Audio Data Augmentation
aeneas Forced aligner, based on MFCC+DTW, 35+ languages
deepspeech 25,475 5 months ago Pretrained automatic speech recognition
gentle 1,471 9 months ago Forced-aligner built on Kaldi
Parselmouth 1,076 3 months ago Python interface to the phonetics and speech analysis, synthesis, and manipulation software
persephone Automatic phoneme transcription tool
pyannote.audio 6,508 about 1 month ago Neural building blocks for speaker diarization
pyAudioAnalysis 5,918 10 months ago ² - Feature Extraction, Classification, Diarization
py-webrtcvad 2,088 7 months ago Interface to the WebRTC Voice Activity Detector
pypesq 360 over 1 year ago Wrapper for the PESQ score calculation
pystoi 327 about 1 year ago Short Term Objective Intelligibility measure (STOI)
PyWorldVocoder 729 about 1 year ago Wrapper for Morise's World Vocoder
Montreal Forced Aligner Forced aligner, based on Kaldi (HMM), English (others can be trained)
SIDEKIT Speaker and Language recognition
SpeechRecognition 8,483 about 1 month ago Wrapper for several ASR engines and APIs, online and offline
sed_eval Evaluation toolbox for Sound Event Detection
cochlea 109 6 months ago Inner ear models
Brian2 Spiking neural networks simulator, includes cochlea model
Loudness 36 over 5 years ago Perceived loudness, includes Zwicker, Moore/Glasberg model
pyloudnorm Audio loudness meter and normalization, implements ITU-R BS.1770-4
Sound Field Synthesis Toolbox Sound Field Synthesis Toolbox
commonfate 17 almost 5 years ago Common Fate Model and Transform
NTFLib 48 about 9 years ago Sparse Beta-Divergence Tensor Factorization
NUSSL Holistic source separation framework including DSP methods and deep learning methods
NIMFA Several flavors of non-negative-matrix factorization
Catchy 21 about 8 years ago Corpus Analysis Tools for Computational Hook Discovery
chord-detection 114 over 1 year ago Algorithms for chord detection and key estimation
Madmom MIR packages with strong focus on beat detection, onset detection and chord recognition
mir_eval Common scores for various MIR tasks. Also includes bss_eval implementation
msaf Music Structure Analysis Framework
librosa General audio and music analysis
Kapre 924 about 1 year ago Keras Audio Preprocessors
TorchAudio 2,561 about 1 month ago PyTorch Audio Loaders
nnAudio 1,036 11 months ago Accelerated audio processing using 1D convolution networks in PyTorch
Music21 Toolkit for Computer-Aided Musicology
Mido Realtime MIDI wrapper
mingus 867 9 months ago Advanced music theory and notation package with MIDI file and playback support
Pretty-MIDI Utility functions for handling MIDI data in a nice/intuitive way
Jupylet 231 12 months ago Subtractive, additive, FM, and sample-based sound synthesis
PYO Realtime audio dsp engine
python-sounddevice 1,069 3 months ago PortAudio wrapper providing realtime audio I/O with NumPy
ReTiSAR 71 about 1 year ago Binarual rendering of streamed or IR-based high-order spherical microphone array signals
TimeSide (Beta) 374 3 months ago high level audio analysis, imaging, transcoding, streaming and labelling
beets Music library manager and tagger
musdb Parse and process the MUSDB18 dataset
medleydb Parse audio + annotations
Soundcloud API 105 12 months ago Wrapper for
Youtube-Downloader Download youtube videos (and the audio)
audiomate 133 over 1 year ago Loading different types of audio datasets
mirdata Common loaders for Music Information Retrieval (MIR) datasets
VamPy Host Interface compiled vamp plugins

Python for Scientific Audio / Tutorials

Whirlwind Tour Of Python fast-paced introduction to Python essentials, aimed at researchers and developers
Introduction to Numpy and Scipy Highly recommended tutorial, covers large parts of the scientific Python ecosystem
Numpy for MATLAB® Users Short overview of equivalent python functions for switchers
MIR Notebooks collection of instructional iPython Notebooks for music information retrieval (MIR)
Selected Topics in Audio Signal Processing 64 over 3 years ago Exercises as iPython notebooks
Live-coding a music synthesizer Live-coding video showing how to use the SoundDevice library to reproduce realistic sounds.

Python for Scientific Audio / Books

Python Data Science Handbook 43,422 7 months ago Jake Vanderplas, Excellent Book and accompanying tutorial notebooks
Fundamentals of Music Processing Meinard Müller, comes with Python exercises

Python for Scientific Audio / Scientific Papers

Python for audio signal processing John C. Glover, Victor Lazzarini and Joseph Timoney, Linux Audio Conference 2011
librosa: Audio and Music Signal Analysis in Python , - Brian McFee, Colin Raffel, Dawen Liang, Daniel P.W. Ellis, Matt McVicar, Eric Battenberg, Oriol Nieto, Scipy 2015
pyannote.audio: neural building blocks for speaker diarization , - Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill, ICASSP 2020

Python for Scientific Audio / Other Resources

Coursera Course Audio Signal Processing, Python based course from UPF of Barcelona and Stanford University
Digital Signal Processing Course Masters Course Material (University of Rostock) with many Python examples
Slack Channel Music Information Retrieval Community

Backlinks from these awesome lists:

More related projects: