music-audio-tagging-at-scale-models

Audio tagging research

Research on end-to-end learning for music audio tagging using large datasets and different front-end paradigms.

TensorFlow implementation of the models used in "End-to-end learning for music audio tagging at scale".

149 stars

6 watching

19 forks

Language: Python

Last commit: almost 7 years ago

Related projects:

Repository	Description	Stars
jordipons/eusipco2017	Research code for music auto-tagging using deep learning and feature extraction	23
jongpillee/musictagging_msd	An audio classification system trained on the MSD tagging dataset that automatically tags music files with relevant genres and styles.	7
microsoft/pengi	An Audio Language Model framework that uses transfer learning to generate text from audio inputs	295
balavenkatesh3322/audio-pretrained-model	A collection of pre-trained audio and speech models for various applications	183
ibm/max-audio-classifier	Identifies sounds in short audio clips using machine learning and PCA transformation	154
yuangongnd/whisper-at	An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost.	343
jthorborg/ape	An Audio Programming Environment with support for AU and DSP plugins	14
iver56/audiomentations	Library for audio data augmentation used in machine learning	1,903
yuangongnd/ltu	An audio and speech large language model implementation with pre-trained models, datasets, and inference options	396
kristijanbartol/deep-music-tagger	Classifies music genres based on audio features using a deep learning model	68
soundio/soundstage	A graph object model and sequencing engine for Web Audio processing graphs	65
soerenab/audiomnist	A deep learning framework for classifying audio signals that uses explainable AI techniques to give insight into the model's decisions.	351
ynop/audiomate	A Python library for handling audio datasets, providing tools for accessing, manipulating, and preparing data for machine learning tasks.	133
cpjku/madmom	A Python audio signal processing library used in music information retrieval tasks.	1,366
keunwoochoi/auralisation	Reconstructs audio features learned by convolutional neural networks into audible sounds	42