music-audio-tagging-at-scale-models
Audio tagging research
Research on end-to-end learning for music audio tagging using large datasets and different front-end paradigms.
Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"
149 stars
6 watching
19 forks
Language: Python
last commit: over 5 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| Research code for music auto-tagging using deep learning and feature extraction | 23 |
| This project is an audio classification system trained on the MSD tagging dataset, enabling automatic tagging of music files with relevant genres and styles. | 7 |
| An Audio Language Model framework that uses transfer learning to generate text from audio inputs | 295 |
| A collection of pre-trained audio and speech models for various applications | 183 |
| Identifies sounds in short audio clips using machine learning and PCA transformation | 154 |
| An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. | 343 |
| An Audio Programming Environment with support for AU and DSP plugins | 14 |
| Library for audio data augmentation used in machine learning | 1,903 |
| An audio and speech large language model implementation with pre-trained models, datasets, and inference options | 396 |
| Classifies music genres based on audio features using a deep learning model | 68 |
| A graph object model and sequencing engine for Web Audio processing graphs | 65 |
| This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 351 |
| A Python library for handling audio datasets, providing tools for accessing, manipulating, and preparing data for machine learning tasks. | 133 |
| A Python audio signal processing library used in music information retrieval tasks. | 1,366 |
| Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |