MAX-Audio-Classifier
Audio classifier
Identifies sounds in short audio clips using machine learning and PCA transformation
Identify sounds in short audio clips
153 stars
29 watching
53 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
audio-classificationdocker-imagekeras-tensorflowmachine-learningmachine-learning-models
Related projects:
Repository | Description | Stars |
---|---|---|
ibm/max-audio-sample-generator | A tool to generate audio samples based on input commands and lo-fi instrumental music tracks. | 21 |
yongxuustc/dcase2017_task4_cvssp | A system for audio classification and detection using machine learning models | 4 |
drscotthawley/audio-classifier-keras-cnn | An audio classification system using a convolutional neural network to classify audio data | 160 |
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 347 |
ibm/max-audio-embedding-generator | An audio embedding model generator | 56 |
ibm/max-scene-classifier | An image classification model for recognizing physical places and locations | 41 |
ibm/max-sports-video-classifier | This project provides a pre-trained video classification model that categorizes sports videos into their respective sports. | 23 |
ibm/max-speech-to-text-converter | Converts spoken words into text form using speech recognition technology | 76 |
mlachmish/musicgenreclassification | Classify music genre from a 10-second sound stream using a neural network. | 562 |
keunwoochoi/auralisation | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
iver56/audiomentations | Library for audio data augmentation used in machine learning | 1,873 |
cpjku/madmom | A Python audio signal processing library used in music information retrieval tasks. | 1,347 |
microsoft/pengi | An Audio Language Model framework that uses transfer learning to generate text from audio inputs | 290 |
bytedance/salmonn | A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities | 1,053 |
audiojs/audio | A JavaScript class for manipulating audio data | 240 |