MAX-Audio-Classifier

Audio classifier

Identifies sounds in short audio clips using machine learning and PCA transformation

Identify sounds in short audio clips

GitHub

153 stars
29 watching
53 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

audio-classificationdocker-imagekeras-tensorflowmachine-learningmachine-learning-models

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ibm/max-audio-sample-generator A tool to generate audio samples based on input commands and lo-fi instrumental music tracks. 21
yongxuustc/dcase2017_task4_cvssp A system for audio classification and detection using machine learning models 4
drscotthawley/audio-classifier-keras-cnn An audio classification system using a convolutional neural network to classify audio data 160
soerenab/audiomnist This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. 347
ibm/max-audio-embedding-generator An audio embedding model generator 56
ibm/max-scene-classifier An image classification model for recognizing physical places and locations 41
ibm/max-sports-video-classifier This project provides a pre-trained video classification model that categorizes sports videos into their respective sports. 23
ibm/max-speech-to-text-converter Converts spoken words into text form using speech recognition technology 76
mlachmish/musicgenreclassification Classify music genre from a 10-second sound stream using a neural network. 562
keunwoochoi/auralisation Reconstructs audio features learned by convolutional neural networks into audible sounds 42
iver56/audiomentations Library for audio data augmentation used in machine learning 1,873
cpjku/madmom A Python audio signal processing library used in music information retrieval tasks. 1,347
microsoft/pengi An Audio Language Model framework that uses transfer learning to generate text from audio inputs 290
bytedance/salmonn A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities 1,053
audiojs/audio A JavaScript class for manipulating audio data 240