MAX-Audio-Classifier

Audio classifier

Identifies sounds in short audio clips using machine learning and PCA transformation

Identify sounds in short audio clips

GitHub

154 stars

29 watching

53 forks

Language: Python

Last commit: about 2 years ago

Linked from 1 awesome list

Topics: audio-classification, docker-image, keras-tensorflow, machine-learning, machine-learning-models

developer.ibm.com/exchanges/models/all/max-audio-classifier/

Backlinks from these awesome lists:

victorshinya/awesome-ibmcloud

Related projects:

Repository	Description	Stars
ibm/max-audio-sample-generator	A tool to generate audio samples based on input commands and lo-fi instrumental music tracks.	22
yongxuustc/dcase2017_task4_cvssp	A system for audio classification and detection using machine learning models	4
drscotthawley/audio-classifier-keras-cnn	An audio classification system using a convolutional neural network to classify audio data	160
soerenab/audiomnist	A deep learning framework for classifying audio signals, with Explainable Artificial Intelligence (AI) techniques that offer insight into the model's decision-making process.	351
ibm/max-audio-embedding-generator	An audio embedding model generator	57
ibm/max-scene-classifier	An image classification model for recognizing physical places and locations	41
ibm/max-sports-video-classifier	A pre-trained video classification model that categorizes sports videos by sport.	23
ibm/max-speech-to-text-converter	Converts spoken words into text form using speech recognition technology	76
mlachmish/musicgenreclassification	Classify music genre from a 10-second sound stream using a neural network.	565
keunwoochoi/auralisation	Reconstructs audio features learned by convolutional neural networks into audible sounds	42
iver56/audiomentations	Library for audio data augmentation used in machine learning	1,903
cpjku/madmom	A Python audio signal processing library used in music information retrieval tasks.	1,366
microsoft/pengi	An Audio Language Model framework that uses transfer learning to generate text from audio inputs	295
bytedance/salmonn	A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities	1,091
audiojs/audio	A JavaScript class for manipulating audio data	240