MAX-Audio-Embedding-Generator

Embedder

An audio embedding model generator

Generate embedding vectors from audio files

57 stars

26 watching

30 forks

Language: Python

last commit: about 3 years ago

Linked from 1 awesome list

docker-imagemachine-learningmachine-learning-modelstensorflow

Backlinks from these awesome lists:

victorshinya/awesome-ibmcloud

Related projects:

Repository	Description	Stars
ibm/max-audio-sample-generator	A tool to generate audio samples based on input commands and lo-fi instrumental music tracks.	22
ibm/max-audio-classifier	Identifies sounds in short audio clips using machine learning and PCA transformation	154
ibm/max-text-summarizer	Provides a pre-trained text summarization model that can be deployed as a web service in a Docker container	27
ibm/max-image-caption-generator	An image caption generation system utilizing machine learning models and deep neural networks.	84
ibm/max-news-text-generator	Generates English-language text similar to news articles using machine learning and natural language processing techniques.	26
ibm/max-speech-to-text-converter	Converts spoken words into text form using speech recognition technology	76
microsoft/pengi	An Audio Language Model framework that uses transfer learning to generate text from audio inputs	295
superkogito/pydiogment	A Python library for generating multiple audio files based on a starting mono audio file with various effects such as speed change, tone alteration and noise addition.	83
ibm/max-review-text-generator	Generates English-language text similar to Yelp reviews using a Char-RNN model	17
soroushmehr/samplernn_iclr2017	An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture.	537
iver56/audiomentations	Library for audio data augmentation used in machine learning	1,903
ibm/max-fast-neural-style-transfer	A service for generating new images by mixing the content of an input image with the style of another image.	51
oborchers/fast_sentence_embeddings	A Python library for efficiently computing sentence embeddings from large datasets	618
ibm/max-question-answering	An open source question answering system built on top of the BERT model and deployed as a web service in a Docker container.	33
sdatkinson/neural-amp-modeler	Emulates guitar amplifier sound using machine learning models trained on audio data	1,883