MAX-Audio-Embedding-Generator

Embedder

An audio embedding model generator

Generate embedding vectors from audio files

GitHub

56 stars
26 watching
30 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

docker-imagemachine-learningmachine-learning-modelstensorflow

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ibm/max-audio-sample-generator A tool to generate audio samples based on input commands and lo-fi instrumental music tracks. 21
ibm/max-audio-classifier Identifies sounds in short audio clips using machine learning and PCA transformation 153
ibm/max-text-summarizer Provides a pre-trained text summarization model that can be deployed as a web service in a Docker container 27
ibm/max-image-caption-generator An image caption generation system utilizing machine learning models and deep neural networks. 84
ibm/max-news-text-generator Generates English-language text similar to news articles using machine learning and natural language processing techniques. 26
ibm/max-speech-to-text-converter Converts spoken words into text form using speech recognition technology 76
microsoft/pengi An Audio Language Model framework that uses transfer learning to generate text from audio inputs 290
superkogito/pydiogment A Python library for generating multiple audio files based on a starting mono audio file with various effects such as speed change, tone alteration and noise addition. 83
ibm/max-review-text-generator Generates English-language text similar to Yelp reviews using a Char-RNN model 16
soroushmehr/samplernn_iclr2017 An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture. 537
iver56/audiomentations Library for audio data augmentation used in machine learning 1,873
ibm/max-fast-neural-style-transfer A service for generating new images by mixing the content of an input image with the style of another image. 50
oborchers/fast_sentence_embeddings A Python library for efficiently computing sentence embeddings from large datasets 618
ibm/max-question-answering An open source question answering system built on top of the BERT model and deployed as a web service in a Docker container. 33
sdatkinson/neural-amp-modeler Emulates guitar amplifier sound using machine learning models trained on audio data 1,857