MAX-Audio-Embedding-Generator
Embedder
An audio embedding model generator
Generate embedding vectors from audio files
56 stars
26 watching
30 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
docker-imagemachine-learningmachine-learning-modelstensorflow
Related projects:
Repository | Description | Stars |
---|---|---|
ibm/max-audio-sample-generator | A tool to generate audio samples based on input commands and lo-fi instrumental music tracks. | 21 |
ibm/max-audio-classifier | Identifies sounds in short audio clips using machine learning and PCA transformation | 153 |
ibm/max-text-summarizer | Provides a pre-trained text summarization model that can be deployed as a web service in a Docker container | 27 |
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
ibm/max-news-text-generator | Generates English-language text similar to news articles using machine learning and natural language processing techniques. | 26 |
ibm/max-speech-to-text-converter | Converts spoken words into text form using speech recognition technology | 76 |
microsoft/pengi | An Audio Language Model framework that uses transfer learning to generate text from audio inputs | 290 |
superkogito/pydiogment | A Python library for generating multiple audio files based on a starting mono audio file with various effects such as speed change, tone alteration and noise addition. | 83 |
ibm/max-review-text-generator | Generates English-language text similar to Yelp reviews using a Char-RNN model | 16 |
soroushmehr/samplernn_iclr2017 | An unconditional end-to-end neural audio generation model utilizing a recurrent neural network architecture. | 537 |
iver56/audiomentations | Library for audio data augmentation used in machine learning | 1,873 |
ibm/max-fast-neural-style-transfer | A service for generating new images by mixing the content of an input image with the style of another image. | 50 |
oborchers/fast_sentence_embeddings | A Python library for efficiently computing sentence embeddings from large datasets | 618 |
ibm/max-question-answering | An open source question answering system built on top of the BERT model and deployed as a web service in a Docker container. | 33 |
sdatkinson/neural-amp-modeler | Emulates guitar amplifier sound using machine learning models trained on audio data | 1,857 |