CLAP
Audio embedder
A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining
Contrastive Language-Audio Pretraining
1k stars
29 watching
137 forks
Language: Python
last commit: 5 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
free-audio/clap | Provides a standard interface for Digital Audio Workstations and audio plugins to work together | 1,806 |
iver56/audiomentations | Library for audio data augmentation used in machine learning | 1,873 |
keunwoochoi/auralisation | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
andrewrk/libsoundio | A C library providing cross-platform audio input and output abstraction | 1,946 |
laion-ai/clip_benchmark | Evaluates and compares the performance of various CLIP-like models on different tasks and datasets. | 615 |
larpon/miniaudio | Provides a C-based interface to the miniaudio audio library. | 50 |
asonge/erlaudio | A set of Erlang libraries that provide bindings to PortAudio for audio processing | 26 |
faceperceiver/laion-face | Provides pre-trained face detection and analysis models using large-scale image-text data | 278 |
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 347 |
ddiakopoulos/libnyquist | A C++ library for decoding and playing various audio formats | 539 |
britalmeida/push_to_talk | An add-on for Blender's Sequencer to record audio with a push-to-talk functionality | 49 |
belangeo/cecilia5 | An audio processing toolbox with a graphical interface and built-in modules for sound effects and synthesis. | 226 |
labsound/labsound | An audio engine that provides a graph-based API for processing and analyzing audio signals | 732 |
laion-ai/laion-datasets | A repository containing a collection of large datasets used for training and testing AI models, specifically designed to improve image-text matching capabilities. | 235 |
mackron/dr_libs | A collection of single-source audio decoding and loading libraries for C/C++. | 1,269 |