CLAP

Audio embedder

A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining (CLAP).

Contrastive Language-Audio Pretraining
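
To make the term concrete, here is a minimal sketch of a symmetric contrastive objective over paired audio and text embeddings. It is an illustration only, not code from the CLAP repository: the function names, the toy 2-dimensional embeddings, and the temperature value of 0.07 are all assumptions chosen for the example.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def diagonal_cross_entropy(logits):
    """Mean cross-entropy where row i's correct class is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        row_max = max(row)  # subtract the max for numerical stability
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)

def contrastive_loss(audio_embs, text_embs, temperature=0.07):
    """Symmetric contrastive loss; temperature=0.07 is an assumed value."""
    audio = [l2_normalize(v) for v in audio_embs]
    text = [l2_normalize(v) for v in text_embs]
    # Similarity of every audio clip to every caption, scaled by temperature.
    logits = [
        [sum(a * t for a, t in zip(a_vec, t_vec)) / temperature for t_vec in text]
        for a_vec in audio
    ]
    logits_transposed = [list(col) for col in zip(*logits)]
    # Average the audio-to-text and text-to-audio directions.
    return 0.5 * (
        diagonal_cross_entropy(logits) + diagonal_cross_entropy(logits_transposed)
    )

# Hypothetical toy batch: pair i of audio matches pair i of text.
audio_batch = [[1.0, 0.0], [0.0, 1.0]]
matched_text = [[1.0, 0.0], [0.0, 1.0]]
swapped_text = [[0.0, 1.0], [1.0, 0.0]]

loss_matched = contrastive_loss(audio_batch, matched_text)
loss_swapped = contrastive_loss(audio_batch, swapped_text)
```

In this toy batch the loss is small when each clip lines up with its own caption and large when the captions are swapped, which is the signal that pulls matching audio and text embeddings together during training.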

1k stars

29 watching

142 forks

Language: Python

Last commit: 8 months ago

Linked from 1 awesome list

arxiv.org/abs/2211.06687

Backlinks from these awesome lists:

amrzv/awesome-colab-notebooks

Related projects:

Repository	Description	Stars
free-audio/clap	Provides a standard interface for Digital Audio Workstations and audio plugins to work together	1,821
iver56/audiomentations	Library for audio data augmentation used in machine learning	1,903
keunwoochoi/auralisation	Reconstructs audio features learned by convolutional neural networks into audible sounds	42
andrewrk/libsoundio	A C library providing cross-platform audio input and output abstraction	1,958
laion-ai/clip_benchmark	Evaluates and compares the performance of various CLIP-like models on different tasks and datasets	632
larpon/miniaudio	Provides a C-based interface to the miniaudio audio library.	50
asonge/erlaudio	A set of Erlang libraries that provide bindings to PortAudio for audio processing	26
faceperceiver/laion-face	Provides pre-trained face detection and analysis models using large-scale image-text data	281
soerenab/audiomnist	Implements a deep learning framework for classifying audio signals and uses explainable AI techniques to give insight into the model's decisions	351
ddiakopoulos/libnyquist	A C++ library for decoding and playing various audio formats	543
britalmeida/push_to_talk	An add-on for Blender's Sequencer to record audio with a push-to-talk functionality	51
belangeo/cecilia5	An audio processing toolbox with a graphical interface and built-in modules for sound effects and synthesis	228
labsound/labsound	An audio engine that provides a graph-based API for processing and analyzing audio signals	735
laion-ai/laion-datasets	A collection of large datasets for training and testing AI models, designed to improve image-text matching	239
mackron/dr_libs	A collection of single-source audio decoding and loading libraries for C/C++	1,285