audio-pretrained-model

Audio models

A collection of pre-trained audio and speech models for various applications

A collection of Audio and Speech pre-trained models.

GitHub

183 stars

5 watching

26 forks

last commit: about 5 years ago

Linked from 1 awesome list

audioaudio-processingcaffekeraskeras-modelskeras-tensorflowmachine-learningmxnetneural-networkpre-trainedpre-trained-modelpre-trainingpython3pytorchpytorch-modelsspeech-recognitionspeech-to-texttensorflowtensorflow-models

Backlinks from these awesome lists:

balavenkatesh3322/cv-pretrained-model

Related projects:

Repository	Description	Stars
balavenkatesh3322/nlp-pretrained-model	A collection of pre-trained natural language processing models	170
microsoft/pengi	An Audio Language Model framework that uses transfer learning to generate text from audio inputs	295
yuangongnd/ltu	An audio and speech large language model implementation with pre-trained models, datasets, and inference options	396
keras-team/keras-hub	A unified interface to various deep learning architectures	818
zfturbo/zf_unet_224_pretrained_model	A pre-trained convolutional neural network model for image segmentation tasks.	214
keunwoochoi/kapre	A Python library providing pre-built audio preprocessing layers for use in machine learning models.	924
zhuiyitechnology/pretrained-models	A collection of pre-trained language models for natural language processing tasks	989
qwenlm/qwen-audio	A multimodal audio language model developed by Alibaba Cloud that supports various tasks and languages	1,515
keunwoochoi/auralisation	Reconstructs audio features learned by convolutional neural networks into audible sounds	42
drscotthawley/audio-classifier-keras-cnn	An audio classification system using a convolutional neural network to classify audio data	160
awni/speech	A PyTorch implementation of end-to-end speech recognition models.	756
bytedance/salmonn	A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities	1,091
qwenlm/qwen2-audio	An audio-language model that can analyze or respond to speech instructions based on audio input	1,306
shubham-shahh/open-source-models	An archive of pre-trained computer vision models.	62
soerenab/audiomnist	This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques.	351