audio-pretrained-model

Audio models

A collection of pre-trained audio and speech models for various applications

A collection of Audio and Speech pre-trained models.

GitHub

182 stars
4 watching
24 forks
last commit: over 4 years ago
Linked from 1 awesome list

audioaudio-processingcaffekeraskeras-modelskeras-tensorflowmachine-learningmxnetneural-networkpre-trainedpre-trained-modelpre-trainingpython3pytorchpytorch-modelsspeech-recognitionspeech-to-texttensorflowtensorflow-models

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
balavenkatesh3322/nlp-pretrained-model A collection of pre-trained natural language processing models 170
microsoft/pengi An Audio Language Model framework that uses transfer learning to generate text from audio inputs 290
yuangongnd/ltu An audio and speech large language model implementation with pre-trained models, datasets, and inference options 385
keras-team/keras-hub Provides pre-trained models and building blocks for natural language processing, computer vision, audio, and multimodal tasks 797
zfturbo/zf_unet_224_pretrained_model A pre-trained convolutional neural network model for image segmentation tasks. 214
keunwoochoi/kapre A Python library providing pre-built audio preprocessing layers for use in machine learning models. 922
zhuiyitechnology/pretrained-models A collection of pre-trained language models for natural language processing tasks 987
qwenlm/qwen-audio A multimodal audio language model developed by Alibaba Cloud that supports various tasks and languages 1,486
keunwoochoi/auralisation Reconstructs audio features learned by convolutional neural networks into audible sounds 42
drscotthawley/audio-classifier-keras-cnn An audio classification system using a convolutional neural network to classify audio data 160
awni/speech A PyTorch implementation of end-to-end speech recognition models. 754
bytedance/salmonn A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities 1,053
qwenlm/qwen2-audio An audio-language model that can analyze or respond to speech instructions based on audio input 1,229
shubham-shahh/open-source-models An archive of pre-trained computer vision models. 61
soerenab/audiomnist This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. 347