audio-pretrained-model
Audio models
A collection of pre-trained audio and speech models for various applications
A collection of Audio and Speech pre-trained models.
182 stars
4 watching
24 forks
last commit: over 4 years ago
Linked from 1 awesome list
audioaudio-processingcaffekeraskeras-modelskeras-tensorflowmachine-learningmxnetneural-networkpre-trainedpre-trained-modelpre-trainingpython3pytorchpytorch-modelsspeech-recognitionspeech-to-texttensorflowtensorflow-models
Related projects:
Repository | Description | Stars |
---|---|---|
balavenkatesh3322/nlp-pretrained-model | A collection of pre-trained natural language processing models | 170 |
microsoft/pengi | An Audio Language Model framework that uses transfer learning to generate text from audio inputs | 290 |
yuangongnd/ltu | An audio and speech large language model implementation with pre-trained models, datasets, and inference options | 385 |
keras-team/keras-hub | Provides pre-trained models and building blocks for natural language processing, computer vision, audio, and multimodal tasks | 797 |
zfturbo/zf_unet_224_pretrained_model | A pre-trained convolutional neural network model for image segmentation tasks. | 214 |
keunwoochoi/kapre | A Python library providing pre-built audio preprocessing layers for use in machine learning models. | 922 |
zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 987 |
qwenlm/qwen-audio | A multimodal audio language model developed by Alibaba Cloud that supports various tasks and languages | 1,486 |
keunwoochoi/auralisation | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
drscotthawley/audio-classifier-keras-cnn | An audio classification system using a convolutional neural network to classify audio data | 160 |
awni/speech | A PyTorch implementation of end-to-end speech recognition models. | 754 |
bytedance/salmonn | A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities | 1,053 |
qwenlm/qwen2-audio | An audio-language model that can analyze or respond to speech instructions based on audio input | 1,229 |
shubham-shahh/open-source-models | An archive of pre-trained computer vision models. | 61 |
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 347 |