wav2vec-2.0
Speech Transcription Engine
Enables speech-to-text transcription using a pre-trained neural network model in MATLAB.
This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.
8 stars
6 watching
4 forks
last commit: over 2 years ago audiodeep-learningmatlabmatlab-deep-learningpretrained-modelsspeech-to-text
Related projects:
Repository | Description | Stars |
---|---|---|
matlab-deep-learning/deepspeech | Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. | 7 |
matlab-deep-learning/pretrained-deeplabv3plus | Provides pre-trained and customizable semantic segmentation model in MATLAB | 23 |
matlab-deep-learning/pretrained-yolo-v4 | Pretrained deep learning object detection model for image analysis in MATLAB | 47 |
veenveenveen/speechsignalprocessingcourse | This project provides a collection of MATLAB source code examples for learning speech signal processing techniques | 66 |
balavenkatesh3322/audio-pretrained-model | A collection of pre-trained audio and speech models for various applications | 182 |
matlab-deep-learning/transformer-models | An implementation of deep learning transformer models in MATLAB | 206 |
vefstathiou/so_word2vec | This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. | 40 |
trekhleb/machine-learning-octave | A repository providing MatLab/Octave examples and explanations of popular machine learning algorithms | 852 |
apache/tvm-vta | A comprehensive hardware design stack for accelerating deep learning models | 254 |
picovoice/rhino | A deep learning-based speech-to-intent engine for on-device voice interaction | 629 |
peak1995/speech-enhancement-dsp | This repository provides MATLAB implementations of traditional speech enhancement techniques including spectral subtraction, Wiener filtering, and Kalman filtering. | 82 |
auspicious3000/contentvec | An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings | 467 |
visenger/handson-ml | Teaches Machine Learning fundamentals in Python using Scikit-Learn and TensorFlow | 6 |
matlab-deep-learning/pretrained-salsanext | Provides pre-trained deep learning model for semantic segmentation of 3D point clouds using SalsaNext architecture | 14 |
huckiyang/voice2series-reprogramming | An approach to reprogramming acoustic models for time series classification using differential mel-spectrograms and adversarial training | 69 |